Community
2023 was a troublesome year for many on our planet – wars, violence, population displacement, catastrophes, extremism, higher costs of living and poverty. Folks working in our industry were comparatively lucky, some of us invigorated by the exciting GenerativeAI tornado. Just as HFT transformed the vocabulary of capital markets in the 2000s, and digitalization transformed banking and financial services vocabulary in the 2010s, GenAI has brought us a new AI lexicon, at quite some speed too.
With it, we encountered many terms, many barely used in 2022 but which now carry new or very different meanings. I and many others in Financial Services, use them intraday every day. If you’re one of the (un)lucky few who doesn’t, here’s a quick refresher of my favorite three!
Term 1: The Vector Database
The so-called vector database has become core to many enterprise GenAI stacks, as a means of improving the quality of responses to prompts. The alternatives, e.g., “fine tuning” large language models [LLM] without an accompanying database, are expensive, and fraught with risk and compliance overheads. A vector database captures proprietary enterprise information, brings cost effectiveness, and gives comparative control. Financial services firms are certainly in the queue to use vector databases.
Ironically in finance, vectors have for years been integral to the matrix algebra predominant in trading and risk management. Data storage of such “vectors” and matrices have also been around for decades, typically in columnar databases, or as tables or dataframes used in languages like Python (Pandas), R, MATLAB and SAS. When retrieved and used, for example as financial time-series and panel data, in conjunction with techniques like linear and time-series regressions, they drive predictive analytics, anomaly detection, and econometrics. They also help inform backtesting, most notably of trading, portfolio management and risk strategies. While capital markets – the front and middle office - led the matrix algebra charge, increasingly analytic-centric use cases such as marketing, fraud detection, and digitalization generally took data science - and vectors - across the financial organizations.
I was fascinated, therefore, when a former colleague, Dave Bergstein, went to work for a “vector database” start-up in June 2021. His article on Solving Complex Problems with Vector Databases from pre-ChatGPT March 2022 caught my eye because he highlighted very specific vector types – vector embeddings - encoded easily-searched navigable vectors capturing knowledge from unstructured information such as words, images, etc. When ChatGPT launched later that year, vector stores of such embedding types were elevated to being key vehicles of managing semantic meaning. Most commonly, stores are vector databases, of which there are now many. Already, they power financial services and capital markets apps, most commonly natural language processing use cases, e.g., summarizing legal documents and financial reports, or capturing sentiment from social media and news feeds. However, they are also tackling more involved applications, augmenting trading and risk management insights for example, often alongside traditional statistics and machine learning.
Incidentally, the company Dave went to join, Pinecone, became a GenAI unicorn, valued at a healthy $750m. Nice work if you can get it!
Term 2: RAG, aka Retrieval Augmented Generation
RAG was barely a word on anyone’s lips in Spring 2023, at least in the capitalized RAG “Retrieval Augmented Generation” sense of the term. Google search stats for the terms accelerated from about July 2023 and by Autumn/Fall, RAG was everywhere, the predominant pipeline approach by which vector databases help tame Large Language Model “stochastic parrots”. On one hand, RAG encapsulates pipelines to provision enterprise data workflows and on the other pragmatically helps financial firms reduce hallucinations and accommodate internal – and external - risk management and AI compliance processes.
There are many types of RAG pipelines, and they can appear intimidatingly complex. However, think of RAG simply as providing a data pipeline between prompts, your enterprise data, and large language models. To learn more, and see how it affects finance, read my finextra blog or watch this great webcast summarizing the risk management opportunities of RAG. If you begin to implement them at any stage, you’ll likely explore “RAG-friendly” environments like LangChain & LlamaIndex.
Term 3: Hallucinations
I used the term “hallucinations” in my prior section, posing it as a problem solved by RAG, and, in turn, by vector databases. With GenAI, hallucinations are no longer simply triggers of mind-stimulating creativity, such as the drug-inspired Sergeant Pepper’s Lonely Hearts Club Band by the Beatles, or the Beachboys’ Good Vibrations. Neither are they the preserve of shamanic dreaming practised by many peoples, e.g. eastern Siberian Chukchi peoples, nor physical activities which utilize mind-altering techniques, such as yoga, massage and tantric sex. The word “hallucination” also now applies to the failings of LLMs to navigate information the models don’t have access to, or misuse existing information. It became very obvious very quickly that ChatGPT, Bard and similar systems were prone to manufactured “hallucinatory” responses, and these brought risk when ill-informed actions ensued.
Here’s the twist. AI investor Marc Andreessen suggests that while most see hallucinations as bugs, they can be helpful as features when AI is used as a cocreator, a suggester, and a guesser. As a brainstorming aid, their made-up guesses can fuel human creativity. Andreessen for example highlights how lawyers use AI’s “made-up” suggestions during case preparation to imagine novel legal strategies. In Financial services, traders on Wall Street are already use generative AI and vector databases to find trading opportunities — to zig when the masses zag.
Whatever you think of GenAI, it has certainly brought us a delightful new lexicon!
This content is provided by an external author without editing by Finextra. It expresses the views and opinions of the author.
Andrew Ducker Payments Consulting at Icon Solutions
19 December
Jamel Derdour CMO at Transact365 / Nucleus365
17 December
Alex Kreger Founder & CEO at UXDA
16 December
Dan Reid Founder & CTO at Xceptor
Welcome to Finextra. We use cookies to help us to deliver our services. You may change your preferences at our Cookie Centre.
Please read our Privacy Policy.