Each input is weighted using a deep neural network where each neuron represents 100s of mathematical calculations enabling a predicted output response. The process is illustrated below (Bahi, M., & Batouche, M. (2018). Deep learning for ligand-based virtual screening in drug discovery. IEEE)
Understanding how Gen AI operates at a surface level, helps to provide context for when to use it and why its outputs must be evaluated. Text generation tools (e.g. ChatGPT) go beyond automated queries to build unique content including images, text, audio, video, and code. Content is drawn from books, Wikipedia, web, news, free articles, and social media as illustrated below. Most of these tools do not draw on subscription library databases except where they are built into the database itself. However, copyrighted works are also being used without permission, attribution, or compensation resulting in multiple lawsuits.
Koenig, A. (2020). The algorithms know me and I know them: Using student journals to uncover algorithmic literacy awareness. Computers and Composition, 58, 102611.
A jargon-free explanation of how AI LLMs work. (2023). Offers visual model.
Representation of words in dimensional space. (A gentle introduction to vector space)
Word vectors are processed through many layers of transformation
ChatGPT-3 uses 96 layers where each word can be represented by up to 12,288 numbers
Each layer adds information to clarify word meaning and better predict the next word