Industry, education and everyday users are making increasing use of large language models (LLMs) driven in part by the prominence of ChatGPT and other AI tools. The technology is developing at a pace. The analysis of commentators, critics and legislators also gain traction as they evaluate the implications of the technology and seek to influence…More
Monthly Archives: August 2023
Meaningful, imaginative, fluid
The recent escalation of interest in text-based AI coincides with our students completing their project work, dissertations, research degrees, etc. So it’s been a useful period to test the capabilities of platforms such as ChatGPT4 as a tutor — or perhaps as co-tutor. As yet there are no constraints in place that seriously impede its…More
Training your AI
The training data for a GPT model such as ChatGPT4 consists of many (hundreds of billions!) tokens harvested in sequence from various online sources, such as Wikipedia amongst many others. This source data is processed as a continuous stream independent of document boundaries. Tokenize The initial training task is to tokenize the entire corpus, identifying…More
More on automated recollection
Conversational AI platforms such as ChatGPT generate predictions for what should come next after a user inputs a question, statement, paragraph, or other text prompt. In early text prediction software, a simplistic language model might calculate the most likely word to follow a given input like “door,” based on pre-calculated statistical analysis of word co-occurrences…More