"ChatGPT is a computer program that can chat with you like a human and understand the meaning behind your words. You can interact with ChatGPT by typing in questions or statements, and it will respond with its best understanding of what you're saying. It's important to note that while ChatGPT is highly advanced, it's still a machine, and its responses may not always be perfect." Whilst ChatGPT is very efficient at describing itself clearly and honestly, it is important to have a real life expert explain the pros and cons of this game-changing AI technology.
With a Master’s degree in Big Data and Business Analytics, Visiting Lecturer Nicolas Maréchal teaches the fundamentals of Computational Thinking and Python Programming at EHL. Today, he guides us through the future AI landscape.
ChatGPT (Chat Generative Pre-Trained Transformer) is an AI language model (also referred to as an AI-powered chatbot and coding assistant) launched by OpenAI in November 2022. In just two months, ChatGPT reached over 100 million monthly active users, estimated to be the fastest-growing internet service ever. To put it in perspective, it took three and a half years for WhatsApp and nine months for TikTok to achieve the same performance.
ChatGPT is a state-of-the-art Natural Language Processing (NLP) model, based on the Transformer technology (introduced in the paper "Attention is all you need", 2017) that is designed to generate human-like text responses to questions and prompts. It has been built on top of GPT-3 (Generative Pre-Trained Transformer 3), and more precisely on top of “Davinci”, another language model developed by OpenAI in May 2020.
ChatGPT however is not the first of its kind. BERT (Bidirectional encoder Representations from Transformers) developed by Google in 2018, was the first very successful transformer-based language model.
Amongst others, ChatGPT can be used for natural language processing (NLP) oriented tasks such as text generation, language translation and text summarization. It can also be used for specific applications such as question answering, sentiment analysis and text classification. Overall, ChatGPT is a powerful tool for any application that involves generating or understanding human language.
OpenAI’s recent $10 billion deal with Microsoft, and Google’s new chatbot ‘Bard’ (powered by LaMDA), unveil a new chapter in the Tech giant’s AI search war, and more breakthroughs are expected to quickly emerge in this field. We have an exciting time ahead of us!
ChatGPT works by training on large amounts of text data (175 billion parameters gathered from the internet and other sources) to learn patterns and relationships between words and phrases in the language, and then uses this knowledge to generate human-like responses to a given input. When a user provides input to ChatGPT, the model processes it and generates a response based on the most likely sequence of words, given the input and its training.
One major difference between GPT-3 and ChatGPT is the use of reinforcement learning from human feedback (RLHF), whose process can be divided into three parts: 1) Supervised fine-tuning (SFT model), 2) Train reward model (RM), 3) fine-tuning the SFT model with reinforcement learning (technical details in appendix).
Reinforcement learning (RL) is very useful, as it specializes in a model's predictions on human preferences, optimizing weights on metrics that better reflect human judgment than aseptic loss functions. RL also allows the use of otherwise unusable metrics (because not differentiable), such as human preferences in the form of reward functions.
ChatGPT, like other language models, also uses the principle of "memorization" to generate text. This means that the model has seen a vast amount of text data during its training process and has memorized the patterns and relationships between words and phrases in that data. When the model generates text, it uses the information memorized about the language to make predictions about the next word/token or sequence of words based on the specific input provided. The model can then generate a wide variety of outputs, (such as one-word answers or complex paragraphs), based on the information it has stored in its memory.
It's essential to note that the model doesn't understand the meaning of the text in the same way a human would. Instead, it relies on statistical associations between words and phrases to generate text. As an illustration, consider the sentence: "The Roman Empire [MASK] with the reign of Augustus."
A probabilistic model might predict "began" or “ended”, as both words score a high likelihood of occurrence, but have a strictly different meaning. Hence, predicting the next word (or a masked word) in a text sequence, may not necessarily be learning some higher-level representations of its meaning.
The value of ChatGPT to the public resides in its ability to generate human-like text based on the user’s input, making it very useful for a variety of applications and use cases. Some examples, leveraging ChatGPT’s API or its interface, include:
4. ChatGPT and education
ChatGPT and other advanced language models have proven to have a significant impact on education, providing new and innovative ways to personalize learning, generate content and support students as they learn.
But there are also some potentially negative impacts to consider:
While politicians are raising concerns about the educational public systems lacking technological maturity (not even AI), some argue that “The College Essay is Dead”, illustrating how big the impact of ChatGPT is on education and why it cannot be ignored.
Since the past decade, Generative Pre-Trained Transformers (GPT) have been rapidly advancing and many more breakthroughs are expected in this field. Some potential directions on the technology development include:
Note: GPT-4 is not an officially announced or released product by OpenAI. It is a reference to a potential future iteration of the GPT language model series (which is currently at version 3), which is likely to be trained on many more parameters, using more accurate and robust models.
GPTZero (built by Edward Tian at Princeton University) is a recently developed tool designed to detect AI-generated text. It uses machine learning algorithms to analyze text and determine if it was written by a human or generated by a language model such as ChatGPT. OpenAI themselves also launched a similar classifier trained to identify AI plagiarism.
Amongst others, perplexity is a metric used by GPTZero to evaluate the quality of a language model's predictions. It measures how well a model predicts the likelihood of a given sequence of words (formally the average per-word log probability). Human-generated texts tend to have a higher average perplexity and higher burstiness (perplexity is not uniform across sentences). Hence, it is possible to detect if a human or a machine created specific portions of texts.
Whether or not GPTZero can live up to its promise to detect AI-produced written text depends on a variety of factors, including the accuracy of the underlying machine learning algorithms and the quality of the training data used to develop the model. In general, tools like GPTZero can be effective at detecting AI-generated text in certain cases, but they are not ‘foolproof’ and can sometimes produce false positive or false negative results.
It is important to keep in mind that the field of AI-generated text detection is rapidly evolving, and new and more advanced tools are always being developed. The effectiveness of GPTZero and other similar tools may certainly improve in the future as the technology continues to advance.
One major difference between GPT-3 and ChatGPT is the use of reinforcement learning from human feedback (RLHF), whose process can be divided into three parts: