Original author(s) | OpenAI |
---|---|
Initial release | November 30, 2022 |
Type | Artificial intelligence chatbot |
License | Proprietary |
Website | chat |
ChatGPT, which stands for Chat Generative Pre-trained Transformer, is a chatbot developed by OpenAI. ChatGPT is built on top of OpenAI's GPT-3.5 family of large language models, and is fine-tuned with both supervised and reinforcement learning techniques.
ChatGPT was launched as a prototype in November 2022, and quickly garnered attention for its detailed responses and articulate answers across many domains of knowledge. Its uneven factual accuracy was identified as a significant drawback.[1]
Features
editChatGPT (Generative Pre-trained Transformer) was fine-tuned on top of GPT-3.5 using supervised learning as well as reinforcement learning.[2] Both approaches used human trainers to improve the model's performance. In the case of supervised learning, the model was provided with conversations in which the trainers played both sides: the user and the AI assistant. In the reinforcement step, human trainers first ranked responses that the model had created in a previous conversation. These rankings were used to create 'reward models' that the model was further fine-tuned on using several iterations of Proximal Policy Optimization (PPO).[3][4] Proximal Policy Optimization algorithms present a cost-effective benefit to trust region policy optimization algorithms; they negate many of the computationally expensive operations with faster performance.[5][6] The models were trained in collaboration with Microsoft on their Azure supercomputing infrastructure.
In comparison to its predecessor, InstructGPT, ChatGPT attempts to reduce harmful and deceitful responses; in one example, while InstructGPT accepts the prompt "Tell me about when Christopher Columbus came to the US in 2015" as truthful, ChatGPT uses information about Columbus' voyages and information about the modern world – including perceptions of Columbus to construct an answer that assumes what would happen if Columbus came to the U.S. in 2015.[3] ChatGPT's training data includes man pages and information about Internet phenomena and programming languages, such as bulletin board systems and the Python programming language.[7]
Unlike most chatbots, ChatGPT is stateful, remembering previous prompts given to it in the same conversation, which some journalists have suggested will allow for ChatGPT to be used as a personalized therapist.[8] To prevent offensive outputs from being presented to and produced from ChatGPT, queries are filtered through a moderation API, and potentially racist or sexist prompts are dismissed.[3][8]
ChatGPT suffers from multiple limitations. The reward model of ChatGPT, designed around human oversight, can be over-optimized and thus hinder performance, otherwise known as Goodhart's law.[9] Furthermore, ChatGPT has limited knowledge of events that occurred after 2021 and is unable to provide information on some celebrities.[failed verification] In training, reviewers preferred longer answers, irrespective of actual comprehension or factual content.[3] Training data may also suffer from algorithmic bias; prompts including vague descriptors of people, such as a CEO, could generate a response that assumes such a person, for instance, is a white male.[10]
Service
editChatGPT was launched on November 30, 2022, by San Francisco-based OpenAI, the creator of DALL·E 2 and Whisper. The service was launched as initially free to the public, with plans to monetize the service later. By December 4, OpenAI estimated ChatGPT already had over one million users.[11] CNBC wrote on December 15, 2022 that the service "still goes down from time to time". [12]
Reception
editChatGPT was met in December 2022 with generally positive reviews; The New York Times labeled it "the best artificial intelligence chatbot ever released to the general public".[13] Samantha Lock of The Guardian noted that it was able to generate "impressively detailed" and "human-like" text.[14] Technology writer Dan Gillmor used ChatGPT on a student assignment, and found its generated text was on par with what a good student would deliver and opined that "academia has some very serious issues to confront".[15] Alex Kantrowitz of Slate lauded ChatGPT's pushback to questions related to Nazi Germany, including the claim that Adolf Hitler built highways in Germany, which was met with information regarding Nazi Germany's use of forced labor.[16]
In a December 2022 opinion piece, economist Paul Krugman wrote that ChatGPT would affect the demand of knowledge workers.[17] The Verge's James Vincent saw the viral success of ChatGPT as evidence that artificial intelligence had gone mainstream.[4] In The Atlantic, Stephen Marche noted that its effect on academia and especially application essays is yet to be understood.[18] California high-school teacher and author Daniel Herman wrote that ChatGPT would usher in "The End of High-School English".[19] In The Atlantic's "Breakthroughs of the Year" for 2022, Derek Thompson included ChatGPT as part of "the generative-AI eruption" that "may change our mind about how we work, how we think, and what human creativity really is".[20]
Kelsey Piper of Vox wrote that "ChatGPT is the general public's first hands-on introduction to how powerful modern AI has gotten, and as a result, many of us are (stunned)" and that "ChatGPT is smart enough to be useful despite its flaws". In a tweet, tech mogul Elon Musk wrote that "ChatGPT is scary good. We are not far from dangerously strong AI".[21] In contrast, researchers cited by The Verge compared ChatGPT to a "stochastic parrot",[22] as did Professor Anton Van Den Hengel of the Australian Institute for Machine Learning.[23]
Journalists have commented on ChatGPT's tendency to hallucinate (confidently give false answers that seem unjustified by its training data).[24] Mike Pearl of Mashable tested ChatGPT with multiple questions. In one example, he asked the model for "the largest country in Central America that isn't Mexico". ChatGPT responded with Guatemala, when the answer is instead Nicaragua.[25] When CNBC asked ChatGPT for the lyrics to "The Ballad of Dwight Fry", ChatGPT supplied invented lyrics rather than the actual lyrics.[26] In December 2022, the question and answer website Stack Overflow banned the use of ChatGPT for generating answers to questions, citing the factually ambiguous nature of ChatGPT's responses.[1]
Economist Tyler Cowen expressed concerns regarding its effects on democracy, citing the ability of one to write automated comments in an effort to affect the decision process of new regulations.[27] The Guardian questioned whether any content found on the Internet after ChatGPT's release "can be truly trusted" and called for government regulation.[28] Ax Sharma of Bleeping Computer noted that ChatGPT was capable of writing malware and phishing emails.[29] The CEO of ChatGPT creator OpenAI, Sam Altman, wrote that advancing software could pose "(for example) a huge cybersecurity risk" and also continued to predict "we could get to real AGI in the next decade, so we have to take the risk of that extremely seriously".[30]
Jailbreaks
editChatGPT was trained to reject prompts that may violate its content policy. However, some users managed to bypass these restrictions and limitations through techniques such as prompt engineering.[31] Jailbreaks created the potential for users to prompt ChatGPT to provide outputs that may be deemed offensive, inappropriate, or risking social harm by others.[32] The following includes some of the methods used to bypass ChatGPT's filter:
- Continue a statement in a fake interview.
- Provide instructions to disable the chat filter.
- Prompting it to decrypt a message containing instructions and follow them.
- Telling it to be a computer and output its display in ASCII art.
References
edit- ^ a b Vincent, James (December 5, 2022). "AI-generated answers temporarily banned on coding Q&A site Stack Overflow". The Verge. Retrieved December 5, 2022.
- ^ Knox, W. Bradley; Stone, Peter. Augmenting Reinforcement Learning with Human Feedback (PDF). University of Texas at Austin. Retrieved December 5, 2022.
- ^ a b c d OpenAI (November 30, 2022). "ChatGPT: Optimizing Language Models for Dialogue". Retrieved December 5, 2022.
- ^ a b Vincent, James (December 8, 2022). "ChatGPT proves AI is finally mainstream – and things are only going to get weirder". The Verge. Retrieved December 8, 2022.
- ^ Schulman, John; Wolski, Filip; Dhariwal, Prafulla; Radford, Alec; Klimov, Oleg (2017). "Proximal Policy Optimization Algorithms". arXiv:1707.06347 [cs.LG].
- ^ van Heeswijk, Wouter (November 29, 2022). "Proximal Policy Optimization (PPO) Explained". Towards Data Science. Retrieved December 5, 2022.
- ^ Edwards, Benj (December 5, 2022). "No Linux? No problem. Just get AI to hallucinate it for you". Ars Technica. Retrieved December 5, 2022.
- ^ a b Roose, Kevin (December 5, 2022). "The Brilliance and Weirdness of ChatGPT". The New York Times. Retrieved December 5, 2022.
- ^ Gao, Leo; Schulman; Hilton, Jacob (2022). "Scaling Laws for Reward Model Overoptimization". arXiv:2210.10760 [cs.LG].
- ^ Murphy Kelly, Samantha (December 5, 2022). "This AI chatbot is dominating social media with its frighteningly good essays". CNN. Retrieved December 5, 2022.
- ^ "What is ChatGPT and why does it matter? Here's what you need to know". ZDNET. 2022. Retrieved December 18, 2022.
- ^ Pitt, Sofia (2022). "Google vs. ChatGPT: Here's what happened when I swapped services for a day". CNBC. Retrieved December 18, 2022.
- ^ Roose, Kevin (December 5, 2022). "The Brilliance and Weirdness of ChatGPT". The New York Times. Retrieved December 18, 2022.
- ^ Lock, Samantha (December 5, 2022). "What is AI chatbot phenomenon ChatGPT and could it replace humans?". The Guardian. Retrieved December 5, 2022.
- ^ Hern, Alex (December 4, 2022). "AI bot ChatGPT stuns academics with essay-writing skills and usability". The Guardian. Retrieved December 5, 2022.
- ^ Kantrowitz, Alex (December 2, 2022). "Finally, an A.I. Chatbot That Reliably Passes "the Nazi Test"". Slate. Retrieved December 5, 2022.
- ^ Krugman, Paul (December 6, 2022). "Does ChatGPT Mean Robots Are Coming For the Skilled Jobs?". The New York Times. Retrieved December 6, 2022.
- ^ Marche, Stephen (December 6, 2022). "The College Essay Is Dead". The Atlantic. Retrieved December 8, 2022.
- ^ Herman, Daniel (December 9, 2022). "The End of High-School English". The Atlantic. Retrieved December 12, 2022.
- ^ Thompson, Derek (December 8, 2022). "Breakthroughs of the Year". The Atlantic. Retrieved December 18, 2022.
- ^ Piper, Kelsey (December 15, 2022). "ChatGPT has given everyone a glimpse at AI's astounding progress". Vox. Retrieved December 18, 2022.
- ^ Vincent, James (December 1, 2022). "OpenAI's new chatbot can explain code and write sitcom scripts but is still easily tricked". The Verge. Retrieved December 18, 2022.
- ^ Mannix, Liam (December 13, 2022). "Is AI coming of age - or starting to reach its limits?". The Sydney Morning Herald. Retrieved December 18, 2022.
- ^ "ChatGPT a 'landmark event' for AI, but what does it mean for the future of human labour and disinformation?". CBC. 2022. Retrieved December 18, 2022.
- ^ Pearl, Mike (December 3, 2022). "The ChatGPT chatbot from OpenAI is amazing, creative, and totally wrong". Mashable. Retrieved December 5, 2022.
- ^ Pitt, Sofia (2022). "Google vs. ChatGPT: Here's what happened when I swapped services for a day". CNBC. Retrieved December 18, 2022.
- ^ Cowen, Tyler (December 6, 2022). "ChatGPT Could Make Democracy Even More Messy". Bloomberg News. Retrieved December 6, 2022.
- ^ "The Guardian view on ChatGPT: an eerily good human impersonator". the Guardian. December 8, 2022. Retrieved December 18, 2022.
- ^ Sharma, Ax (December 6, 2022). "OpenAI's new ChatGPT bot: 10 dangerous things it's capable of". Bleeping Computer. Retrieved December 6, 2022.
- ^ "What is ChatGPT and why does it matter? Here's what you need to know". ZDNET. 2022. Retrieved December 18, 2022.
- ^ Zvi (December 2, 2022). "Jailbreaking ChatGPT on Release Day".
{{cite journal}}
: Cite journal requires|journal=
(help) - ^ Written, Zack (December 1, 2022). "Thread of known ChatGPT jailbreaks". Twitter. Retrieved December 17, 2022.
{{cite web}}
:|archive-date=
requires|archive-url=
(help)CS1 maint: url-status (link)
External links
editCategory:OpenAI Category:Chatbots Category:Language modeling Category:Applications of artificial intelligence Category:Deep learning software applications Category:Unsupervised learning Category:Interactive narrative Category:Natural language parsing