1. ChatGPT is an AI chatbot released by OpenAI in 2022, known for its accuracy and human-like responses, and nicknamed the "Google Search killer". Microsoft has recently announced integration of a ChatGPT-esque OpenAI model into Bing and Edge.

2. ChatGPT reached 100 million users 2 months after launch, and OpenAI now offers a subscription plan. (It took Instagram about 3 months to reach 1 million users.)

3. Emerging use cases: technical (code assist, classification tasks, prompt engineering), non-technical (content generation, education/query response, blue-sky ideation, copywriting, itinerary planning), and more.

4. ChatGPT's hyper-realism poses risks such as fraud/plagiarism, data ownership ambiguity, misinformation spread, model hallucination, and embedded bias in model and training data.

5. Google released "Bard," a rival to ChatGPT powered by LaMDA, the model that made headlines last year when a Google engineer claimed it was sentient.


6. InstructGPT, ChatGPT's sister model, was released a year prior and uses RLHF to provide detailed responses based on instructions.

7. RLHF (reinforcement learning from human feedback) is a training method used in AI alignment to steer systems towards intended goals, such as reducing harmful language (if that is the goal). It builds on traditional prediction-based training, in which the model learns to predict the most likely next token.

8. RLHF significantly improves model performance: in OpenAI's InstructGPT evaluations, human labelers preferred outputs from a 1.3B-parameter RLHF model over those from the 175B-parameter GPT-3.

9. ChatGPT was fine-tuned from a GPT-3.5 series model using prompts and responses covering various use cases, some written by human annotators and some submitted through the OpenAI API.

10. Training process overview: 1) Fine-tune GPT-3.5 in a supervised fashion to obtain the supervised fine-tuning (SFT) model, 2) Train a reward model on human rankings of responses to prompts, 3) Optimize the SFT model (the policy) against the reward model using proximal policy optimization (PPO). Steps 2 and 3 can then be repeated.
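
To make step 2 of the training overview more concrete, here is a minimal sketch of how a ranking of responses could be turned into a training signal for a reward model. It assumes a pairwise loss of the form -log(sigmoid(r_preferred - r_rejected)), averaged over all pairs in a ranking, which is the kind of objective described for InstructGPT. The function names and the example scores are hypothetical and chosen only for illustration; a real reward model would be a neural network producing these scores, and this is not OpenAI's actual code.

```python
import math
from itertools import combinations


def neg_log_sigmoid(diff: float) -> float:
    """Numerically stable -log(sigmoid(diff)), i.e. softplus(-diff)."""
    return max(-diff, 0.0) + math.log1p(math.exp(-abs(diff)))


def pairwise_ranking_loss(scores_best_to_worst: list[float]) -> float:
    """Average pairwise loss for one prompt.

    `scores_best_to_worst` holds the reward model's scalar scores for the
    candidate responses, ordered by the human ranking (best first).
    The loss is small when higher-ranked responses get higher scores.
    """
    # combinations() preserves input order, so in each pair the first
    # score belongs to the response the human ranked higher.
    pairs = list(combinations(scores_best_to_worst, 2))
    if not pairs:
        raise ValueError("need at least two ranked responses")
    total = sum(neg_log_sigmoid(preferred - rejected)
                for preferred, rejected in pairs)
    return total / len(pairs)


# Hypothetical scores for three responses, listed in human-ranked order.
agrees_with_humans = pairwise_ranking_loss([2.0, 1.0, 0.0])
disagrees_with_humans = pairwise_ranking_loss([0.0, 1.0, 2.0])
print(agrees_with_humans < disagrees_with_humans)
```

The loss is lower when the reward model's scores agree with the human ranking, so minimizing it pushes the model to score preferred responses higher. In step 3, the trained reward model's score would then serve as the reward that PPO optimizes the policy against.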

