Reinforcement Learning with Human Feedback (RLHF) for Large Language Models (LLMs)
Post date: October 24, 2024. Post author: Hakeem Abbas. Post categories: deeplearning, humanfeedback, rlhf, techinnovation.

Anomaly Detection Using Machine Learning
Post date: October 21, 2024. Post author: Hakeem Abbas. Post categories: development, llm, machinelearning, python.

How to Use Knowledge Distillation to Create Smaller, Faster LLMs?
Post date: September 30, 2024. Post author: Hakeem Abbas.