Llama 3 degrades much more than Llama 2 when quantized 🤔 | New LLM Paper Finds out

Published: 12 May 2024
on channel: Rohan-Paul-AI

512

Paper : https://arxiv.org/abs/2404.14047

🐦 Connect with me in TWITTER:   / rohanpaul_ai

Llama 3 degrades much more than Llama 2 when quantized. 🤔

📌 Most possible reason because Llama 3, trained on a record 15T tokens, captures extremely nuanced data relationships, utilizing even the minutest decimals in BF16 precision fully. Making it more sensitive to quantization degradation.

📌 So sensitive that even the smallest decimal points of each parameter offered by BF16 precision were filled and had a purpose. Other LLMs were trained for far less (2T), and thus did not have time to saturate smaller precision ranges of the parameters like Llama-3 did, and thus are not affected by quantization as much.

----

Checkout the MASSIVELY UPGRADED 2nd Edition of my Book (with 1300+ pages of Dense Python Knowledge) 🐍🔥

Covering 350+ Python 🐍 Core concepts ( 1300+ pages ) 🚀

🟠 Book Link - https://rohanpaul.gumroad.com/l/pytho...

-----------------

Hi, I am a Machine Learning Engineer | Kaggle Master. Connect with me on 🐦 TWITTER:   / rohanpaul_ai   - for daily in-depth coverage of Large Language Model bits

----------------

You can find me here:

**********************************************

🐦 TWITTER:   / rohanpaul_ai
👨🏻‍💼 LINKEDIN:   / rohan-paul-ai
👨‍🔧 Kaggle: https://www.kaggle.com/paulrohan2020
👨‍💻 GITHUB: https://github.com/rohan-paul
🧑‍🦰 Facebook :   / rohan.paul.562
📸 Instagram:   / rohan_paul_2020

**********************************************

Other Playlist you might like 👇

🟠 MachineLearning & DeepLearning Concepts & interview Question Playlist - https://bit.ly/380eYDj

🟠 ComputerVision / DeepLearning Algorithms Implementation Playlist - https://bit.ly/36jEvpI

🟠 DataScience | MachineLearning Projects Implementation Playlist - https://bit.ly/39MEigt

🟠 Natural Language Processing Playlist : https://bit.ly/3P6r2CL

----------------------

#LLM #Largelanguagemodels #Llama3 #LLMfinetuning #opensource #NLP #ArtificialIntelligence #datascience #textprocessing #deeplearning #deeplearningai #100daysofmlcode #neuralnetworks #datascience #generativeai #generativemodels #OpenAI #GPT #GPT3 #GPT4 #chatgpt #genai

Watch video Llama 3 degrades much more than Llama 2 when quantized 🤔 | New LLM Paper Finds out online, duration hours minute second in high quality that is uploaded to the channel Rohan-Paul-AI 12 May 2024. Share the link to the video on social media so that your subscribers and friends will also watch this video. This video clip has been viewed 512 times and liked it 20 visitors.

1,377