Profile Picture
  • All
  • Search
  • Images
  • Videos
    • Shorts
  • Maps
  • News
  • More
    • Shopping
    • Flights
    • Travel
  • Notebook
Report an inappropriate content
Please select one of the options below.
NVIDIA
Intel
Bread
Furkivideoeditor2024
Polar Greenscreen
NVIDIA
GPU
RTX Physics
Bread
Bread
Toaster
Ai Bread
Spreading
Tech Deck
NVIDIA
GeForce Now
NVIDIA NVIDIA
Artificial
Bread
Creative Bread
Service
Let's Talk About Buttered
英伟达官网
NVIDIA
Control Panel
Huge RTX Meme
The
Bread
Artificial Intelligence Knotty Mouth
Bread
Picture in Your Mind Video Live
Animatin Hotdag and
Bread
Leila Hormouzi Build Wealth
GeForce RTX 2070 Super
Snipars
控制器 面板
Bread
Falling Over Video
Nirvanaponygr4m
Uyeda Lab
控制 面板
Invertir En NVIDIA
Ahora Y AMD
  • Length
    AllShort (less than 5 minutes)Medium (5-20 minutes)Long (more than 20 minutes)
  • Date
    AllPast 24 hoursPast weekPast monthPast year
  • Resolution
    AllLower than 360p360p or higher480p or higher720p or higher1080p or higher
  • Source
    All
    Dailymotion
    Vimeo
    Metacafe
    Hulu
    VEVO
    Myspace
    MTV
    CBS
    Fox
    CNN
    MSN
  • Price
    AllFreePaid
  • Clear filters
  • SafeSearch:
  • Moderate
    StrictModerate (default)Off
Filter
    NVIDIA
    Intel
    Bread
    Furkivideoeditor2024
    Polar Greenscreen
    NVIDIA
    GPU
    RTX Physics
    Bread
    Bread
    Toaster
    Ai Bread
    Spreading
    Tech Deck
    NVIDIA
    GeForce Now
    NVIDIA NVIDIA
    Artificial
    Bread
    Creative Bread
    Service
    Let's Talk About Buttered
    英伟达官网
    NVIDIA
    Control Panel
    Huge RTX Meme
    The
    Bread
    Artificial Intelligence Knotty Mouth
    Bread
    Picture in Your Mind Video Live
    Animatin Hotdag and
    Bread
    Leila Hormouzi Build Wealth
    GeForce RTX 2070 Super
    Snipars
    控制器 面板
    Bread
    Falling Over Video
    Nirvanaponygr4m
    Uyeda Lab
    控制 面板
    Invertir En NVIDIA
    Ahora Y AMD
You now convert any LLM into a faster one without retraining from scratch.NVIDIA just did this to their 30B model. Here's the trick:1. Duplicate the model into two copies2. Freeze one copy, it just reads the prompt and remembers context3. Train the other copy to write chunks of text at once instead of one word at a time4. Run them togetherThe frozen copy barely costs anything (it's already trained). The new copy only needed ~8% of the original training data to learn the new trick.Result: 2.4x fa
0:13
You now convert any LLM into a faster one without retraining from scratch.NVIDIA just did this to their 30B model. Here's the trick:1. Duplicate the model into two copies2. Freeze one copy, it just reads the prompt and remembers context3. Train the other copy to write chunks of text at once instead of one word at a time4. Run them togetherThe frozen copy barely costs anything (it's already trained). The new copy only needed ~8% of the original training data to learn the new trick.Result: 2.4x fa
103.4K views1 day ago
x.comLior Alexander
See more
Static thumbnail place holder
More like this
  • Privacy
  • Terms