New best story on Hacker News: Fine tune a 70B language model at home

Fine tune a 70B language model at home
589 by jph00 | 145 comments on Hacker News.
Jeremy from Answer.AI here. This is our first project since launching our new R&D lab at the start of this year. It's the #1 most requested thing I've been hearing from open source model builders: the ability to use multiple GPUs with QLoRA training. So that's why we decided to make it our first project. Huge thanks to Tim Dettmers for helping us get started to this -- and of course for creating QLoRA in the first place! Let me know if you have any questions or thoughts.

TechCompressed

Search This Blog

New best story on Hacker News: Fine tune a 70B language model at home

Labels

Comments

Post a Comment

Popular posts from this blog

New best story on Hacker News: Southwest operational meltdown as hundreds of flights canceled or delayed

New best story on Hacker News: PostgreSQL 14

New best story on Hacker News: ChatControl: EU wants to scan all private messages, even in encrypted apps