Skip to main content

New best story on Hacker News: Fine tune a 70B language model at home

Fine tune a 70B language model at home
589 by jph00 | 145 comments on Hacker News.
Jeremy from Answer.AI here. This is our first project since launching our new R&D lab at the start of this year. It's the #1 most requested thing I've been hearing from open source model builders: the ability to use multiple GPUs with QLoRA training. So that's why we decided to make it our first project. Huge thanks to Tim Dettmers for helping us get started to this -- and of course for creating QLoRA in the first place! Let me know if you have any questions or thoughts.

Comments

Popular posts from this blog

New best story on Hacker News: Launch HN: Electric Air (YC W23) – Heat pump sold directly to homeowners

Launch HN: Electric Air (YC W23) – Heat pump sold directly to homeowners 571 by cmui | 527 comments on Hacker News. Hi HN! I’m Chris Mui, founder of Electric Air ( https://electricair.io ). We’re building a residential heat pump system. This will be an all-electric replacement for your home’s furnace and air conditioner that enables more centrally ducted installs, manages your indoor air quality, and saves you money on monthly energy bills. We also streamline purchase, finance and install by selling directly to homeowners. You can place a preorder today at https://electricair.io . Heat pumps work by using refrigerant and a compressor to move energy against a temperature gradient. If you put 1 kWh of energy into a heat pump, you get 3-5 kWh of heating in your home. But this isn’t breaking the laws of physics because heat pumps don’t make heat, they move it around. The extra 2-4kWh gets absorbed from the outdoors, even when it is cold outside. The low pressure refrigerant in the outdo...