I was pleasantly surprised by many models of the Deepseek family. Verbose, but in a good way? At least that was my experience. Love to see it mentioned here.
Mistral seems to be the popular choice. I think it’s the most open-source-friendly of the bunch. I will keep function calling in mind as I design some of our models! Thanks for bringing that up.
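For anyone else reading along, here's roughly what a function-calling request looks like. This is a sketch of the common JSON-schema tool format that OpenAI-compatible chat APIs use (Mistral's function calling follows a similar shape) — the field names, model name, and `get_weather` tool here are illustrative assumptions, not any vendor's exact spec:

```python
import json

# Hypothetical tool definition in the JSON-schema style used by
# OpenAI-compatible chat APIs. Mistral's function calling uses a
# similar shape; treat field names here as an assumption, not a spec.
get_weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {
                "city": {"type": "string", "description": "City name"},
            },
            "required": ["city"],
        },
    },
}

# A request payload the model would see; "mistral-small" is a placeholder.
request = {
    "model": "mistral-small",
    "messages": [{"role": "user", "content": "What's the weather in Oslo?"}],
    "tools": [get_weather_tool],
}

print(json.dumps(request, indent=2))
```

The model then replies with a structured tool call (name plus JSON arguments) instead of free text, and your code runs the function and feeds the result back.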
After finally having a chance to test some of the new Llama-2 models, I think you’re right. There’s still some work to be done to get them tuned up… I’m going to dust off some of my notes and get a new index of those other popular gen-1 models out there later this week.
I’m very curious to try out some of these Docker images, too. Thanks for sharing those! I’ll check them out when I can. I could also make a post about them if you feel like featuring some of your work. Just let me know!
What sort of tokens per second are you seeing with your hardware? Mind sharing some notes on what you’re running there? Super curious!
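For comparison, here's roughly how I time throughput on my end. This is just a sketch: `generate` is a stand-in for whatever callable your runtime exposes (llama.cpp bindings, HF `model.generate`, etc.), not any specific library's API, and the dummy generator exists only so the snippet runs on its own:

```python
import time

def tokens_per_second(generate, prompt, n_tokens=128):
    """Rough throughput measure: time a fixed-length generation.

    `generate` is whatever callable your runtime exposes -- a
    stand-in here, not a specific library's API.
    """
    start = time.perf_counter()
    tokens = generate(prompt, n_tokens)
    elapsed = time.perf_counter() - start
    return len(tokens) / elapsed

# Dummy generator so the sketch runs standalone; swap in your model.
def fake_generate(prompt, n):
    return ["tok"] * n

rate = tokens_per_second(fake_generate, "hello", n_tokens=64)
print(f"{rate:.1f} tok/s")
```

One caveat: the first generation after loading is usually slower (warm-up, cache population), so I throw away the first run and average a few after that.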