Charles H. Martin, PhD’s Post


AI Specialist and Distinguished Engineer (NLP & Search). Inventor of weightwatcher.ai . TEDx Speaker. Need help with AI ? #talkToChuck

From the X-verse: many open-source LLMs on the leaderboards appear to be overfit to the leaderboard metrics. Here's an example. The data preparation for WizardCoder uses the HumanEval pass@1 scores to decide whether or not to evolve the dataset further. Optimizing solely for the test set defeats the point of having a test set. https://lnkd.in/gaqyr9Us This is consistent with results from weightwatcher as well, when applied to the LoRA updates directly.
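To see why picking dataset rounds by their test score is a problem, here is a minimal simulation sketch. It is not the WizardCoder pipeline: the number of rounds, the 50% true pass rate, and all function names are hypothetical, and each round's score is modeled as independent sampling noise around the same true rate. The only real figure is the 164 problems in HumanEval. Every round is equally good by construction, yet the round with the best test score looks clearly better than it is.

```python
import random

N_PROBLEMS = 164       # size of HumanEval
TRUE_PASS_RATE = 0.5   # assumption: every round is equally good
N_ROUNDS = 10          # assumption: number of dataset evolution rounds
N_TRIALS = 200         # repetitions used to average out the noise


def pass_at_1(rng, n_problems, p):
    """Simulated pass@1: fraction of problems solved on the first attempt."""
    return sum(rng.random() < p for _ in range(n_problems)) / n_problems


def run_trial(rng):
    # Measure each round's pass@1 (true rate plus sampling noise)
    # and keep the best-looking one, as a test-driven stopping rule would.
    test_scores = [
        pass_at_1(rng, N_PROBLEMS, TRUE_PASS_RATE) for _ in range(N_ROUNDS)
    ]
    best_test = max(test_scores)
    # Score the chosen round again on fresh problems it was not selected on.
    fresh = pass_at_1(rng, N_PROBLEMS, TRUE_PASS_RATE)
    return best_test, fresh


def simulate(seed=0):
    """Return (mean selected test score, mean fresh score) over all trials."""
    rng = random.Random(seed)
    results = [run_trial(rng) for _ in range(N_TRIALS)]
    mean_selected = sum(r[0] for r in results) / N_TRIALS
    mean_fresh = sum(r[1] for r in results) / N_TRIALS
    return mean_selected, mean_fresh


if __name__ == "__main__":
    selected, fresh = simulate()
    print(f"score used for selection: {selected:.3f}")
    print(f"score on fresh problems:  {fresh:.3f}")
```

The gap between the two printed numbers is pure selection bias, which is why a separate validation set should drive the stop/continue decision and the test set should be scored once at the end.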

Shahul Es on X


Charles H. Martin, PhD


11mo

Here's an example from weightwatcher

  • [Image: weightwatcher example, no description available]

Many closed LLMs are doing the same, but we won't even know it (so at least kudos to the Wizard team for openly stating their own bias).
