SuperAnnotate’s Post

We released a new open-source model for detecting AI-written text, with performance rivaling popular closed-source alternatives. It is available on Hugging Face and GitHub for you to try out now!

💡 Why We Built It: Our work with numerous companies building high-quality #LLM fine-tuning datasets highlighted the need for a robust AI text detection tool, because lower-quality AI-generated text can degrade a fine-tuning dataset. Existing solutions fell short of our requirements, so we built our own.

🔍 What is it? We fine-tuned a RoBERTa Large model on a dataset of 20,000 LLM-generated and human-written text samples. We focused on calibrating the model well, so that its confidence estimates are reasonable, while keeping good overall performance. We are happy to say that our model achieves accuracy comparable to closed-source alternatives while being open to all!

Model Access: The model and its weights are available on our Hugging Face page: https://lnkd.in/ebSnpubx

Model Serving: Our GitHub repository contains the code to run inference and deploy your own HTTP service for LLM content detection. It also includes a step-by-step tutorial on integrating this tool with the SuperAnnotate platform to prepare high-quality data for your LLM training: https://lnkd.in/euNXsf5R

#SuperAnnotate #GeneratedTextDetection #AI #NLP #FineTuning
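For readers wondering what "well-calibrated confidence estimates" means in practice, here is a minimal sketch of one common calibration metric, expected calibration error (ECE), for a binary detector. This is an illustration only: the post does not say which calibration metric or method was used, and the function name, the 0.5 decision threshold, and the 10-bin default are all assumptions made for this example.

```python
from typing import Sequence


def expected_calibration_error(
    probs: Sequence[float], labels: Sequence[int], n_bins: int = 10
) -> float:
    """Illustrative ECE for a binary classifier (hypothetical helper).

    probs  -- predicted probability that each text is AI-generated
    labels -- ground truth, 1 = AI-generated, 0 = human-written
    """
    if len(probs) != len(labels):
        raise ValueError("probs and labels must have the same length")
    if not probs:
        raise ValueError("need at least one sample")

    # Each bin collects (confidence, correct?) pairs for predictions
    # whose confidence falls into that bin.
    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, labels):
        pred = 1 if p >= 0.5 else 0            # assumed decision threshold
        conf = p if pred == 1 else 1.0 - p     # confidence in the predicted class
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, 1.0 if pred == y else 0.0))

    # Weighted average of |accuracy - mean confidence| over non-empty bins.
    total = len(probs)
    ece = 0.0
    for b in bins:
        if not b:
            continue
        avg_conf = sum(c for c, _ in b) / len(b)
        accuracy = sum(a for _, a in b) / len(b)
        ece += (len(b) / total) * abs(accuracy - avg_conf)
    return ece
```

A model that reports 0.9 and is right about 90% of the time scores near zero, while one that reports 0.99 but is right only half the time scores much higher. A lower value means the reported confidence can be taken more literally when filtering a dataset.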

Hello, do you have any job opportunities remote?

Very promising!
