We released a new open-source model to detect AI-written text with performance rivaling popular closed-source alternatives. It is available on HuggingFace and GitHub for you to try out now! 💡 Why We Built It: Our work with numerous companies crafting high-performing #LLM fine-tuning datasets highlighted the need for a robust AI text detection tool, as lower-quality AI-generated text can degrade fine-tuning dataset quality. Existing solutions fell short of our requirements, so we built our own. 🔍 What is it? We’ve fine-tuned a RoBERTa Large model on a dataset comprising 20,000 LLM-generated and human-written text samples. We focused on achieving a high-quality calibration of the model to get reasonable confidence estimates and good overall performance. We are happy to say that our model achieves a similar level of accuracy as closed-source alternatives while being open to all! Model Access: Access the model and model weights on our Hugging Face page: https://lnkd.in/ebSnpubx Model Serving: Our GitHub repository contains the code to run inference and deploy your HTTP LLM content detection service. It also includes a step-by-step tutorial on integrating this tool with the SuperAnnotate platform to prepare high-quality data for your LLM training: https://lnkd.in/euNXsf5R #SuperAnnotate #GeneratedTextDetection #AI #NLP #FineTuning
Very promising!
--
3wHello, do you have any job opportunities remote?