Rama Ramakrishnan’s Post

Anthropic released its latest and greatest LLM, Claude 3.5 Sonnet, last week. It has a cool feature called Artifacts where it can run the code it generates and show the results side-by-side (i.e., we don't have to copy-paste the code somewhere else).

People are creating fun interactive apps with it and inspired by something I saw, I asked it to generate a T-test calculator app. My journey is chronicled in the attached images 🙂.

PSA:
- LLMs like Claude are amazing (and the Artifacts feature is very nice) but please check the output, especially for questions where correctness is important. Just because pages of code and verbiage were involved in generating the answer doesn't mean it will be correct.
- The error in my example was glaring but what if it was subtly wrong? So use LLMs in areas where you are knowledgeable (since you will be able to check the output). Be careful in areas where you are a novice.


Thank you for sharing and for the great advice on the output generated by LLMs. Being in the pharma industry, it's crucial that the information we share externally is 100% accurate. Human verification of the content generated by LLMs is essential. However, having a system where experts can verify the output can be a significant asset. This would provide an excellent starting point, saving us long hours of effort.

Fayner Costa

Executive | Entrepreneur | Engineer

1mo

Critical thinking is more valuable than ever nowadays. Unfortunately, I fear that many people take LLM-generated responses at face value and such behavior will provoke dramatic changes in day-to-day debates.

Bastian Fietje

Head of IT and Digitalization at Plus Pack | Driving Digital Transformation in Circular Food Packaging

1mo

Valid point! Tom's Guide conducted an advanced test of two LLMs this week, and Claude was the clear winner. It's incredibly impressive! https://www.tomsguide.com/ai/chatgpt-4o-vs-claude-35-sonnet-which-ai-platform-wins

Dr. Jagreet Kaur

Researcher, Author, Intersection of AI and Quantum and helping Enterprises Towards Responsible AI, AI governance and Data Privacy Journey

1mo

Can Claude 3.5 Sonnet handle complex multi-step workflows?

Shabana Khanam

Sr Data Engineering Manager | Angel Investor | Seeder | AI & LLM Enthusiast | Passionate Technologist

4w

Awesome
