Helicone (YC W23)

Software Development

The open-source LLM observability platform for developers.

About us

The open-source LangSmith alternative for logging, monitoring, and debugging AI applications. One-line integration: change the base URL to get metrics, prompt management, and more.

🚀 Support us on PH: www.producthunt.com/products/helicone-ai
🌐 Docs: docs.helicone.ai
⭐️ GitHub: github.com/Helicone
🌍 Open stats: us.helicone.ai/open-stats
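As a rough illustration of the base-URL integration described above, the sketch below reroutes a client configuration through an observability proxy without touching the rest of the application. The proxy URL, the `Helicone-Auth` header name, and the `ClientConfig` type are assumptions made for this example and are not taken from this page; check docs.helicone.ai for the actual values.

```python
from dataclasses import dataclass, field

# Assumed values for illustration only; confirm against docs.helicone.ai.
DIRECT_BASE_URL = "https://api.openai.com/v1"
PROXY_BASE_URL = "https://oai.helicone.ai/v1"  # assumption: proxy endpoint


@dataclass
class ClientConfig:
    """Hypothetical stand-in for an LLM client's connection settings."""
    api_key: str
    base_url: str = DIRECT_BASE_URL
    default_headers: dict = field(default_factory=dict)

    def endpoint(self, path: str) -> str:
        # Join the base URL and a relative path with exactly one slash.
        return f"{self.base_url.rstrip('/')}/{path.lstrip('/')}"


def with_observability(config: ClientConfig, proxy_key: str) -> ClientConfig:
    """Return a copy of `config` that sends requests through the proxy."""
    headers = dict(config.default_headers)
    headers["Helicone-Auth"] = f"Bearer {proxy_key}"  # assumed header name
    return ClientConfig(
        api_key=config.api_key,
        base_url=PROXY_BASE_URL,  # the one-line change: swap the base URL
        default_headers=headers,
    )


direct = ClientConfig(api_key="sk-test")
proxied = with_observability(direct, proxy_key="hk-test")
print(proxied.endpoint("chat/completions"))
```

The original configuration is left unchanged, so switching back to direct calls only means dropping the wrapper.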

Website
https://www.helicone.ai/
Industry
Software Development
Company size
2-10 employees
Headquarters
San Francisco
Type
Privately Held
Founded
2023
Specialties
Observability and Monitoring

Updates

  • Huge win for the open-source community! Great work from the Together AI team. To the moon 🚀🫡

    Hassan El Mghari

    DevRel Lead at Together.ai

    Super excited to introduce a project I've been working on for the last 8 weeks: chat.together.ai! It lets you use DeepSeek R1 & other top open source models to do web search, coding, image generation, & image analysis – and it's 100% free.

    Link to the app to try it for free: https://chat.together.ai/
    Full blog post with use cases: https://lnkd.in/ecZyDxdF

    I built this to give everyone an easy way to use the top open source AI models in a great UI for free.

    Here are some cool things you can do in the app:
    1. Chat with DeepSeek R1 and other top OSS models
    2. Do web search with DeepSeek R1 and other top OSS models
    3. Generate code with Qwen Coder 32B
    4. Generate images with Flux Schnell
    5. Analyze images with Qwen 2.5 VL

    Also, here's the tech stack for the app:
    - Together AI for all the AI models
    - Firecrawl for web scraping
    - Brave for web search
    - Neon for my database
    - Helicone (YC W23) for observability
    - Clerk for authentication
    - Next.js for my app framework
    - Vercel for hosting

    Please let me know what you all think! I want to build the best AI chat app in the world and I'm just getting started 🚀

    #ai #opensource #artificialintelligence

  • OpenAI has just released its o1-Pro model to its API, bringing its most powerful reasoning model to developers. Previously only available to ChatGPT Pro subscribers, this model comes with remarkable capabilities and a price tag to match. 💰 At $150/1M input tokens and $600/1M output tokens, it is now officially OpenAI's most expensive model, costing 2x as much as GPT-4.5 for input and 4x as much for output. But is it worth the premium? Here’s our take: https://lnkd.in/gMrHSvQP

  • We just improved the Properties pages with new metrics and a better user experience! 🚀 Properties let you add custom metadata to LLM requests for advanced segmentation and analysis. Tag requests with session IDs, conversation context, or application data to gain deeper insights into your AI application's performance. Check out the Properties tab in Helicone.

  • Helicone (YC W23) reposted this

    Lina Lam

    Designing @ Helicone | Prev. Intuit, Epic Games, GM

    Helicone (YC W23) is growing fast and we need help!! 🔥 We're hiring a DevRel Engineer to join our founding team. You'll work directly with me and Cole Gottdank to build our growing developer community, create content and help shape the future of AI observability. This is an incredible opportunity for someone passionate about AI and open source to make a real impact. If this sounds like you or someone you know, please apply here (IN PERSON - SF): https://lnkd.in/gWMqWjRb (P.S. feel free to message me or Cole if you have any questions!)

  • Today, we are introducing our new Generate API. 🚀 You can now deploy your Editor prompts with a lightweight, modern package: take the prompt ID from the Editor and deploy it anywhere. It supports all Helicone features natively, and prompts stay up to date with the Editor. See the documentation in the comments.

  • Cole Gottdank

    Co-Founder @ Helicone

    What happens when you put W25 founders, YC alumni, and an open bar in one room? We'll find out March 18th. Join us...

    Helicone (YC W23) and Mintlify are hosting a post-batch Happy Hour on March 18th, 5:30-8:00pm. 🍻 Open tab. 🍔 Food included. 👕 Plus merch.

    W25 batchmates: You just crushed Demo Day and survived the fundraising gauntlet. You've earned this break.
    YC alumni: Come share your wisdom (and war stories).

    Join us by RSVPing with the link in comments!

  • 🚀 It's an exciting time in AI Research! OpenAI's new Deep Research tool is turning heads—and for good reason. It's designed for users who need in-depth analysis of complex topics, yielding quite impressive results. 💡 Meanwhile, free alternatives like Perplexity’s Deep Research and Open Deep Research are gaining traction as a response to OpenAI's $200/month price tag. Our latest blog dives into key capabilities of OpenAI Deep Research, and how it compares to more budget-friendly alternatives (Google, Perplexity, other open-source research tools). 🔗 Link in comment. #DeepResearch #LLM #AIDeveloper #AIObservability

  • 🚨 Grok 3 just dropped and it's making big claims about being the "Smartest AI in the world". Here's how it compares with top models right now:

    ✔️ Advanced Reasoning: Early users found Grok 3’s “Thinking” mode solves problems better than many competitors.
    ✔️ Logic: Grok 3 performed well on structured logic problems with proper chains of thought.
    ✔️ Deep Search tool: Found high-quality information on recent events, similar in depth and quality to Perplexity's Deep Research, but not at the level of OpenAI's.
    🆇 Coding Performance: Early users found Grok 3 struggled with complex coding. GPT-4o and Claude provided better solutions.
    🆇 Math & Symbolic Logic: While strong in structured problem-solving, it failed Andrej Karpathy’s Unicode emoji mystery challenge, whereas DeepSeek's R1 performed better.
    🆇 Humor & Creativity: The model lacks any advanced abilities for humor. When asked for jokes, it repeatedly gave variations of the same puns, similar to older LLMs.
    🆇 Fact-checking Issues: Early users found Grok 3 hallucinating citations and even inventing fake URLs, similar to problems seen in other LLMs.

    Overall, Grok 3 has been impressive but not perfect, and it still lags behind OpenAI’s o3 in benchmarks. Detailed comparison in the comments.👇

    #xAI #LLM #AIMonitoring

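The Properties update in the feed above describes tagging LLM requests with custom metadata and then segmenting by it. A minimal sketch of that idea follows, assuming properties travel as request headers with a `Helicone-Property-` prefix; the prefix, both helper functions, and the in-memory log records are hypothetical and exist only for illustration, so confirm the real mechanism in the Helicone docs.

```python
import re
from collections import defaultdict

PROPERTY_PREFIX = "Helicone-Property-"  # assumed header prefix


def property_headers(properties: dict[str, str]) -> dict[str, str]:
    """Turn custom metadata into per-request headers (hypothetical helper)."""
    headers = {}
    for name, value in properties.items():
        # Restrict names to characters that are safe in a header name.
        if not re.fullmatch(r"[A-Za-z0-9_-]+", name):
            raise ValueError(f"invalid property name: {name!r}")
        headers[PROPERTY_PREFIX + name] = str(value)
    return headers


def mean_latency_by_property(requests: list[dict], name: str) -> dict[str, float]:
    """Segment logged requests by one property and average their latency."""
    groups = defaultdict(list)
    for request in requests:
        value = request["properties"].get(name)
        if value is not None:
            groups[value].append(request["latency_ms"])
    return {value: sum(vals) / len(vals) for value, vals in groups.items()}


# Tag a request with a session ID and application name.
print(property_headers({"Session": "abc123", "App": "chat"}))

# Made-up log records, used only to show the segmentation step.
logs = [
    {"properties": {"App": "chat"}, "latency_ms": 100},
    {"properties": {"App": "chat"}, "latency_ms": 300},
    {"properties": {"App": "search"}, "latency_ms": 50},
]
print(mean_latency_by_property(logs, "App"))
```

Keeping the metadata in headers means the request body sent to the model provider stays unchanged.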
