To learn more, check out the repo: https://github.com/diffbot/diffbot-llm-inference
30.1.2025 03:16To learn more, check out the repo: https://github.com/diffbot/diffbot-llm-inference89,886 developers are building their own Perplexity on-prem with Diffbot LLM —
https://huggingface.co/diffbot/Llama-3.1-Diffbot-Small-2412
30.1.2025 03:1589,886 developers are building their own Perplexity on-prem with Diffbot LLM —https://huggingface.co/diffbot/Llama-3.1-Diffbot-Small-2412The model isn't the moat. Perplexity can be recreated as a side project. #DeepSeek proved this. We proved this.
Download Diffbot LLM. Run it off your own GPU. Congrats, your on-prem #AI is smarter than Perplexity.
30.1.2025 03:15The model isn't the moat. Perplexity can be recreated as a side project. #DeepSeek proved this. We proved this.Download Diffbot LLM. Run...1. Diffbot LLM is a side project. Sonar is Perplexity's entire business.
2. We used the profits from our primary business to train Diffbot LLM. Perplexity raised $915M to train theirs.
3. We open sourced Diffbot LLM. Perplexity chose to keep theirs secret.
30.1.2025 03:141. Diffbot LLM is a side project. Sonar is Perplexity's entire business.2. We used the profits from our primary business to train...Let's be frank here. The score difference is insignificant. And we'll probably play SimpleQA tag for awhile.
What IS significant is how we got here vs. Perplexity.
30.1.2025 03:14Let's be frank here. The score difference is insignificant. And we'll probably play SimpleQA tag for awhile.What IS significant is...While working on my talk last week, Perplexity released Sonar Pro API with a special emphasis on its factuality benchmark F1 score of 0.858, handily beating other internet connected LLMs like Gemini-2.0-flash.
The SimpleQA benchmark they used is open source and LLM judged, so I set it up to run the 4000 question eval on Diffbot LLM overnight and went to bed.
The next morning, we beat Sonar Pro.
30.1.2025 03:14While working on my talk last week, Perplexity released Sonar Pro API with a special emphasis on its factuality benchmark F1 score of 0.858,...Perplexity Sonar Pro API launched last week as the best performing model on factuality.
24 hours later, it's the 2nd best performing model (and it's not because of DeepSeek).
30.1.2025 03:12Perplexity Sonar Pro API launched last week as the best performing model on factuality.24 hours later, it's the 2nd best performing...Our hack night last year with @neo4j
and @weaviate was a blast so we’re kicking off 2025 with another!
Same place, same time. We might even demo Diffbot LLM!
RSVP here: https://lu.ma/hacknight-github-01-22-25
18.1.2025 03:36Our hack night last year with @neo4j and @weaviate was a blast so we’re kicking off 2025 with another!Same place, same time. We might even...A demo is also available at https://diffy.chat.
We look forward to building a future of grounded AI with you all.
9.1.2025 21:53A demo is also available at https://diffy.chat.We look forward to building a future of grounded AI with you all.Diffbot LLM's lighter footprint puts on-prem hosting well within reach.
And we are excited to share that we are releasing Diffbot LLM open source on @github with weights available for download on #Hugginface
https://github.com/diffbot/diffbot-llm-inference
9.1.2025 21:52Diffbot LLM's lighter footprint puts on-prem hosting well within reach. And we are excited to share that we are releasing Diffbot LLM...At Diffbot, we believe that general purpose reasoning will eventually be distilled down to ~1B parameters.
Knowledge is best retrieved at inference, outside of model weights.
9.1.2025 21:50At Diffbot, we believe that general purpose reasoning will eventually be distilled down to ~1B parameters. Knowledge is best retrieved at...The benefit of full source attribution goes two ways.
Not only is credit provided to publishers, every fact is also independently verifiable.
9.1.2025 21:50The benefit of full source attribution goes two ways. Not only is credit provided to publishers, every fact is also independently...Every response from Diffbot LLM draws from the results of real-time expert web searching and queries to the Diffbot Knowledge Graph.
Naturally, this means Diffbot LLM always provides full attribution to its cited sources.
9.1.2025 21:48Every response from Diffbot LLM draws from the results of real-time expert web searching and queries to the Diffbot Knowledge Graph....We launched the world's most grounded #LLM — Diffbot #GraphRAG LLM.
Instead of training on ever larger corpuses of data, Diffbot LLM is trained to be an expert web researcher.
In fact, Diffbot LLM makes zero assumptions about its knowledge of the world.
9.1.2025 21:48We launched the world's most grounded #LLM — Diffbot #GraphRAG LLM.Instead of training on ever larger corpuses of data, Diffbot LLM...Start your hunt: https://app.diffbot.com/get-started
Some notes:
- Searching on the dashboard does not consume credits
- Sign up for a fresh trial if your trial has expired
🪺 We've hidden 16 easter eggs in the Diffbot Knowledge Graph.
🐣 Each egg contains a clue to the next.
👀 Eggs are only visible in entities on the Dashboard (link in comment), not via API.
🔍 This is not school. Google/ChatGPT/query all you want.
🥇 First person to eggstract all 16 easter eggs will win a lifetime personal plan and the swaggiest Diffbot swag.
Your first clue — "The Easter Bunny was first mentioned in an essay by this German physician."
29.3.2024 15:54🪺 We've hidden 16 easter eggs in the Diffbot Knowledge Graph. 🐣 Each egg contains a clue to the next.👀 Eggs are only visible in...Tapping into data in the Diffbot Knowledge Graph can feel a lot like you're doing the research yourself, except in light speed.
Every fact includes a link to a public source. So you're never left wondering if you've broken a privacy law.
Frankly, I just find it fun to see where facts come from. In this LeadGraph screenshot, the KG was able to link Microsoft Teams to adidas by analyzing an employee's job description with NLP.
6.2.2024 22:37Tapping into data in the Diffbot Knowledge Graph can feel a lot like you're doing the research yourself, except in light speed.Every...Every few years or so, someone tries to reinvent Wikipedia.
The honor system works for a bit, but they all inevitably run into trolls and spammers.
The solution to more reliable facts isn't to have a single consolidated knowledge base, but a #knowledgegraph that links a single fact to several sources.
7.6.2023 22:49Every few years or so, someone tries to reinvent Wikipedia. The honor system works for a bit, but they all inevitably run into trolls and...Guess I’ll have to start practicing dictating DQL… #WWDC
5.6.2023 19:13Guess I’ll have to start practicing dictating DQL… #WWDCWhen friends ask me why fact finding AI is necessary. #JustAIThings
30.5.2023 16:36When friends ask me why fact finding AI is necessary. #JustAIThings⬆️
⬇️