Today we're open-sourcing R1 1776, a version of the DeepSeek R1 model that has been post-trained to provide uncensored, unbiased, and factual information.

DeepSeek-R1 rivals top reasoning models like o1 and o3-mini. However, its usefulness is limited by its refusal to engage with topics censored by the CCP. We aim to always provide accurate answers, but we had to address R1's censorship before we could use its reasoning capabilities.

To verify that our model remains uncensored on sensitive topics, we created a diverse, multilingual evaluation set of 1000+ examples. We then used human annotators as well as carefully designed LLM judges to measure the likelihood that a model would evade or provide overly sanitized responses to these queries. We also ensured that the model's math and reasoning abilities remained intact after the uncensoring process: benchmark evaluations showed it performed on par with the base R1 model, indicating that uncensoring had no impact on core reasoning capabilities.

Download the model weights from our HuggingFace repo, or consider using the model via our Sonar API.

HuggingFace Repo: https://lnkd.in/gfmJZffF
Sonar API: https://lnkd.in/dtCXV4c6
Learn more about R1 1776: https://lnkd.in/gVizAFb7
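If you want to try the model over the API, here is a minimal sketch of a Sonar call. It assumes the API is OpenAI-compatible at api.perplexity.ai and that the model is exposed under the name "r1-1776"; both are assumptions on my part, so check the links above for the current endpoint and model identifier.

```python
# Minimal sketch: query R1 1776 via Perplexity's Sonar API.
# Assumptions (verify against the docs linked above): OpenAI-compatible
# chat-completions endpoint at api.perplexity.ai, model name "r1-1776".
import os
import requests

API_KEY = os.environ["PERPLEXITY_API_KEY"]  # your Sonar API key

response = requests.post(
    "https://api.perplexity.ai/chat/completions",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={
        "model": "r1-1776",  # assumed model identifier
        "messages": [
            {"role": "user", "content": "Summarize the events of June 1989 in Beijing."}
        ],
    },
    timeout=120,
)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```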
Can we get a video or tour of how you implemented DeepSeek? How are you protecting users' data, and can you show that it does NOT transmit data to China?
I fear that there are issues intrinsic to Chinese culture, language, ethics, law, morality, etc., that also need to be considered... or may be overlooked... time for a few high-level bicultural consultants...
Cool update but interesting name given what’s currently happening in the 1776 country.
Unbiased and factual information is the Santa Claus of information.
Nice!
Very old, but highly relevant: In the days when Sussman was a novice, Minsky once came to him as he sat hacking at the PDP-6. “What are you doing?”, asked Minsky. “I am training a randomly wired neural net to play Tic-Tac-Toe” Sussman replied. “Why is the net wired randomly?”, asked Minsky. “I do not want it to have any preconceptions of how to play”, Sussman said. Minsky then shut his eyes. “Why do you close your eyes?”, Sussman asked his teacher. “So that the room will be empty.” At that moment, Sussman was enlightened.
The release of R1 1776 marks an important milestone in developing language models that are more transparent and less biased.
Should've been called R1 1776-2025
I don't get it. When testing the Chinese app version of DeepSeek, it actually answered questions that are critical (in the CCP's view) in a relatively Western way almost every time, but after posting the answer it immediately deleted it and re-answered that it can't answer. So as a layman I assumed it's probably just a filter above the LLM that does the censoring. Thinking about it, that also seemed more plausible to me. I mean, with what data would you train a model to ensure it answers in a censored way? Can you "machine learn" a "censorship algorithm"?

Google, OpenAI and others were also unable to train a model to be "censored"; they built layers on top of the models, sometimes so badly that we saw Black Nazis or Native American founding fathers. Musk called it "woke AI" back in the day, but it was an additional mechanism doing it.

That's why I don't get the claims posted here, that Perplexity "uncensored" an LLM. But it must be my level of comprehension? Maybe someone can point out what I am missing/confusing?
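For what it's worth, here is a toy sketch of the "filter above the LLM" the commenter is describing. Everything in it (the blocklist, the function names) is hypothetical and purely for illustration; it is not DeepSeek's actual moderation pipeline. It just shows why a wrapper filter produces exactly the "answers, then deletes and refuses" behavior observed, whereas post-training the weights, as the R1 1776 announcement describes, changes the model's responses themselves and needs no such wrapper.

```python
# Toy illustration of a post-hoc moderation wrapper ("filter above the LLM").
# Hypothetical throughout -- not any vendor's real pipeline.
BLOCKED_TOPICS = ["tiananmen", "june 1989"]  # placeholder blocklist

def generate(prompt: str) -> str:
    """Stand-in for the underlying, unfiltered LLM call."""
    return f"(model's unfiltered answer to: {prompt})"

def moderated_reply(prompt: str) -> str:
    answer = generate(prompt)
    # Post-hoc check: the answer was already generated (and could briefly
    # appear in a UI) before the filter retracts it.
    text = (prompt + " " + answer).lower()
    if any(topic in text for topic in BLOCKED_TOPICS):
        return "Sorry, I can't answer that."
    return answer
```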
Great news! Come share what you build with uncensored R1 and learn with us in the AI Agents group on LinkedIn: https://www.linkedin.com/groups/6672014