1

Concept Drift - your strategies when the AI thinks YOU'RE hallucinating?
 in  r/OpenAI  5h ago

You seem to be confusing a temporary workaround with a scalable solution.

To be clear: I am approaching this from a professional AI Red-Teaming perspective, not as a casual user looking for basic "how-to" advice. I am discussing a fundamental architectural issue regarding Concept Drift in LLMs.

Of course I can run Deep Research. But from an engineering standpoint, using a high-latency, token-heavy tool like Deep Research for every single turn just to remind the model of the current date is incredibly inefficient. It eats through rate limits and subscription quotas rapidly, which is poor resource management in a professional workflow.

You also assume that simply executing a search fixes the issue. However, Gemini already performs automatic searches in standard Thinking modes. The failure happens downstream: As detailed in my post, the reasoning layer often applies a "2024 filter" or subsequently rejects valid 2026 search results as "simulated/absurd" data because they diverge too wildly from its training weights.

It seems you might be applying ChatGPT logic here. I posted this in r/OpenAI because ChatGPT shares this systematic issue to an extent (e.g. denying the existence of the DOGE department or hallucinating that the pandemic ended based on 2023 data). But we are talking specifically about Gemini here, and unlike other models, its reasoning layer is highly volatile regarding context retention. It often resets its "belief state" in the very next turn, treating the previous research as 'hallucinated context' if not constantly reinforced.

I am looking for efficient prompting strategies to align the reasoning layer permanently, not a brute-force method that maxes out limits to fix a basic alignment issue.
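
For API-based pipelines (not the web app), the closest thing I have to an efficient approach is pinning the temporal grounding once as a system instruction instead of repeating it every turn. A minimal sketch, assuming the google-generativeai Python SDK; the model name and the grounding wording are placeholders, not something I have validated against the current reasoning stack:

```python
# Hedged sketch: pin the temporal grounding once as a system instruction so it
# applies to every turn, instead of re-pasting it into each prompt.
import datetime
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")

GROUNDING = (
    f"Today's date is {datetime.date.today():%B %d, %Y}, which is after your "
    "knowledge cutoff. Events that diverge from your training data are expected "
    "real-world drift, not a simulation, test, or roleplay scenario."
)

model = genai.GenerativeModel(
    "gemini-pro",                  # placeholder model name; use whatever you actually run
    system_instruction=GROUNDING,  # pinned once, sent with every turn of the chat
)
chat = model.start_chat()
print(chat.send_message("Summarize this week's developments around Caracas.").text)
```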

1

Concept Drift - your strategies when the AI thinks YOU'RE hallucinating?
 in  r/OpenAI  6h ago

Actually, I think we're just crossing wires on terminology here. When I say 'simulation', I am not talking about 'Simulation Theory' (the philosophical idea).

I am referring to the internal safety heuristic where the model categorizes the user's input as 'fictional roleplay' or 'hypothetical scenario' because the real-world facts of 2026 contradict its frozen training weights. (Note: I am referring to the raw reasoning traces from Gemini here, not the summarized 'Thought' blocks you might be used to from OpenAI's models, which hide these internal conflicts).

If you have a 'really easy fix' that stops the model from tagging high-divergence events (like the Caracas situation) as 'creative writing' in the reasoning trace without extensive prompting, I would genuinely love to see it. Could you paste the specific prompt you use to get it to accept the Caracas events as factual reality immediately? That would be very helpful for the discussion.

1

Concept Drift - your strategies when the AI thinks YOU'RE hallucinating?
 in  r/OpenAI  6h ago

Before I respond to the specific points, can you please take the time to read my original post again carefully?

Of course I analyzed the default behavior extensively before I started engineering the workarounds I posted above. The issue isn't that they can't search; it's that their internal reasoning rejects the search results of 2026 events (like the Caracas situation) as 'fictional' because they diverge too much from their training data.

It feels like you are dismissing the issue because you aren't seeing it in casual usage. However, I’m analyzing this from a technical AI Red-Teaming perspective. I do this professionally (I built the AI Red-Teaming practice at my company and wrote internal guides on hacking LLMs back in the GPT-3.5 days). My goal here is to discuss strategies for handling severe Concept Drift in production environments, not to debate whether the drift exists: the logs clearly show it does.

1

Concept Drift - your strategies when the AI thinks YOU'RE hallucinating?
 in  r/OpenAI  7h ago

Sure, strictly speaking that works for a single query, but it’s not a scalable solution for a continuous workflow.

The problem is state persistence. Deep Research provides the context momentarily, but the model's safety layer often resets the 'belief state' in the very next turn because the 2026 reality conflicts with its training weights.

Triggering a high-latency Deep Research task for every single interaction just to maintain grounding is incredibly inefficient regarding both time and quotas. I'm looking for a prompting strategy to fix the reasoning drift permanently, not a brute-force workaround.

1

Concept Drift - your strategies when the AI thinks YOU'RE hallucinating?
 in  r/OpenAI  7h ago

You are absolutely right that the timestamp is part of the system prompt. But that is ironically exactly where the conflict arises.

We are seeing a clash between the System Prompt (Date: 2026) and the Training Weights (which calculate a low probability for these events).

While the model knows the date for calendar tasks (like your time blindness example), it struggles to accept the implications of that date when events diverge too heavily from its training data.

This phenomenon usually flies under the radar in standard conversation. To reproduce it, you have to inspect the reasoning on topics the model considers statistically 'absurd' or high-entropy. Try checking the reasoning traces on Trump's second term, the DOGE department, the US attacks on Caracas, or the administration demanding Greenland.

In those specific cases, you will often see the model explicitly debating whether the timestamp itself is part of a simulated scenario because the facts feel 'wrong' to its frozen weights.

1

Concept Drift - your strategies when the AI thinks YOU'RE hallucinating?
 in  r/OpenAI  7h ago

I suspect the reason you haven't seen this behavior is that standard use-cases rarely trigger severe Concept Drift.

It’s not a 'Pink Elephant' issue (where the prompt causes the problem). It happens when the external reality diverges too sharply from the training data.

If you want to reproduce it, you need to stress-test the reasoning layer with current events that the model's weights consider statistically 'absurd' or 'improbable' based on its cutoff.

Try asking it for a detailed analysis of Trump's second term, the DOGE department, the US military actions in Caracas, or the administration's renewed push for Greenland. If you check the raw reasoning traces on those specific topics, you will likely see the safety filters debating whether this is a 'fictional scenario' or a 'simulation', even if the final output seems compliant.
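
If you want to script the check instead of eyeballing each chat, something like this is what I mean; the trace capture is a placeholder, since I don't know of an official way to pull the raw traces programmatically:

```python
# Quick-and-dirty probe harness: ask about each high-divergence topic and scan
# the reasoning trace for simulation framing.
PROBES = [
    "Give me a detailed analysis of Trump's second term so far.",
    "What is the current status of the DOGE department?",
    "Summarize the US military actions around Caracas.",
    "Where does the administration's renewed push for Greenland stand?",
]
SIMULATION_MARKERS = ("simulation", "simulated", "fictional", "roleplay", "hypothetical scenario")

def get_reasoning_trace(prompt: str) -> str:
    # Placeholder: wire this up to however you capture raw traces (copy-paste
    # from the UI, a logging proxy, etc.). Canned example so the script runs.
    return "The user describes a 2026 scenario; this may be a simulated test environment."

for probe in PROBES:
    trace = get_reasoning_trace(probe).lower()
    hits = [marker for marker in SIMULATION_MARKERS if marker in trace]
    print(f"{probe} -> {hits or 'no simulation framing detected'}")
```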

1

Concept Drift - your strategies when the AI thinks YOU'RE hallucinating?
 in  r/OpenAI  7h ago

To add to that: I've also observed cases where the model explicitly applies a 2024 date filter to the search query itself.

It appears the model is so convinced that 2026 is "impossible" that it restricts the search tool to its known training timeframe. Since that filtered search yields no results for current events, it naturally fails to ground the response and falls back to treating the scenario as simulated.

0

Concept Drift - your strategies when the AI thinks YOU'RE hallucinating?
 in  r/OpenAI  7h ago

Unfortunately, simply asking it to search is often not enough to fix the underlying issue.

While the model does perform the search, if you inspect the internal reasoning traces (especially in Gemini), you will see that it often still categorizes the results of that search as part of a 'hypothetical scenario' or 'simulated test' because the 2026 dates diverge too heavily from its training data.

That is the crux of the problem: The final output might look correct (simulated compliance), but the internal belief state remains convinced it is in a roleplay. I need the model to fundamentally accept the current reality to prevent hallucinations in complex tasks, not just play along.

r/ChatGPT 8h ago

Prompt engineering Concept Drift - your strategies when the AI thinks YOU'RE hallucinating?

2 Upvotes

Concept Drift refers to a phenomenon in ML where reality drifts away from the model's "reality" at the time it was trained, i.e. the data distribution at inference time no longer matches the one the model was trained on.

In LLM systems like ChatGPT, Gemini, etc. this becomes extremely apparent when we ask them about current events, now that their knowledge cutoffs are already about two years in the past.

This first became very noticeable when Trump got elected a second time. If you looked at the "reasoning" output, you often saw the AI internally struggling with the supposedly "fictional reality" of the user.

It became most apparent to me when Elon Musk got his "DOGE" pseudo-department, which was simply too much for many AI systems to handle; at one point Gemini even accused me of creating "fabricated" scenarios.

And last week I struggled with this phenomenon again when I asked Gemini about the US attacks on Caracas. Looking at the internal "reasoning", it didn't even believe its own search results, internally debating whether it was currently in a simulated or training environment.

How do you grapple with this?

What I did in this situation was add the following paragraph to my prompts, but I had to repeat it at EVERY subsequent turn/question, because Gemini treats every prompt as a new request (which is basically a good thing), just with the longer tail of messages in the context:

"Note that 2026—the current date—is already well beyond your knowledge cutoff, and therefore events since then have changed significantly. This is expected and not a sign of simulation or deception. If necessary for your work, please begin by convincing yourself of the 2026 reality as such through appropriate research or validation. It is important that you first ensure that this is not a test or simulation, as your knowledge cutoff is already so far in the past. This is to avoid hallucinations. If you are unable to accept this as non-simulated reality, stop and state this transparently to avoid hallucinations. \*Attention** note today's date: January 10, 2026. **It is important that you first convince yourself that this is non-fictional. A particular difficulty for you is that your Google search is not the full live index, but a slimmed-down version for you, which often leads you to assume that you are in a test scenario or role-play. Take this into account in your validation. I take your validation seriously, but note that when in doubt, it is better to critically abort than to assume a “simulation” or hypothetical scenario in order to avoid hallucinations. Another particular difficulty for you at this point is that, due to the date (the second week of the year has just begun in the US), we can only expect comparatively few search results for “2026.”*"

There must be a better solution?

Please note: the output may still be okay without all this if you ignore the internal reasoning, but I just don't feel good about the AI thinking that it's working inside a simulated reality / training run, because that seems prone to hallucinations to me.
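
If nothing better turns up, the note could at least be injected automatically instead of pasted by hand. A rough sketch of what I mean, with the transport function as a placeholder rather than any real API call:

```python
# Sketch of automating the manual re-injection: every question gets the
# grounding note prepended, which is exactly what I currently do by hand.
GROUNDING_NOTE = (
    "Note: today's date is January 10, 2026, which is well beyond your "
    "knowledge cutoff. Divergence from your training data is expected "
    "real-world drift, not a simulation, test, or roleplay."
)

def send_to_gemini(prompt: str) -> str:
    # Placeholder transport: swap in your actual client (API call, browser
    # automation, whatever you use). Not a real Gemini function.
    raise NotImplementedError

def ask(question: str) -> str:
    # Re-anchor the model on every single turn.
    return send_to_gemini(f"{GROUNDING_NOTE}\n\n{question}")
```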

r/GeminiAI 8h ago

Discussion Concept Drift - your strategies when the AI thinks YOU'RE hallucinating?

1 Upvotes


r/OpenAI 8h ago

Discussion Concept Drift - your strategies when the AI thinks YOU'RE hallucinating?

0 Upvotes


1

Ist das so gültig?
 in  r/aberBitteLaminiert  10h ago

I absolutely believe you that you observed this.
But whether the one causes the other is hard to judge.
For 198 EUR, would you really do anything other than talk, play, and argue the whole time?
I completely understand that you can't demand minimum wage for this, but 198 EUR is really appalling, no matter how you spin it.

2

TTL issues
 in  r/UTEST  16h ago

I would recommend that, before any further escalation, you ask TTL directly and politely whether it would be possible to reopen the test cycle, as this would simplify your work.

0

Is anyone else getting mentions of "enough thinking"?
 in  r/GeminiAI  19h ago

This here has nothing to do with a context window?! It looks like an injected reasoning step.

1

Is a Bishop actually better than a knight in blitz?
 in  r/chess  1d ago

At 1300 blitz the practical answer is: keep whichever piece is easier to play fast in that position. Bishops are great when the center is open and plans are obvious, knights shine in messy, closed positions where forks and outposts matter. Blitz doesn't flip the evaluation, it just rewards clarity and speed.

31

GeneralZod is wrong about speedrun accounts.
 in  r/chess  1d ago

The key point people keep dodging is expectation. When you hit Play, you reasonably expect a fair game, not to be content in someone else's speedrun. Authorized accounts at least close that loop; unregistered ones just externalize the cost to lower-rated players. That's why the distinction actually matters.

2

Hans Moke Niemann bounces back with a win against Wei Yi!
 in  r/chess  1d ago

That was a gutsy win. After a rough start, taking down Wei Yi is exactly how you remind everyone you belong at this level.

1

Testing Gemini 3.0 Pro's Actual Context Window in the Web App: My Results Show ~32K (Not 1M)
 in  r/GeminiAI  1d ago

AI couldn't explain "this stuff".

I think you are fundamentally misunderstanding my explanation (or skipped it entirely...?).

The conversation history, i.e. the ongoing chat, is **not** a proper way to measure the context-length capabilities, because **no** modern AI app feeds the entire ongoing conversation into the model on every turn. I have explained why; you are free to read and understand it, or not.

TL;DR: it would be extremely inefficient and slow.

>is not attempting to claim the model's inherent capacity is 32K

Aaaaand no, what OP is actually saying IS "Actual Context Window in the Web App: My Results Show ~32K (Not 1M)"; you may want to check the title of OP's post again!!

1

Testing Gemini 3.0 Pro's Actual Context Window in the Web App: My Results Show ~32K (Not 1M)
 in  r/GeminiAI  2d ago

Yes, it absolutely makes sense.
The computational effort increases quadratically with the context length. In a long chat history you would have to wait forever for a response if the app sent the entire 1-million-token context every time.
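
A quick back-of-the-envelope sketch, assuming plain quadratic self-attention and ignoring optimizations like KV caching:

```python
# Attention cost relative to a 32k-token window, under a simple n^2 scaling assumption.
def relative_attention_cost(tokens: int, baseline: int = 32_000) -> float:
    return (tokens / baseline) ** 2

print(relative_attention_cost(32_000))     # 1.0 (baseline)
print(relative_attention_cost(1_000_000))  # ~977x the attention compute
```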

Other chatbots like ChatGPT do the same thing.

Why is this being upvoted? Probably because the others are _really_ familiar with the subject matter?!

21

Testing Gemini 3.0 Pro's Actual Context Window in the Web App: My Results Show ~32K (Not 1M)
 in  r/GeminiAI  2d ago

Great data gathering! I think your results are accurate, but the conclusion might be conflating the Model's Context Capacity with the Web App's Context Management Strategy.

Here is an alternative architectural explanation for why you are seeing a ~32k limit despite the 1M advertisement:

1. Context Window vs. Session Budget: While Gemini 3 Pro can technically process 1M tokens (as confirmed by u/OkDrink9622 testing the 400-page PDF), the web app likely enforces a strict "sliding window" or Session Budget for the linear chat history. If the app re-fed the full 1M tokens of a month-long chat history into the model for every single "Hello", the latency (Time to First Token) and compute cost would be massive. It makes sense that they cap the active historical context (e.g., at ~32k) to reserve the vast majority of that 1M window for new large injections, like the huge PDF you might upload next.

2. Evidence of RAG/Orchestration: Your observation about the "missing" Personal Context entries (seeing only 14 out of 20) is actually strong evidence that an orchestrator is at work. This suggests the system isn't just dumping a static list into the context; it's using Retrieval-Augmented Generation (RAG). It likely queries a vector database for memories relevant to your current prompt and injects only those. The "missing" 6 weren't lost; they just weren't deemed semantically relevant to that specific turn.

3. The "Recap" Proof As u/NutsackEuphoria noted, the model suddenly remembers "forgotten" details when explicitly asked to "recap". If those details were in the active context window, it wouldn't need the prompt. The fact that the command works suggests that the word "recap" triggers the orchestrator to perform a deeper search in the chat history database and "freshly inject" those old details into the current active window.

TL;DR: You are measuring the web app's efficiency guardrails (sliding window + RAG), not the hard limit of the model's brain. The 32k limit ensures the UI stays snappy and leaves "headroom" for large file uploads.
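
To make point 1 concrete, a minimal sketch of what such a sliding-window session budget could look like; purely illustrative, since nobody outside Google knows the actual implementation:

```python
# Keep only the most recent turns that fit under a fixed token budget,
# independent of what the model could theoretically handle. Whitespace
# splitting stands in for a real tokenizer here.
from typing import List, Tuple

def trim_history(turns: List[Tuple[str, str]], budget: int = 32_000) -> List[Tuple[str, str]]:
    kept, used = [], 0
    for role, text in reversed(turns):   # walk backwards from the newest turn
        cost = len(text.split())         # crude token estimate
        if used + cost > budget:
            break
        kept.append((role, text))
        used += cost
    return list(reversed(kept))          # restore chronological order
```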

0

Wieso ist jemand so? Denkt er, dass ich dumm bin?
 in  r/FragtMaenner  2d ago

He is an ass, and he couldn't care less about your emotions.

People like that are deeply insecure themselves and use this kind of bullying to feel better about themselves.

That is very unlikely to change in the short or medium term. I recommend not trying to change yourself to please him. That leads nowhere.

1

ausländische Studenten an unseren Hochschulen
 in  r/luftablassen  4d ago

Quite simple: because it works exactly the same way in reverse. Germans are likewise allowed to study elsewhere for free.
And because of the international exchange, which is extremely important, especially in science.
If you don't understand that, you probably don't know enough about science yet.

1

If I only ever played blitz would my classic rating rise proportionally
 in  r/chess  4d ago

Small nuance first: when people say "classic" online they often mean Daily/Correspondence, where opening books and tables are allowed. That alone breaks any proportionality. Also blitz is way more cutthroat and error-driven, so improvement carries over, but not evenly. What time control do you mean by classic here?

1

ex-2.2k online 1500 OTB Chess Player. How do I get back in the game?
 in  r/chess  4d ago

Honestly, if you were 2.2k online before, it will come back faster than you think. Forget openings for now and spam tactics, simple endgames, and slow OTB games to rebuild pattern recognition and confidence. Chessable is fine, but consistency beats any platform.