tracking: context limit issues with @-mentioned files #3455

abeatrix · 2024-03-18T15:46:41Z

This is an issue spinned off from #2965 where the original goal was to display warnings for the @-mentioning massive files

Context: https://sourcegraph.slack.com/archives/C05AGQYD528/p1710331429302729

Current state

Here is the current state of @-mentioning massive files summarized by @chillatom:

The file size "check" that we use to display the warning is not the real token limit applied by the system. It is an approximation. This leads to the odd behavior where we trigger a warning, but you can still include the file or you can successfully copy and paste the file contents into the chat message.
The warning check is computed per file, which means that if file A & B are both individually under the estimated limit, we trigger no warning, even if A+B is above the limit.
If the file is actually too large to be included, but the user submits anyway (e.g. vscode/src/completions/logger.ts which is 860 LOC and ~6,700 tokens. It appears that we silently exclude the file from context and Cody hallucinates. 🔴
If the file triggers the warning, but isn't too large to be included e.g. vscode/src/local-context/symf.ts which is 641 LOC and 4,867 tokens, I see the content actually used as context and referenced correctly.

At a minimum it feels like we should do a few things

Tie our warning to the actual operation of the product. If we say it's too large, the file should not be able to be input
We should not silently exclude a file that has been explicitly @ mentioned
Consider the case of multiple @ mentioned files
Prioritize @ mention files over other fetched context (not sure if we do this today) as to avoid silently removing the file explicitly referenced

Over a longer term, I think we should explore

Expanding the context windows, especially for some of the flagship models (I'll be writing a proposal here)
Consider summarization or proposition extraction from the file

Design Tasks

Design Tasks:

Give feedback

We should not silently exclude a file that has been explicitly @ mentioned - figma
Options

Engineering Tasks

Engineering Tasks:

Give feedback

Tie our warning to the actual operation of the product. If we say it's too large, the file should not be able to be input
We should not silently exclude a file that has been explicitly @ mentioned #3522

cody
Prioritize @ mention files over other fetched context (not sure if we do this today) as to avoid silently removing the file explicitly referenced
Bug: Super large message in Cody is completely ignored silently with no context and an added fake preamble #3364

bug clients/jetbrains cody
Chat: fix at-mention token size #3526
Chat: update @-input token background #3548
Chat: display excluded @-files in UI #3528
Chat: Disable adding large-file via @-mention #3523
Support range for large @-mention files #3589

cody
Options

Other design ideas from @toolmantim : #3439 (comment)

taylorsperry · 2024-03-18T19:33:03Z

Just a note that once we've aligned on a path forward and those changes have shipped, we should ping @MaedahBatool to make sure the docs are up to date. (We know users have been confused about this.)

kalanchan · 2024-04-03T21:07:41Z

landed in v1.12

abeatrix self-assigned this Mar 18, 2024

abeatrix changed the title ~~Deal with @-mentioning massive files~~ tracking: context limit issues with @-mentioned files Mar 18, 2024

abeatrix mentioned this issue Mar 20, 2024

Chat: sync token limit at model import time #3486

Merged

varungandhi-src added the cody label Mar 22, 2024

abeatrix closed this as completed Mar 27, 2024

abeatrix reopened this Mar 28, 2024

kalanchan closed this as completed Apr 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tracking: context limit issues with @-mentioned files #3455

tracking: context limit issues with @-mentioned files #3455

abeatrix commented Mar 18, 2024 •

edited

Design Tasks:

Engineering Tasks:

taylorsperry commented Mar 18, 2024

kalanchan commented Apr 3, 2024

tracking: context limit issues with @-mentioned files #3455

tracking: context limit issues with @-mentioned files #3455

Comments

abeatrix commented Mar 18, 2024 • edited

Current state

Design Tasks

Design Tasks:

Engineering Tasks

Engineering Tasks:

taylorsperry commented Mar 18, 2024

kalanchan commented Apr 3, 2024

abeatrix commented Mar 18, 2024 •

edited