Skip to content

Conversation

@skcussm
Copy link

@skcussm skcussm commented Nov 7, 2025

-Added Pseudo FIM feature which provides in line completion for use with models not specifically designed for FIM. This allows for in line completions with models such as gpt-oss-20b/120b.
-Added Pseudo FIM system prompt preference which is used when Pseudo FIM is enabled in the models preferences.
-This feature works well in LM Studio/OpenAI and Ollama. Implemented, but untested for Mistral.
-When Pseudo FIM is not enabled, the behavior should be identical to the original FIM implementation (requiring a FIM enabled model).

Notes: Interestingly, Pseudo FIM mode still works well with the qwen2.5-coder models. It works reasonably well with meta-llama-3.1-8b-instruct, gemma-3-12b. It works very well with gpt-oss-20b.

Future possibilities: This new feature uses a system prompt to instruct the model to behave like a FIM model. Future work should be done to appropriately automatically supply more context with this system prompt (or the user prompt) so the Pseudo FIM model may consider fuller context in its completions.

Please test with Mistral as I am not set up to do that.

I tried to follow the existing codes style and approach. Please, feel free to improve this contribution.

-Added Pseudo FIM feature which provides in line completion for use with
models not specifically designed for FIM.  This allows for in line
completions with models such as gpt-oss-20b/120b.
-Added Pseudo FIM system prompt preference which is used when Pseudo FIM
is enabled in the models preferences.
-This feature works well in LM Studio/OpenAI and Ollama.  Implemented,
but untested for Mistral.
-When Pseudo FIM is not enabled, the behavior should be identical to the
original FIM implementation (requiring a FIM enabled model).

Notes: Interestingly, Pseudo FIM mode still works well with the
qwen2.5-coder models.  It works reasonably well with
meta-llama-3.1-8b-instruct, gemma-3-12b. It works very well with
gpt-oss-20b.

Future possibilities: This new feature uses a system prompt to instruct
the model to behave like a FIM model.  Future work should be done to
appropriately automatically supply more context with this system prompt
(or the user prompt) so the Pseudo FIM model may consider fuller context
in its completions.
@hetzge
Copy link
Owner

hetzge commented Nov 8, 2025

Wow, thank you very much for the contribution.

I'm currently traveling, but I'll take a deeper look at it as soon as possible.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants