
bug: Model generates response non-stop (default prompt template is wrong?) #3584

Closed
imtuyethan opened this issue Sep 6, 2024 · 6 comments
Labels: category: threads & chat · type: bug

imtuyethan (Contributor) commented Sep 6, 2024

  • I have searched the existing issues

Current behavior

https://discord.com/channels/1107178041848909847/1281512188762128467/1281537450111406080
https://discord.com/channels/1107178041848909847/1239846009258119178/1281127913525088318
https://discord.com/channels/1107178041848909847/1107178593945129060/1281169582811250719

  1. Models downloaded from the Jan Hub (including Llama 3) have incorrect default settings:
    • Fixed context length of 2048
    • Incorrect prompt template
    • Incorrect stop word
  2. The Llama 3 model is repeating responses, producing duplicate or very similar output multiple times.
  3. When importing GGUF models directly (not through the hub), settings appear correct.
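
For illustration, a hub entry for Llama 3 would be expected to ship roughly the following settings in its model.json. The keys and values here are an assumption about Jan's schema, shown only to contrast with the fixed 2048 context length and wrong stop word described above:

```json
{
  "id": "llama3-8b-instruct",
  "settings": {
    "ctx_len": 8192,
    "prompt_template": "<|start_header_id|>system<|end_header_id|>\n\n{system_message}<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n{prompt}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n"
  },
  "parameters": {
    "stop": ["<|eot_id|>"]
  }
}
```

A 2048 context length and a stop word borrowed from a different model family would produce exactly the truncation and runaway generation reported here.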

Minimal reproduction steps

  1. Download a GGUF model (e.g., Llama 3) from the Jan Hub
  2. Observe incorrect default settings
  3. Start a new conversation thread
  4. Send a prompt to the model
  5. Observe the repeated responses and incorrect formatting
  6. Try importing the same GGUF model directly and compare settings

Expected behavior

  1. Models downloaded from the Jan Hub should have correct default settings.
  2. The Llama 3 model (and other models) should provide a single, coherent response without repetition.
  3. The correct prompt template should be used for proper interaction.
  4. Directly imported models and hub-downloaded models should have consistent, correct settings.
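
For context, Llama 3's instruct format wraps each turn in header tokens and terminates it with `<|eot_id|>`, which is also the stop word the runtime must watch for. A minimal sketch of that template (the helper is illustrative, not Jan's actual code):

```python
# Sketch of the Llama 3 instruct chat format. The special tokens are the
# ones the model defines; the helper itself is illustrative only.
def build_llama3_prompt(messages: list[dict]) -> str:
    parts = ["<|begin_of_text|>"]
    for msg in messages:
        parts.append(
            f"<|start_header_id|>{msg['role']}<|end_header_id|>\n\n"
            f"{msg['content']}<|eot_id|>"
        )
    # Leave the assistant header open so the model generates the next turn.
    parts.append("<|start_header_id|>assistant<|end_header_id|>\n\n")
    return "".join(parts)

prompt = build_llama3_prompt([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
])
```

If the runtime's stop word is anything other than `<|eot_id|>`, the model's end-of-turn marker is never honored and it keeps generating turns, which matches the non-stop responses in this report.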

Screenshots / Logs

Screen.Recording.2024-09-06.at.3.37.18.PM.mov

Screenshot 2024-09-06 at 3 54 08 PM

Screenshot 2024-09-06 at 3 58 24 PM

Jan version

v0.5.3-623

In which operating systems have you tested?

  • macOS
  • Windows
  • Linux

Environment details

Operating System: macOS Sonoma 14.2
Processor: Apple M2
RAM: 16GB

@imtuyethan imtuyethan added type: bug Something isn't working P0: critical Mission critical labels Sep 6, 2024
@imtuyethan imtuyethan changed the title bug: Model generates response non-stop bug: Model generates response non-stop (default prompt template is wrong?) Sep 6, 2024
louis-jan (Contributor) commented Sep 9, 2024

Findings:

Ashley had already removed some of the models, so her old threads lost their model references. When she opens such a thread, the app automatically selects another available model, which is a legacy behavior with poor UX.

There is another issue created to address that (searching for it...)

0xSage (Contributor) commented Sep 9, 2024

@louis-jan @imtuyethan I don't understand the issue; can you guys explain it a bit more clearly?

is this an edge case where:

  1. Old thread
  2. Model used for that thread is deleted (along with model.json)
  3. User sends more messages in old thread
  4. Thread autoselects some other model
  5. New model has corrupted settings? How?

tikikun (Contributor) commented Sep 16, 2024

We should be able to extract the chat template from the GGUF file itself; the current behavior is not good UX.
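
For context on this point: GGUF files embed the model's own chat template in their metadata under the key `tokenizer.chat_template`, so an importer can prefer it over any hard-coded default. A minimal sketch, assuming the metadata has already been parsed into a dict (the function name and fallback template below are hypothetical):

```python
# Hypothetical generic fallback; the goal of the fix is to rarely need it.
DEFAULT_TEMPLATE = "{system_message}\nUSER: {prompt}\nASSISTANT:"

def resolve_prompt_template(metadata: dict) -> str:
    # Prefer the chat template embedded in the GGUF metadata; only fall
    # back to a generic default when the file carries none.
    return metadata.get("tokenizer.chat_template") or DEFAULT_TEMPLATE

# Metadata as it might be parsed from a Llama 3 GGUF file:
meta = {"tokenizer.chat_template": "{% for m in messages %}...{% endfor %}"}
template = resolve_prompt_template(meta)
```

Importing the template this way would keep hub downloads and direct GGUF imports consistent, since both paths would read the same embedded metadata.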

louis-jan (Contributor) commented, quoting 0xSage's questions above:

Hi @0xSage @tikikun. This is an outdated issue; it should link to the updated one. This is the case where the model is deleted and the app selects another available model to continue the conversation. However, that is old behavior that we will update soon.

@imtuyethan please help link the new issue.

@imtuyethan imtuyethan added this to the v0.5.5 milestone Sep 18, 2024
imtuyethan (Contributor, Author) commented Sep 18, 2024

There are two issues related to this:

  1. For this one: https://discord.com/channels/1107178041848909847/1281512188762128467/1281537450111406080, it happened because I deleted model A and threads defaulted to model B, but the settings were not updated, so threads used model B with model A's settings (wrong prompt template, etc.), causing the weird responses. This should be fixed via feat: Pick new model in thread if original model is unavailable #3385.

  2. bug: Dropping GGUF metadata when creating model.yaml for Model Downloaded via URL #3558

@imtuyethan imtuyethan added category: model running and removed P0: critical Mission critical labels Sep 18, 2024
@imtuyethan imtuyethan modified the milestones: v0.5.5, v0.5.6 Sep 23, 2024
@0xSage 0xSage added category: threads & chat Threads & chat UI UX issues and removed category: engines labels Oct 14, 2024
0xSage (Contributor) commented Oct 15, 2024

dupe of #3385

@0xSage 0xSage closed this as completed Oct 15, 2024