You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Qwen 7B is reporting impressive numbers for both English and Chinese.
Qwen is similar to Llama model. Need to add bias params for QKV, and change the tokenizer to tiktoken.
alphaarea, lss0510, MrJungle1 and MarsTechHANMrJungle1MrJungle1