Support YaRN models (RoFormer implementation in rotary_embedding kernel) #1027

kevaldekivadiya2415 · 2023-09-13T06:19:34Z

Feature request
Nous Research and EleutherAI have recently released the YaRN model, which comes in two versions with context sizes of 64k and 128k. This model utilizes RoFormer-style embeddings, distinguishing it from GPT-NeoX and GPT-J. It is built upon the foundation of the LLaMa 2 model, making it largely compatible with some minor adjustments required for optimal support.

Motivation
The YaRN model's longer context length (up to 128k) is highly valuable for tasks involving extensive context, compared to the limited 4096 context length of the llama2 base model.

Other
YaRN paper: YaRN: Efficient Context Window Extension of Large Language Models
YaRN Code: YaRN Github

viktor-ferenczi · 2023-09-22T05:54:16Z

Duplicate of #980

zhongwei1968 · 2023-09-23T05:52:55Z

+1

hmellor · 2024-03-08T12:21:52Z

Closing as duplicate of #980

Fixed test logs redirection

viktor-ferenczi mentioned this issue Sep 23, 2023

Support YaRN models (RoFormer implementation in rotary_embedding kernel) #980

Closed

hmellor added the duplicate label Mar 8, 2024

hmellor closed this as completed Mar 8, 2024

hmellor closed this as completed Feb 27, 2025

yiliu30 pushed a commit to yiliu30/vllm-fork that referenced this issue Apr 15, 2025

[SW-224648] Fix test logs redirection (vllm-project#1027)

ff61f89

Fixed test logs redirection

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Support YaRN models (RoFormer implementation in rotary_embedding kernel) #1027

Support YaRN models (RoFormer implementation in rotary_embedding kernel) #1027

kevaldekivadiya2415 commented Sep 13, 2023

viktor-ferenczi commented Sep 22, 2023 •

edited

Loading

Uh oh!

zhongwei1968 commented Sep 23, 2023

Uh oh!

hmellor commented Mar 8, 2024

Uh oh!

Uh oh!

Support YaRN models (RoFormer implementation in rotary_embedding kernel) #1027

Support YaRN models (RoFormer implementation in rotary_embedding kernel) #1027

Comments

kevaldekivadiya2415 commented Sep 13, 2023

viktor-ferenczi commented Sep 22, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zhongwei1968 commented Sep 23, 2023

Uh oh!

hmellor commented Mar 8, 2024

Uh oh!

viktor-ferenczi commented Sep 22, 2023 •

edited

Loading