New discussion

Falcon models slow inference

10
#59 opened over 1 year ago by
mikeytrw

I need an API of Falcon

8
#56 opened over 1 year ago by
JustMe4Real

Extracting attention maps

#49 opened over 1 year ago by
roeehendel

Fix the kv-cache dimensions

1
#47 opened over 1 year ago by
cchudant

Multi GPU inference issue

1
#39 opened over 1 year ago by
eastwind

Fine-tuning on a new language

4
#35 opened over 1 year ago by
AliMirlou

Flash attention

2
#34 opened over 1 year ago by
utensil

about evaluating on humaneval

#33 opened over 1 year ago by
dongZheX

Finetune on "uncensored" dataset?

1
#32 opened over 1 year ago by
sivarajan

Tokenizer Details

#31 opened over 1 year ago by
kye

Import dataset and chat with it

2
#27 opened over 1 year ago by
phdykd

请求:DOI

#16 opened over 1 year ago by
Huanghai

Finetune wtih QLoRA please

7
#14 opened over 1 year ago by
supercharge19

[Bug] Does not work

58
#3 opened over 1 year ago by
catid