Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
Phi-3-mini-128k-instruct
like
1.63k
Follow
Microsoft
7.98k
Text Generation
Transformers
Safetensors
English
phi3
nlp
code
conversational
custom_code
text-generation-inference
Inference Endpoints
License:
mit
Model card
Files
Files and versions
Community
97
Train
Deploy
Use this model
fix(modeling_phi3): Fixes inv_freq not being re-computed for extended RoPE.
#25
by
gugarosa
- opened
Apr 24, 2024
base:
refs/heads/main
←
from:
refs/pr/25
Discussion
Files changed
+96853
-0
chore(root): Initial files upload.
780ec161
fix(root): Another readme typo.
f33e280a
Update the chat format (#1)
b992ce9c
fix multiple typo in README (#2)
65b45158
fix(readme): Fixes 128k using results table from 4k.
f0541385
Add Hardware section (#4)
b0ec25d5
Update configuration_phi3.py
f9ba902c
Update configuration_phi3.py
d931c54a
Update modeling_phi3.py
0ef07d7c
Update README.md
edb43c85
Update README.md (#5)
1865373c
Update README.md (#6)
57524a85
Update Phi-3 Mini-128K-Instruct ONNX model link (#7)
af89cc8e
Update sample_finetune.py (#8)
d28759bb
Update README.md (#9)
f775c377
Update introduction and examples
1a62a12b
Update README.md
49a5cb30
chore(root): Updates source files with RC versions.
b657a9af
fix(root): `eval_strategy` is not available as `TrainingArguments` anymore.
eda34e12
Fix grammatical errors
94d2ad21
fix(root): Updating to the almost-ready release candidate.
534cce79
gugarosa
Microsoft org
Apr 24, 2024
No description provided.
fix(modeling_phi3): Fixes inv_freq not being re-computed for extended RoPE.
39924aa7
gugarosa
changed pull request status to
merged
Apr 24, 2024
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment