Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
/
Llama-3_1-Nemotron-51B-Instruct
like
203
Follow
NVIDIA
9.22k
Text Generation
Transformers
Safetensors
PyTorch
English
nemotron-nas
nvidia
llama-3
conversational
custom_code
arxiv:
4 papers
License:
nvidia-open-model-license
Model card
Files
Files and versions
Community
24
Train
Use this model
fixed flash_attention backward_compat
#3
by
itlevy
- opened
Sep 24, 2024
base:
refs/heads/main
←
from:
refs/pr/3
Discussion
Files changed
+14
-70
transformers>=4.44.2
e9d7c68d
flash_attention_utils_backward_compat (#2)
186a08a2
fixed flash_attention backward_compat
c7f5725d
itlevy
NVIDIA org
Sep 24, 2024
No description provided.
itlevy
changed pull request status to
closed
Sep 24, 2024
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment