Efficient Large Language Model Collection Shortened LLMs from Depth Pruning; https://github.com/Nota-NetsPresso/shortened-llm • 15 items • Updated Dec 18, 2024 • 4