Niri_LLM: Fine-Tuned LLaMA-2 Model for Civil Engineering Tasks
Model: NousResearch/Llama-2-7b-chat-hf
Dataset: Custom Civil Engineering Dataset
Version: 1.0.0
Date: August 2024
Model Description
Niri_LLM is a fine-tuned version of the LLaMA-2 model, specifically designed to address civil engineering challenges. It is particularly effective in generating accurate and contextually relevant responses to queries related to structural health monitoring, corrosion management, and other civil engineering disciplines.
The fine-tuning process focused on enhancing the model's ability to understand and generate detailed technical content, making it a valuable tool for engineers, researchers, and professionals in the field.
Model Architecture
- Base Model: LLaMA-2 (7B Parameters)
- Quantization: 4-bit with NF4 quantization type
- LoRA Configuration:
- Dimension (r): 64
- Alpha: 16
- Dropout: 0.1
- Attention Mechanism: Scaled Dot-Product
- Tokenizer: LLaMA Tokenizer with EOS token as the padding token
Libraries Used
The following Python libraries were essential in the development, fine-tuning, and deployment of Niri_LLM:
- Transformers (v4.31.0): For loading and fine-tuning the LLaMA-2 model.
- BitsAndBytes (v0.40.2): For 4-bit quantization and efficient GPU usage.
- PEFT (v0.4.0): For parameter-efficient fine-tuning (LoRA) of the model.
- Accelerate (v0.21.0): To optimize model training on multi-GPU setups.
- TRL (Transformers Reinforcement Learning) (v0.4.7): For supervised fine-tuning (SFT) of the model.
- PyMuPDF: For extracting text from PDF documents used in the dataset.
- PyArrow: To handle and manipulate dataset structures during training.
- Datasets: For loading and processing the training data from text files.
- Torch: PyTorch was used as the primary framework for training and fine-tuning the model.
- TensorBoard: For monitoring the training process.
Training Data
The model was trained on a custom dataset comprising documents, guidelines, and manuals specific to civil engineering. These documents covered various topics, including:
- Structural Health Monitoring Techniques
- Inspection Procedures and Standards
- Corrosion Types, Causes, and Mitigation Strategies
- Material Science and Engineering Properties
- Case Studies in Infrastructure Management
Training Process
The training was conducted on a single GPU with 6GB of memory using the following steps:
- Data Preparation: Text data was extracted from PDFs using PyMuPDF and preprocessed to remove irrelevant content.
- Tokenization: The LLaMA tokenizer was employed to convert text into tokens.
- Model Fine-Tuning: The model was fine-tuned using the QLoRA technique, focusing on domain-specific language understanding.
- Evaluation: The model was evaluated using a subset of the data to ensure the quality and relevance of the generated outputs.
Hyperparameters
- Precision: 4-bit (NF4)
- Batch Size: 4 (per device)
- Learning Rate: 2e-4
- Weight Decay: 0.001
- Gradient Clipping: 0.3
- Epochs: 1
- Scheduler: Cosine
- Warmup Ratio: 0.03
- Max Gradient Norm: 0.3
Model Performance
- Accuracy: The model demonstrated a high level of accuracy in generating relevant responses to technical queries.
- Inference Speed: Optimized for deployment on resource-constrained environments, with efficient memory usage due to 4-bit quantization.
- Robustness: Effective across a wide range of civil engineering topics, though validation by domain experts is recommended for critical applications.
Usage
To use Niri_LLM, load it via the Hugging Face transformers
library:
!pip install accelerate transformers huggingface
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline
import os
# Make sure to replace 'YOUR_ACTUAL_TOKEN' with your real Hugging Face API token
# You can get your token from https://huggingface.co/settings/tokens
YOUR_ACTUAL_TOKEN = "ENTER_THE_TOKEN"
model = AutoModelForCausalLM.from_pretrained("NireeskshanAI/Fine_tunned_Niri_LLM", use_auth_token=YOUR_ACTUAL_TOKEN)
tokenizer = AutoTokenizer.from_pretrained("NireeskshanAI/Fine_tunned_Niri_LLM", use_auth_token=YOUR_ACTUAL_TOKEN)
pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)
prompt = "How to tackle the problem of pitting corrosion?"
output = pipe(f"<s>[INST] {prompt} [/INST]")
print(output[0]['generated_text'])
Example Prompts
- Corrosion: "Explain the methods to detect pitting corrosion in steel structures."
- Structural Health: "Describe the key techniques used in monitoring the health of bridges."
- Material Science: "What are the effects of chloride ions on concrete durability?"
Limitations and Considerations
- Specialization: This model is highly specialized for civil engineering and may not generalize well to other domains.
- Ethical Use: Ensure that a qualified professional validate the model's outputs before application in real-world scenarios.
- Resource Requirements: While optimized, the model requires a GPU with at least 6GB of memory for efficient inference.
Future Work
- Extended Training: Plan to incorporate more diverse datasets, including international engineering standards and real-time monitoring data.
- Multi-Lingual Support: Expanding the model's capabilities to handle civil engineering queries in multiple languages.
- User Feedback: Incorporating feedback mechanisms to improve model performance and relevance continually.
License
This model is licensed under the Apache License 2.0.
Citation
If you use this model in your research or application, please cite it as:
@misc{,
title={Niri_LLM: Fine-Tuned LLaMA-2 Model for Civil Engineering Tasks},
author={NireeskshanAI},
year={2024},
publisher={Hugging Face},
note={url{https://huggingface.co/NireeskshanAI/Finetuned_NIRI_LLM}},
}
Acknowledgments
Special thanks to:
- Hugging Face for providing the infrastructure and tools for model development and deployment.
- NousResearch for the LLaMA-2 base model.
- Downloads last month
- 3
Model tree for NireeskshanAI/Finetuned_NIRI_LLM
Base model
NousResearch/Llama-2-7b-chat-hf