DEVCamiloSepulveda's picture
Upload folder using huggingface_hub
e8913fc verified
metadata
license: llama3.2
language:
  - en
base_model:
  - meta-llama/Llama-3.2-1B
pipeline_tag: text-classification
library_name: transformers
tags:
  - regression
  - story-point-estimation
  - software-engineering
datasets:
  - titanium
  - appceleratorstudio
metrics:
  - mae
  - mdae
model-index:
  - name: llama-3.2-1b-story-point-estimation
    results:
      - task:
          type: regression
          name: Story Point Estimation
        dataset:
          type: appceleratorstudio
          name: appceleratorstudio Dataset
          split: test
        metrics:
          - type: mae
            value: 2.593
            name: Mean Absolute Error (MAE)
          - type: mdae
            value: 2.051
            name: Median Absolute Error (MdAE)

LLAMA 3 Story Point Estimator - titanium - appceleratorstudio

This model is fine-tuned on issue descriptions from titanium and tested on appceleratorstudio for story point estimation.

Model Details

  • Base Model: LLAMA 3.2 1B

  • Training Project: titanium

  • Test Project: appceleratorstudio

  • Task: Story Point Estimation (Regression)

  • Architecture: PEFT (LoRA)

  • Input: Issue titles

  • Output: Story point estimation (continuous value)

Usage

from transformers import AutoModelForSequenceClassification, AutoTokenizer
from peft import PeftModel

# Load tokenizer and model
tokenizer = AutoTokenizer.from_pretrained("DEVCamiloSepulveda/00-LLAMA3SP-titanium-appceleratorstudio")
model = AutoModelForSequenceClassification.from_pretrained("DEVCamiloSepulveda/00-LLAMA3SP-titanium-appceleratorstudio")

# Prepare input text
text = "Your issue description here"
inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=20, padding="max_length")

# Get prediction
outputs = model(**inputs)
story_points = outputs.logits.item()

Training Details

  • Fine-tuning method: LoRA (Low-Rank Adaptation)
  • Sequence length: 20 tokens
  • Best training epoch: 2 / 20 epochs
  • Batch size: 32
  • Training time: 245.339 seconds
  • Mean Absolute Error (MAE): 2.593
  • Median Absolute Error (MdAE): 2.051