File size: 1,544 Bytes
4d7938d
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1e33707
4d7938d
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
---
base_model: unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
language:
- en
- de
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- trl
- machine-translation
- historical-language
- early-modern-german
- legal-texts
- economic-history
- open-source
---

# Early Modern Bohemian German to English Translation Model

## Overview

This model translates from Early Modern Bohemian German (EMBG) to English. It was fine-tuned using LoRA on a unique historical dataset of 3,873 paragraph-level translation pairs sourced from legal court records. The dataset was meticulously transcribed and translated by the Chichele Professor of Economic History, **Sheilagh Ogilvie**, from All Souls College, University of Oxford.

### Key Features

- **Base Model**: `unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit`  
- **Fine-Tuning**: Performed using [LoRA](https://arxiv.org/abs/2106.09685) and [Unsloth](https://github.com/unslothai/unsloth), leveraging Hugging Face's [Transformers](https://github.com/huggingface/transformers) and [TRL](https://github.com/huggingface/trl) libraries.
- **Languages Supported**:  
  - Source: Early Modern Bohemian German (EMBG)  
  - Target: English  
- **Dataset**: Legal court records, manually transcribed and translated over five years. The dataset will be published in an upcoming ACL paper.

### Use Cases

- Research in economic history and legal studies.
- Exploration of historical dialects and their nuances.
- Applications in language revitalisation and historical text analysis.