llmixer's picture
Update README.md
dbea74d verified
metadata
license: apache-2.0
tags:
  - control vectors
  - exllamav2
  - creative writing
  - text generation
  - inference
  - model integration

Creative Writing Control Vectors Integration for ExLlamaV2

This project provides a wrapper to integrate jukofyork's creative writing control vectors with ExLlamaV2. While ExLlamaV2 does not natively support control vectors, this wrapper enables loading and injecting GGUF control vectors into the model for dynamic text generation control.

Overview

  • Wrapper for using control vectors with ExLlamaV2
  • Supports loading control vectors from GGUF format
  • Injects vectors directly into ExLlamaV2 inference
  • Enables dynamic text generation control

Usage

  1. Download model in ExLlamaV2 format
  2. Create a "-vectors" directory next to model directory
  3. Download the control vectors from jukofyork's repository and place them in the "-vectors" directory.
  4. Run inference with the --control_vectors (-vc) parameter.

Example command:

python test_inference.py -m Meta-Llama-3-70B-Instruct-8bpw \
  -p "<prompt>" \
  --control_vectors language:simple:0.5,optimism:optimism:0.5

Directory Structure

Ensure your directory structure follows this format to correctly load the control vectors:

models/
  β”œβ”€β”€ Meta-Llama-3-70B-Instruct-8bpw/
  β”‚   └── model files...
  └── Meta-Llama-3-70B-Instruct-8bpw-vectors/
      β”œβ”€β”€ llama-3:70b-language__debias.gguf
      β”œβ”€β”€ llama-3:70b-language__simple.gguf
      β”œβ”€β”€ llama-3:70b-language__ornate.gguf 
      └── ...

Limitations

  • Proof of concept implementation
  • May impact model performance
  • Limited testing with different vector combinations
  • No guarantee of exact equivalence to llama.cpp behavior

Acknowledgments