WeirdCompound-v1.4-24b

This is a merge of pre-trained language models created using mergekit.

Merge Details

Notes

This is a multi-stage merge. There's little method to my madness and I just stopped when I arrived at something that I liked.

The starting point was DepravedCartographer-v1.0-24b, with slight changes.

Changelog

v1.1

/intermediate/model/B: replaced anthracite-core/Mistral-Small-3.1-24B-Instruct-2503-HF with anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-ChatML

v1.2

/intermediate/model/B: replaced anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-ChatML with anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-Text-Only to get the default tokenizer config

v1.3

/intermediate/model/A: replaced TheDrummer/Cydonia-24B-v3 with TheDrummer/Cydonia-24B-v4
/intermediate/model/A: replaced Doctor-Shotgun/MS3.1-24B-Magnum-Diamond with Doctor-Shotgun/MS3.2-24B-Magnum-Diamond
/intermediate/model/A: replaced Delta-Vector/Austral-24B-Winton with Delta-Vector/MS3.2-Austral-Winton

v1.4

/intermediate/model/B: changed recipe to use Doctor-Shotgun/MS3.2-24B-Magnum-Diamond and Delta-Vector/MS3.2-Austral-Winton

Merge Method

/intermediate/model/A was merged using the Model Stock merge method, with TheDrummer/Cydonia-24B-v4 as the base.
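
As a rough illustration of the Model Stock idea: the paper interpolates between the average of the fine-tuned weights and the base weights, using a ratio t = k·cos(θ) / (1 + (k−1)·cos(θ)), where k is the number of fine-tuned models and θ is the angle between their deltas from the base. The sketch below works on plain Python lists. The function names, the averaging of pairwise cosines, and the zero-delta guard are assumptions made for illustration; this is not mergekit's actual code.

```python
import math

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    if nu == 0 or nv == 0:
        # Assumption: a model identical to the base contributes no angle.
        return 0.0
    return sum(a * b for a, b in zip(u, v)) / (nu * nv)

def model_stock_ratio(k, cos_theta):
    """Interpolation ratio from the Model Stock paper."""
    return k * cos_theta / (1 + (k - 1) * cos_theta)

def model_stock_merge(base, finetuned):
    """Merge one flattened weight vector per model (hypothetical helper)."""
    k = len(finetuned)
    if k < 2:
        raise ValueError("need at least two fine-tuned models")
    # Deltas of each fine-tuned model relative to the base.
    deltas = [[w - b for w, b in zip(ft, base)] for ft in finetuned]
    # Assumption: approximate cos(theta) by the mean pairwise cosine.
    pairs = [(i, j) for i in range(k) for j in range(i + 1, k)]
    cos_theta = sum(cosine(deltas[i], deltas[j]) for i, j in pairs) / len(pairs)
    t = model_stock_ratio(k, cos_theta)
    # Plain average of the fine-tuned weights.
    avg = [sum(col) / k for col in zip(*finetuned)]
    # Pull the average back toward the base by (1 - t).
    return [t * a + (1 - t) * b for a, b in zip(avg, base)]
```

The zero-norm guard matters for a recipe like this one, where the base model also appears in the models list and therefore has a zero delta.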

/intermediate/model/B and the final model were each merged using the SLERP merge method.

/intermediate/model/C was merged using the NuSLERP merge method, with /intermediate/model/B as the base.
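
For readers unfamiliar with SLERP, here is a minimal sketch of spherical linear interpolation between two flattened weight vectors. It is a textbook version in pure Python, not mergekit's implementation; the function name `slerp` and the `eps` fallback threshold are assumptions. In mergekit's slerp method, t = 0 returns the base model and t = 1 returns the other model.

```python
import math

def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation from v0 (t=0) to v1 (t=1)."""
    n0 = math.sqrt(sum(a * a for a in v0))
    n1 = math.sqrt(sum(b * b for b in v1))
    if n0 == 0 or n1 == 0:
        # Degenerate case: fall back to plain linear interpolation.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    # Angle between the two vectors, clamped for numerical safety.
    dot = sum(a * b for a, b in zip(v0, v1)) / (n0 * n1)
    theta = math.acos(max(-1.0, min(1.0, dot)))
    if math.sin(theta) < eps:
        # Nearly parallel vectors: linear interpolation is stable here.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    s0 = math.sin((1 - t) * theta) / math.sin(theta)
    s1 = math.sin(t * theta) / math.sin(theta)
    return [s0 * a + s1 * b for a, b in zip(v0, v1)]
```

Under this convention, a value such as t = 0.4 leans the result toward the base model, and t = 0.5 sits halfway along the arc.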

Models Merged

The following models were included in the merge:

Delta-Vector/MS3.2-Austral-Winton
Doctor-Shotgun/MS3.2-24B-Magnum-Diamond
aixonlab/Eurydice-24b-v3.5
PocketDoc/Dans-PersonalityEngine-V1.3.0-24b
anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-Text-Only
/intermediate/model/A
/intermediate/model/B
/intermediate/model/C

Configuration

The following YAML configuration was used to produce this model:

base_model: TheDrummer/Cydonia-24B-v4 # Cydonia v4
merge_method: model_stock
dtype: bfloat16
models:
  - model: aixonlab/Eurydice-24b-v3.5 # storytelling / RP
  - model: TheDrummer/Cydonia-24B-v4 # sprinkle in some extra Cydonia v4
  - model: PocketDoc/Dans-PersonalityEngine-V1.3.0-24b # Prompt Adherence
  - model: Delta-Vector/MS3.2-Austral-Winton  # Adventure
  - model: Doctor-Shotgun/MS3.2-24B-Magnum-Diamond # claude opus

→ /intermediate/model/A →

merge_method: slerp
dtype: bfloat16
base_model: anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-Text-Only
models:
  - model: /intermediate/model/A
parameters:
  t: 0.4

→ /intermediate/model/B →

merge_method: nuslerp
dtype: bfloat16
base_model: /intermediate/model/B
models:
  - model: Doctor-Shotgun/MS3.2-24B-Magnum-Diamond
    parameters:
      weight: 0.6
  - model: Delta-Vector/MS3.2-Austral-Winton
    parameters:
      weight: 0.4

→ /intermediate/model/C →

merge_method: slerp
dtype: bfloat16
base_model: /intermediate/model/B
models:
  - model: /intermediate/model/C
parameters:
  t: 0.5

→ WeirdCompound-v1.4-24b
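
The four stages above have to run in order, since each later config reads an earlier output. A minimal sketch of how that could be scripted is below. It assumes each YAML block is saved to its own file (the filenames are hypothetical) and uses mergekit's `mergekit-yaml <config> <output_dir>` command. It only builds and prints the commands rather than running them.

```python
# (config file, output directory) per stage, in dependency order.
# Config filenames are hypothetical; output paths follow the recipe above.
STAGES = [
    ("stage_a_model_stock.yml", "/intermediate/model/A"),
    ("stage_b_slerp.yml", "/intermediate/model/B"),
    ("stage_c_nuslerp.yml", "/intermediate/model/C"),
    ("final_slerp.yml", "WeirdCompound-v1.4-24b"),
]

def build_commands(stages):
    """Return one mergekit-yaml invocation per stage, as argument lists."""
    return [["mergekit-yaml", config, output] for config, output in stages]

if __name__ == "__main__":
    for cmd in build_commands(STAGES):
        # Dry run: print the command instead of executing it.
        print(" ".join(cmd))
```

To actually run a stage, each argument list could be passed to `subprocess.run(cmd, check=True)`, which stops the pipeline if a merge fails.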

Safetensors

Model size: 24B params

Tensor type: BF16

Model tree for FlareRebellion/WeirdCompound-v1.4-24b

Quantizations: 2 models

Paper for FlareRebellion/WeirdCompound-v1.4-24b

Model Stock: All we need is just a few fine-tuned models (arXiv 2403.19522, published Mar 28, 2024)