Skip to content

Commit 3b5d416

Browse files
Merge branch 'main' into ad-412
2 parents ccec76c + 66a90c7 commit 3b5d416

34 files changed

+710
-118
lines changed

.github/workflows/quality.yml

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -44,8 +44,10 @@ jobs:
4444
run: curl -fsSL https://ollama.com/install.sh | sh
4545
- name: Start serving ollama
4646
run: nohup ollama serve &
47-
- name: Pull model granite4:micro
48-
run: ollama pull granite4:micro
47+
- name: Pull models
48+
run: |
49+
ollama pull granite4:micro
50+
ollama pull granite4:micro-h
4951
- name: Run Tests
5052
run: uv run -m pytest -v test
5153
- name: Send failure message tests

docs/alora.md

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ Use the `m alora train` command to fine-tune a LoRA or aLoRA adapter requirement
3737

3838
```bash
3939
m alora train path/to/data.jsonl \
40-
--basemodel ibm-granite/granite-3.2-8b-instruct \
40+
--basemodel ibm-granite/granite-4.0-micro \
4141
--outfile ./checkpoints/alora_adapter \
4242
--adapter alora \
4343
--epochs 6 \
@@ -47,6 +47,10 @@ m alora train path/to/data.jsonl \
4747
--grad-accum 4
4848
```
4949

50+
> **Note on Model Selection**: Use non-hybrid models (e.g., `granite-4.0-micro`) for aLoRA training.
51+
> Hybrid models (`granite-4.0-h-micro`) are recommended for general inference but adapters should be
52+
> trained on non-hybrid base models for compatibility with the `ibm-granite/rag-intrinsics-lib` repository.
53+
5054
### 📌 Parameters
5155

5256
| Flag | Type | Default | Description |

docs/dev/requirement_aLoRA_rerouting.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -37,10 +37,10 @@ from mellea.core import Requirement
3737
from mellea.backends.adapters import GraniteCommonAdapter
3838

3939
m = start_session(
40-
"huggingface.LocalHFBackend:ibm-granite/granite-3.2-8b-instruct")
40+
"huggingface.LocalHFBackend:ibm-granite/granite-4.0-micro")
4141

4242
# By default, the AloraRequirement uses a GraniteCommonAdapter with "requirement_check".
43-
m.backend.add_adapter(GraniteCommonAdapter("ibm-granite/rag-intrinsics-lib", "requirement_check", base_model_name="granite-3.2-8b-instruct"))
43+
m.backend.add_adapter(GraniteCommonAdapter("ibm-granite/rag-intrinsics-lib", "requirement_check", base_model_name="granite-4.0-micro"))
4444

4545
m.instruct(
4646
"Corporate wants you to find the difference between these two strings:\n\naaa\naba")

docs/examples/aLora/101_example.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@
1313

1414
# Define a backend and add the constraint aLora
1515
backend = LocalHFBackend(
16-
model_id="ibm-granite/granite-3.2-8b-instruct", cache=SimpleLRUCache(5)
16+
model_id="ibm-granite/granite-4.0-h-micro", cache=SimpleLRUCache(5)
1717
)
1818

1919
custom_stembolt_failure_constraint = HFConstraintAlora(

0 commit comments

Comments
 (0)