[EMNLP 2025] HD-PiSSA: High-Rank Distributed Orthogonal Adaptation
paper link: https://arxiv.org/pdf/2505.18777
Existing parameter-efficient fine-tuning (PEFT) methods for large language models (LLMs), such as LoRA and PiSSA, constrain model updates to low-rank subspaces, limiting their expressiveness and leading to suboptimal performance on complex tasks. To address this, we introduce High-rank Distributed PiSSA (HD-PiSSA), a distributed PEFT approach that initializes orthogonal adapters across different devices and aggregates their delta updates collectively onto the pre-trained weights W for fine-tuning. Unlike data-parallel LoRA or PiSSA, which maintain identical adapters on all devices, HD-PiSSA assigns different principal components of the pre-trained weights to each GPU, significantly expanding the range of update directions. This yields an effective updated rank over 16x higher than data-parallel LoRA or PiSSA when fine-tuning on 8 GPUs with the same per-device adapter rank. Empirically, we evaluate HD-PiSSA on a range of challenging downstream tasks, including mathematics, code generation, and multi-task learning. In the multi-task setting, HD-PiSSA achieves average gains of 10.0 absolute points (14.63%) over LoRA and 4.98 points (6.60%) over PiSSA across 12 benchmarks, demonstrating the benefits of its extra optimization flexibility.
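For intuition, below is a minimal PyTorch sketch (not the authors' implementation; names such as init_orthogonal_adapters, rank, and num_devices are illustrative) of how PiSSA-style adapters could be initialized from disjoint slices of a weight matrix's SVD, so that each device starts from different, mutually orthogonal principal components while the base weight keeps the residual:

import torch

def init_orthogonal_adapters(W: torch.Tensor, rank: int, num_devices: int):
    """Assign a disjoint slice of W's top principal components to each device."""
    U, S, Vh = torch.linalg.svd(W, full_matrices=False)
    adapters = []
    for k in range(num_devices):
        lo, hi = k * rank, (k + 1) * rank
        sqrt_s = torch.sqrt(S[lo:hi])
        A_k = U[:, lo:hi] * sqrt_s            # (out_features, rank)
        B_k = sqrt_s[:, None] * Vh[lo:hi, :]  # (rank, in_features)
        adapters.append((A_k, B_k))
    # Residual base weight after removing the components handed to the adapters,
    # so that W_res + sum_k A_k @ B_k == W at initialization.
    r_total = rank * num_devices
    W_res = W - U[:, :r_total] @ torch.diag(S[:r_total]) @ Vh[:r_total, :]
    return W_res, adapters

W = torch.randn(512, 512)
W_res, adapters = init_orthogonal_adapters(W, rank=4, num_devices=8)
# The 8 per-device adapters jointly cover 32 orthogonal directions, whereas
# identical data-parallel adapters of rank 4 would cover only 4.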
- Clone HD-PiSSA:
git clone https://github.com/MuLabPKU/HD-PiSSA.git
cd HD-PiSSA
- Install the HD-PiSSA environment:
conda create -n hdpissa python=3.11
conda activate hdpissa
pip install -r requirements.txt --no-deps
- Set the configuration in run.sh.
- Set your desired prompt template in hd_pissa.py:
PROMPT = (
"Below is an instruction that describes a task. "
"Write a response that appropriately completes the request.\n\n"
"### Instruction:\n{instruction}\n\n### Response:"
)
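If you customize the template, it presumably needs to keep the {instruction} placeholder so the training script can fill it with str.format. A minimal illustration (the instruction string is just an example):

filled = PROMPT.format(instruction="Solve 2x + 3 = 11 for x.")
print(filled)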
- Run the experiment.
bash run.sh
Training might be slow because we use a float32 base model by default and our parallel code is not fully optimized yet.
The results reported in our paper are evaluated based on .
@article{wang2025hd,
title={HD-PiSSA: High-Rank Distributed Orthogonal Adaptation},
author={Wang, Yiding and Meng, Fanxu and Zhang, Xuefeng and Jiang, Fan and Tang, Pingzhi and Zhang, Muhan},
journal={arXiv preprint arXiv:2505.18777},
year={2025}
}