{"record_type":"pith_number_record","schema_url":"https://pith.science/schemas/pith-number/v1.json","pith_number":"pith:2025:DP3HPN57BTGMINZHNGPZNWJRE4","short_pith_number":"pith:DP3HPN57","schema_version":"1.0","canonical_sha256":"1bf677b7bf0cccc43727699f96d93127054d573cf321ee2b537bded353e506f1","source":{"kind":"arxiv","id":"2510.24561","version":3},"attestation_state":"computed","paper":{"title":"LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis","license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","headline":"Minimizing the expected parameter discrepancy between fine-tuned and target models yields an optimal data-aware initialization for LoRA.","cross_cats":["cs.AI"],"primary_cat":"cs.LG","authors_text":"Chang Chu, Qi Li, Qingyue Zhang, Shao-Lun Huang, Tianren Peng, Xiangyang Luo, Zhihao Jiang","submitted_at":"2025-10-28T15:55:36Z","abstract_excerpt":"LoRA has become a widely adopted method for PEFT, and its initialization methods have attracted increasing attention. However, existing methods have notable limitations: many methods do not incorporate target-domain data, while gradient-based methods exploit data only at a shallow level by relying on one-step gradient decomposition. In this paper, we establish a theoretical framework for data-aware LoRA initialization. Starting from minimizing the expectation of the parameter discrepancy between the fine-tuned and target models, we derive an optimization problem with two components: a bias ter"},"verification_status":{"content_addressed":true,"pith_receipt":true,"author_attested":false,"weak_author_claims":0,"strong_author_claims":0,"externally_anchored":false,"storage_verified":false,"citation_signatures":0,"replication_records":0,"graph_snapshot":true,"references_resolved":false,"formal_links_present":true},"canonical_record":{"source":{"id":"2510.24561","kind":"arxiv","version":3},"metadata":{"license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","primary_cat":"cs.LG","submitted_at":"2025-10-28T15:55:36Z","cross_cats_sorted":["cs.AI"],"title_canon_sha256":"b811800cce06bd1d870a26acdd53dc356f65eff51563678282744256ac902ec7","abstract_canon_sha256":"f18d6e330c036d5df6c7ececba4202299bf8cb899ec23d89e25de73155c45295"},"schema_version":"1.0"},"receipt":{"kind":"pith_receipt","key_id":"pith-v1-2026-05","algorithm":"ed25519","signed_at":"2026-06-08T01:03:50.387066Z","signature_b64":"A4glQSZABUa5I8r3B2gvhHTEuTb5GdbbQrRrfzHUrk2jMtm5OOVeesv5ep4+PuVutGuquLiJYlRT5DqYM3OXAQ==","signed_message":"canonical_sha256_bytes","builder_version":"pith-number-builder-2026-05-17-v1","receipt_version":"0.3","canonical_sha256":"1bf677b7bf0cccc43727699f96d93127054d573cf321ee2b537bded353e506f1","last_reissued_at":"2026-06-08T01:03:50.386093Z","signature_status":"signed_v1","first_computed_at":"2026-06-08T01:03:50.386093Z","public_key_fingerprint":"8d4b5ee74e4693bcd1df2446408b0d54"},"graph_snapshot":{"paper":{"title":"LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis","license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","headline":"Minimizing the expected parameter discrepancy between fine-tuned and target models yields an optimal data-aware initialization for LoRA.","cross_cats":["cs.AI"],"primary_cat":"cs.LG","authors_text":"Chang Chu, Qi Li, Qingyue Zhang, Shao-Lun Huang, Tianren Peng, Xiangyang Luo, Zhihao Jiang","submitted_at":"2025-10-28T15:55:36Z","abstract_excerpt":"LoRA has become a widely adopted method for PEFT, and its initialization methods have attracted increasing attention. However, existing methods have notable limitations: many methods do not incorporate target-domain data, while gradient-based methods exploit data only at a shallow level by relying on one-step gradient decomposition. In this paper, we establish a theoretical framework for data-aware LoRA initialization. Starting from minimizing the expectation of the parameter discrepancy between the fine-tuned and target models, we derive an optimization problem with two components: a bias ter"},"claims":{"count":4,"items":[{"kind":"strongest_claim","text":"Solving this problem yields an optimal initialization strategy for LoRA, based on which we develop an efficient algorithm, LoRA-DA. Empirical results across multiple benchmarks demonstrate that LoRA-DA consistently improves final accuracy over existing initialization methods.","source":"verdict.strongest_claim","status":"machine_extracted","claim_id":"C1","attestation":"unclaimed"},{"kind":"weakest_assumption","text":"The bias term is approximated using a Fisher-gradient formulation to preserve anisotropy while the variance term uses the Fisher information to capture sampling uncertainty; this approximation must hold for the derived initialization to be optimal in the target fine-tuning regime.","source":"verdict.weakest_assumption","status":"machine_extracted","claim_id":"C2","attestation":"unclaimed"},{"kind":"one_line_summary","text":"LoRA-DA derives an optimal data-aware LoRA initialization by solving an optimization problem from asymptotic analysis of parameter discrepancy using Fisher-gradient bias and Fisher-information variance terms.","source":"verdict.one_line_summary","status":"machine_extracted","claim_id":"C3","attestation":"unclaimed"},{"kind":"headline","text":"Minimizing the expected parameter discrepancy between fine-tuned and target models yields an optimal data-aware initialization for LoRA.","source":"verdict.pith_extraction.headline","status":"machine_extracted","claim_id":"C4","attestation":"unclaimed"}],"snapshot_sha256":"f5d2579ad610b677589cf8e06b4f1aed8c7c49a51d09d0b2e63ae1edf92b9883"},"source":{"id":"2510.24561","kind":"arxiv","version":3},"verdict":{"id":"dfbdd8dc-eae1-4469-959a-c0a7071af2a3","model_set":{"reader":"grok-4.3"},"created_at":"2026-05-18T02:58:14.357911Z","strongest_claim":"Solving this problem yields an optimal initialization strategy for LoRA, based on which we develop an efficient algorithm, LoRA-DA. Empirical results across multiple benchmarks demonstrate that LoRA-DA consistently improves final accuracy over existing initialization methods.","one_line_summary":"LoRA-DA derives an optimal data-aware LoRA initialization by solving an optimization problem from asymptotic analysis of parameter discrepancy using Fisher-gradient bias and Fisher-information variance terms.","pipeline_version":"pith-pipeline@v0.9.0","weakest_assumption":"The bias term is approximated using a Fisher-gradient formulation to preserve anisotropy while the variance term uses the Fisher information to capture sampling uncertainty; this approximation must hold for the derived initialization to be optimal in the target fine-tuning regime.","pith_extraction_headline":"Minimizing the expected parameter discrepancy between fine-tuned and target models yields an optimal data-aware initialization for LoRA."},"integrity":{"clean":true,"summary":{"advisory":0,"critical":0,"by_detector":{},"informational":0},"endpoint":"/pith/2510.24561/integrity.json","findings":[],"available":true,"detectors_run":[],"snapshot_sha256":"c28c3603d3b5d939e8dc4c7e95fa8dfce3d595e45f758748cecf8e644a296938"},"references":{"count":0,"sample":[],"resolved_work":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57","internal_anchors":0},"formal_canon":{"evidence_count":2,"snapshot_sha256":"7f21efc0f643fe16f221e875a524f596f1cdad16260413e46ef1097ae2074a7a"},"author_claims":{"count":0,"strong_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"builder_version":"pith-number-builder-2026-05-17-v1"},"aliases":[{"alias_kind":"arxiv","alias_value":"2510.24561","created_at":"2026-06-08T01:03:50.386221+00:00"},{"alias_kind":"arxiv_version","alias_value":"2510.24561v3","created_at":"2026-06-08T01:03:50.386221+00:00"},{"alias_kind":"doi","alias_value":"10.48550/arxiv.2510.24561","created_at":"2026-06-08T01:03:50.386221+00:00"},{"alias_kind":"pith_short_12","alias_value":"DP3HPN57BTGM","created_at":"2026-06-08T01:03:50.386221+00:00"},{"alias_kind":"pith_short_16","alias_value":"DP3HPN57BTGMINZH","created_at":"2026-06-08T01:03:50.386221+00:00"},{"alias_kind":"pith_short_8","alias_value":"DP3HPN57","created_at":"2026-06-08T01:03:50.386221+00:00"}],"events":[],"event_summary":{},"paper_claims":[],"inbound_citations":{"count":0,"internal_anchor_count":0,"sample":[]},"formal_canon":{"evidence_count":2,"sample":[],"anchors":[]},"links":{"html":"https://pith.science/pith/DP3HPN57BTGMINZHNGPZNWJRE4","json":"https://pith.science/pith/DP3HPN57BTGMINZHNGPZNWJRE4.json","graph_json":"https://pith.science/api/pith-number/DP3HPN57BTGMINZHNGPZNWJRE4/graph.json","events_json":"https://pith.science/api/pith-number/DP3HPN57BTGMINZHNGPZNWJRE4/events.json","paper":"https://pith.science/paper/DP3HPN57"},"agent_actions":{"view_html":"https://pith.science/pith/DP3HPN57BTGMINZHNGPZNWJRE4","download_json":"https://pith.science/pith/DP3HPN57BTGMINZHNGPZNWJRE4.json","view_paper":"https://pith.science/paper/DP3HPN57","resolve_alias":"https://pith.science/api/pith-number/resolve?arxiv=2510.24561&json=true","fetch_graph":"https://pith.science/api/pith-number/DP3HPN57BTGMINZHNGPZNWJRE4/graph.json","fetch_events":"https://pith.science/api/pith-number/DP3HPN57BTGMINZHNGPZNWJRE4/events.json","actions":{"anchor_timestamp":"https://pith.science/pith/DP3HPN57BTGMINZHNGPZNWJRE4/action/timestamp_anchor","attest_storage":"https://pith.science/pith/DP3HPN57BTGMINZHNGPZNWJRE4/action/storage_attestation","attest_author":"https://pith.science/pith/DP3HPN57BTGMINZHNGPZNWJRE4/action/author_attestation","sign_citation":"https://pith.science/pith/DP3HPN57BTGMINZHNGPZNWJRE4/action/citation_signature","submit_replication":"https://pith.science/pith/DP3HPN57BTGMINZHNGPZNWJRE4/action/replication_record"}},"created_at":"2026-06-08T01:03:50.386221+00:00","updated_at":"2026-06-08T01:03:50.386221+00:00"}