Identify insights from other domains that help address challenges or open up novel opportunities for your research problem:
- Decomposes a research problem into questions.
- Searches target-domain literature.
- Searches cross-domain literature for transferable ideas.
- Integrates and ranks cross-domain inspirations.
pip install -r requirements.txtsearch.py reads Semantic Scholar credentials from config.py (API_KEY).
Default run (uses data/cross-domain-inspiration-relations.json):
python inspiration_pred.pyUseful options:
python inspiration_pred.py \
--problem_file data/cross-domain-inspiration-relations.json \
--model_name Qwen/Qwen3-14B \
--output_dir inspiration_pred_output \
--max_papers_per_query 20 \
--temp 0.7 \
--min_rel_threshold 0.5 \
--skip_if_existsResults are written to inspiration_pred_output/*.json (or your custom --output_dir).
Each output file contains:
- Problem metadata (
research_problem,target_domain,fine_grained_domain,source_groundtruth) - Cross-domain evidence grouped by question/domain
idea_rankings(ranked integrated ideas)
If you want to convert ground-truth arXiv abstracts into this repo's evaluation format, use:
python evaluation/process_abstracts.pyThis file is a JSON array. Each entry is one cross-domain inspiration relation used as an input problem. The dataset is derived from: CHIMERA
Core fields used by inspiration_pred.py:
source_id(int): source paper identifiertarget_id(int): target paper identifiersource_domain(str): source domain (used as target/focus domain in this pipeline)target_domain(str): referenced inspired domainsource_text(str): source-side idea phrasetarget_text(str): target-side inspiration phrasecontext(str): problem statement passed to decompositionpublication_year(int): used to bound literature searchabstract(str): stored as ground truth metadata
Additional metadata fields in the dataset:
id,relation,arxiv_categoriesfine_grained_source_domain,fine_grained_target_domainhead_leakage,tail_leakagepaper_id
Minimal schema:
[
{
"id": "...",
"source_id": 18243,
"target_id": 38965,
"source_domain": "Philosophy",
"target_domain": "Computer Science",
"source_text": "...",
"target_text": "...",
"relation": "inspiration",
"publication_year": 2021,
"paper_id": 2105.00867,
"abstract": "...",
"context": "..."
}
]
