TRKG-Miner: Relaxed Temporal Knowledge Graph Miner

TRKG-Miner is a framework for link forecasting on knowledge graphs. It mines both cyclic and acyclic rules and uses them to predict links in the knowledge graph.

Note that the task of link forecasting is quite different from link prediction. While many deep learning based approaches have worked successfully for the latter task, the former remains a challenging problem. TRKG-Miner is a rule-based approach that shows significant improvement over the state-of-the-art approaches for link forecasting.

Due credit to Liu et al. for their original implementation of TLogic.

How to run

Create a new python environment.
Run poetry install to install the dependencies from poetry.lock.
The commands for recreating the results from the paper can be found in run.txt.

Datasets

Each event in the temporal knowledge graph is written in the format subject predicate object timestamp, with tabs as separators. The dataset is split into train.txt, valid.txt, and test.txt, where we use the same split as provided by Han et al. The files entity2id.json, relation2id.json, ts2id.json define the mapping of entities, relations, and timestamps to their corresponding IDs, respectively. The file statistics.yaml summarizes the statistics of the dataset and is not needed for running the code.

Parameters

In learn.py:

--dataset, -d: str. Dataset name.

--rule_lengths, -l: int. Length(s) of rules that will be learned, e.g., 2, 1 2 3.

--num_walks, -n: int. Number of walks that will be extracted during rule learning.

--transition_distr: str. Transition distribution; either unif for uniform distribution or exp for exponentially weighted distribution.

--num_processes, -p: int. Number of processes to be run in parallel.

--seed, -s: int. Random seed for reproducibility.

In apply.py:

--dataset, -d: str. Dataset name.

--test_data: str. Data for rule application; either test for test set or any other string for validation set.

--rules, -r: str. Name of the rules file.

--rule_lengths, -l: int. Length(s) of rules that will be applied, e.g., 2, 1 2 3.

--window, -w: int. Size of the time window before the query timestamp for rule application.

--top_k: int. Minimum number of candidates. The rule application stops for a query if this number is reached.

--num_processes, -p: int. Number of processes to be run in parallel.

In evaluate.py:

--dataset, -d: str. Dataset name.

--test_data: str. Data for rule application; either test for test set or any other string for validation set.

--candidates, -c: str. Name of the candidates file.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
data		data
mycode		mycode
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TRKG-Miner: Relaxed Temporal Knowledge Graph Miner

Due credit to Liu et al. for their original implementation of TLogic.

How to run

Datasets

Parameters

About

Releases

Packages

Languages

License

nec-research/TRKG-Miner

Folders and files

Latest commit

History

Repository files navigation

TRKG-Miner: Relaxed Temporal Knowledge Graph Miner

Due credit to Liu et al. for their original implementation of TLogic.

How to run

Datasets

Parameters

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages