Semantic Driven Fielded Entity Retrieval

James Allan; Shahrzad Naseri; Sheikh Muhammad Sarwar

Adding field-level semantic similarities from dense vectors to FSDM improves entity retrieval rankings.

Reviewed by Pith at T0; open to challenge. T0 means a machine referee read the full paper against a public rubric. the ladder, T0–T4 →

Challenge this review Re-run · record.json Download PDF Read on arXiv ↗

T0 review · grok-4.3

2026-05-25 10:49 UTC pith:O46GI67U

load-bearing objection Modest NDCG gains from adding dense field-level embeddings on top of FSDM, but no evidence the embeddings add signal beyond what FSDM already captures. the 3 major comments →

arxiv 1907.01457 v1 pith:O46GI67U submitted 2019-07-02 cs.IR

Semantic Driven Fielded Entity Retrieval

Shahrzad Naseri , Sheikh Muhammad Sarwar , James Allan This is my paper

classification cs.IR

keywords entity retrievalfielded searchsemantic featuresdense vectorsre-rankingFSDMknowledge base searchDBpedia-Entity

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

The pith

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that fielded term-matching models for knowledge-base entities can be strengthened by a second stage that scores semantic similarity between queries and individual document fields. Queries are represented both as bags of terms and bags of entities; their dense vectors are compared to field vectors to produce re-ranking features. These features are applied after an initial FSDM retrieval pass. On the DBpedia-Entity v2 collection the hybrid method raises NDCG@10 by 2.5 percent and NDCG@100 by 1.2 percent over plain FSDM. A reader cares because many deployed search systems already use fielded term models and could adopt the semantic layer without discarding existing infrastructure.

Core claim

The authors propose to represent queries as bags of terms as well as bags of entities, compute dense vector representations of both, and derive field-level semantic similarity features from the query-document vector comparisons. These features are used to re-rank the candidate pool first retrieved by the Fielded Sequential Dependence Model, producing statistically significant gains on the DBpedia-Entity v2 benchmark.

What carries the argument

Field-level semantic features computed from dense vector similarities between query terms/entities and document fields, used as a re-ranking layer atop FSDM.

Load-bearing premise

Semantic similarity scores from dense vectors supply signal that is independent of and additive to the fielded term-matching already performed by FSDM.

What would settle it

Re-running the proposed re-ranking on the DBpedia-Entity v2 dataset and observing no statistically significant lift in NDCG@10 or NDCG@100 would falsify the central claim.

Watch this falsifier — get emailed when new claim-graph text bears on it.

If this is right

Entity retrieval systems obtain higher NDCG scores by re-ranking FSDM results with field-level vector similarities.
Both term-based and entity-based query representations contribute useful semantic signal.
The improvement holds across the full set of queries in the DBpedia-Entity v2 collection.
The semantic layer can be added without replacing the underlying term-matching model.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same re-ranking pattern could be tested on other fielded collections such as web pages or product catalogs.
Stronger embedding models would likely increase the size of the observed gains.
The approach suggests a general template for layering semantic features onto any fielded dependence model.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit.

Desk Editor's Note

Modest NDCG gains from adding dense field-level embeddings on top of FSDM, but no evidence the embeddings add signal beyond what FSDM already captures.

read the letter

The paper's core move is to run FSDM for candidate retrieval on fielded entity documents, then re-rank the pool with semantic similarity scores computed from dense vectors of query terms and entities against the same fields. They report 2.5% and 1.2% lifts in NDCG@10 and NDCG@100 on DBpedia-Entity v2, calling the gains significant for all queries. That specific pipeline combination is what they present as new. The execution on a public benchmark is straightforward and the numbers are stated clearly enough to be checked later. Credit for keeping the evaluation external and not inventing new metrics. The soft spots sit right at the central claim. The abstract supplies no ablation that removes the semantic component, no correlation check between the dense scores and FSDM's existing term and sequential features, no description of normalization or weighting when the scores are combined, and no error bars or test details. Without those, it is impossible to know whether the reported improvement comes from added semantic information or simply from the act of re-ranking or extra tuning. The stress-test concern about non-independence therefore stands on the given description. This work is for people already running fielded entity retrieval experiments who might want to test embedding features in the same setup. A reader could extract the pipeline idea and try it, but the current evidence does not yet support treating the gains as a reliable advance. The paper deserves a serious referee to see whether the full methods section supplies the missing ablations and statistics; on the abstract alone it would be borderline.

Referee Report

3 major / 0 minor

Summary. The paper proposes augmenting the Fielded Sequential Dependence Model (FSDM) for knowledge-base entity retrieval by retrieving an initial pool with FSDM and then re-ranking it using field-level semantic similarity features. Queries are represented as bags of terms and entities whose dense vectors are used to compute similarities against document fields; the authors report 2.5% and 1.2% gains in NDCG@10 and NDCG@100 on DBpedia-Entity v2.

Significance. If the semantic features supply signal orthogonal to FSDM's fielded term matching, the approach could provide a lightweight improvement to entity search pipelines. The evaluation on a public benchmark is a positive; however, the lack of ablations, normalization details, and statistical reporting makes it impossible to confirm that the reported gains arise from the claimed semantic contribution rather than the re-ranking step itself.

major comments (3)

[Abstract / Evaluation] Abstract and evaluation section: the central claim of 'significant improvement' due to semantic features is unsupported because no ablation removing the semantic component, no feature-importance analysis, and no correlation between semantic scores and FSDM scores are provided; without these it is impossible to rule out that gains come from re-ranking alone.
[Methods] Methods / re-ranking description: the manuscript supplies no equations or procedural details on how the dense-vector semantic similarities are normalized (e.g., cosine vs. dot product) or linearly combined with the original FSDM scores; this is load-bearing for both reproducibility and the orthogonality assumption.
[Results] Results: reported NDCG improvements are given without error bars, p-values, or the exact statistical test used, and without per-query or per-field breakdowns; this prevents assessment of whether the 2.5%/1.2% figures are robust or driven by a few queries.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their constructive feedback. We address each major comment below and commit to revisions that strengthen the presentation of our results.

read point-by-point responses

Referee: [Abstract / Evaluation] Abstract and evaluation section: the central claim of 'significant improvement' due to semantic features is unsupported because no ablation removing the semantic component, no feature-importance analysis, and no correlation between semantic scores and FSDM scores are provided; without these it is impossible to rule out that gains come from re-ranking alone.

Authors: Our evaluation directly compares the baseline FSDM against the model augmented with semantic field-level re-ranking features on the same candidate pool; the observed gains are therefore attributable to the addition of those features. We nevertheless agree that explicit ablations, feature-importance analysis, and score correlations would further substantiate orthogonality and will add them to the revised manuscript. revision: yes
Referee: [Methods] Methods / re-ranking description: the manuscript supplies no equations or procedural details on how the dense-vector semantic similarities are normalized (e.g., cosine vs. dot product) or linearly combined with the original FSDM scores; this is load-bearing for both reproducibility and the orthogonality assumption.

Authors: We will insert the missing equations and procedural details in the Methods section, specifying that cosine similarity is used for the dense-vector comparisons and describing the linear combination with FSDM scores (including how weights are obtained). revision: yes
Referee: [Results] Results: reported NDCG improvements are given without error bars, p-values, or the exact statistical test used, and without per-query or per-field breakdowns; this prevents assessment of whether the 2.5%/1.2% figures are robust or driven by a few queries.

Authors: The manuscript asserts statistical significance; we will augment the Results section with error bars, the precise test and p-values, and per-query/per-field breakdowns to allow readers to assess robustness. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical gains on external benchmark

full rationale

The paper retrieves a candidate pool with the existing FSDM model then re-ranks using cosine similarities between dense vectors of query terms/entities and document fields. The reported NDCG@10 and NDCG@100 improvements are measured on the independent DBpedia-Entity (v2) collection using standard metrics. No equations, fitted parameters, or self-citations are described that would render these measured gains tautological by construction. The derivation chain therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no equations or implementation details, so no free parameters, axioms, or invented entities can be identified.

pith-pipeline@v0.9.0 · 5687 in / 1016 out tokens · 27705 ms · 2026-05-25T10:49:05.922134+00:00 · methodology

0 comments

read the original abstract

A common approach for knowledge-base entity search is to consider an entity as a document with multiple fields. Models that focus on matching query terms in different fields are popular choices for searching such entity representations. An instance of such a model is FSDM (Fielded Sequential Dependence Model). We propose to integrate field-level semantic features into FSDM. We use FSDM to retrieve a pool of documents, and then to use semantic field-level features to re-rank those documents. We propose to represent queries as bags of terms as well as bags of entities, and eventually, use their dense vector representation to compute semantic features based on query document similarity. Our proposed re-ranking approach achieves significant improvement in entity retrieval on the DBpedia-Entity (v2) dataset over existing FSDM model. Specifically, for all queries we achieve 2.5% and 1.2% significant improvement in NDCG@10 and NDCG@100, respectively.

discussion (0)

Reference graph

Works this paper leans on

33 extracted references · 33 canonical work pages · 1 internal anchor

[1]

Balog, D

K. Balog, D. Carmel, and P. Arjen. de vries, daniel m. herzig, peter mika, haggai roitman, ralf schenkel, pavel serdyukov, thanh tran duc. In The ﬁrst joint international workshop on entity- oriented and semantic search (JIWES), ACM SI- GIR Forum, 2012

work page 2012
[2]

Balog and R

K. Balog and R. Neumayer. A test collection for entity search in dbpedia. In Proceedings of the 36th international ACM SIGIR conference on Re- search and development in information retrieval , pages 737–740. ACM, 2013

work page 2013
[3]

Balog, P

K. Balog, P. Serdyukov, and A. P. d. Vries. Overview of the trec 2010 entity track. Techni- cal report, NOR WEGIAN UNIV OF SCIENCE AND TECHNOLOGY TRONDHEIM, 2010. 4 Table 2: Overall accuracy on each query group as well as all queries. The semantic and FSDM score are lin- early combined using Coordinate Ascent algorithm. † indicates signiﬁcant (p ¡ 0.05) i...

work page 2010
[4]

Blanco, H

R. Blanco, H. Halpin, D. M. Herzig, P. Mika, J. Pound, H. S. Thompson, and T. T. Duc. En- tity search evaluation over structured web data. In Proceedings of the 1st international workshop on entity-oriented search workshop (SIGIR 2011), ACM, New York , 2011

work page 2011
[5]

Bordes, J

A. Bordes, J. Weston, and N. Usunier. Open question answering with weakly supervised em- bedding models. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases , pages 165–180. Springer, 2014

work page 2014
[6]

J. Chen, C. Xiong, and J. Callan. An empiri- cal study of learning to rank for entity search. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Infor- mation Retrieval , pages 737–740. ACM, 2016

work page 2016
[7]

R.-C. Chen. Coordinateascent. https://github.com/rueycheng/CoordinateAscent, 2018

work page 2018
[8]

Demartini, T

G. Demartini, T. Iofciu, and A. P. De Vries. Overview of the inex 2009 entity ranking track. In International Workshop of the Initiative for the Evaluation of XML Retrieval , pages 254–264. Springer, 2009

work page 2009
[9]

Ferragina and U

P. Ferragina and U. Scaiella. Fast and accurate annotation of short texts with wikipedia pages. IEEE software , 29(1):70–75, 2012

work page 2012
[10]

J. Guo, G. Xu, X. Cheng, and H. Li. Named entity recognition in query. In Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval , pages 267–274. ACM, 2009

work page 2009
[11]

Halpin, D

H. Halpin, D. M. Herzig, P. Mika, R. Blanco, J. Pound, H. Thompon, and D. T. Tran. Evalu- ating ad-hoc object retrieval. In IWEST@ ISWC , 2010

work page 2010
[12]

Hasibi, F

F. Hasibi, F. Nikolaev, C. Xiong, K. Balog, S. E. Bratsberg, A. Kotov, and J. Callan. Dbpedia- entity v2: A test collection for entity search. In Proceedings of the 40th International ACM SI- GIR Conference on Research and Development in Information Retrieval , pages 1265–1268. ACM, 2017

work page 2017
[13]

K. Y. Itakura and C. L. Clarke. A framework for bm25f-based xml retrieval. In Proceedings of the 33rd international ACM SIGIR conference on Re- search and development in information retrieval , pages 843–844. ACM, 2010. 5

work page 2010
[14]

Y. Lin, Z. Liu, M. Sun, Y. Liu, and X. Zhu. Learn- ing entity and relation embeddings for knowledge graph completion. In AAAI, volume 15, pages 2181–2187, 2015

work page 2015
[15]

Lopez, C

V. Lopez, C. Unger, P. Cimiano, and E. Motta. Evaluating question answering over linked data. Web Semantics: Science, Services and Agents on the World Wide Web , 21:3–13, 2013

work page 2013
[16]

C. Lu, W. Lam, and Y. Liao. Entity retrieval via entity factoid hierarchy. In Proceedings of the 53rd Annual Meeting of the Association for Computa- tional Linguistics and the 7th International Joint Conference on Natural Language Processing (Vol- ume 1: Long Papers) , volume 1, pages 514–523, 2015

work page 2015
[17]

Metzler and W

D. Metzler and W. B. Croft. Linear feature-based models for information retrieval. Information Re- trieval, 10(3):257–274, 2007

work page 2007
[18]

Mikolov, I

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In Ad- vances in neural information processing systems , pages 3111–3119, 2013

work page 2013
[19]

Nanni, B

F. Nanni, B. Mitra, M. Magnusson, and L. Dietz. Benchmark for complex answer retrieval. In Pro- ceedings of the ACM SIGIR International Confer- ence on Theory of Information Retrieval , pages 293–296. ACM, 2017

work page 2017
[20]

Y. Ni, Q. K. Xu, F. Cao, Y. Mass, D. Sheinwald, H. J. Zhu, and S. S. Cao. Semantic documents re- latedness using concept graph representation. In Proceedings of the Ninth ACM International Con- ference on Web Search and Data Mining , pages 635–644. ACM, 2016

work page 2016
[21]

Pennington, R

J. Pennington, R. Socher, and C. Manning. Glove: Global vectors for word representation. In Pro- ceedings of the 2014 conference on empirical meth- ods in natural language processing (EMNLP) , pages 1532–1543, 2014

work page 2014
[22]

J. R. P´ erez-Ag¨ uera, J. Arroyo, J. Greenberg, J. P. Iglesias, and V. Fresno. Using bm25f for semantic search. In Proceedings of the 3rd international semantic search workshop , page 2. ACM, 2010

work page 2010
[23]

Pound, P

J. Pound, P. Mika, and H. Zaragoza. Ad-hoc ob- ject retrieval in the web of data. In Proceedings of the 19th international conference on World wide web, pages 771–780. ACM, 2010

work page 2010
[24]

ˇReh ˚ uˇ rek and P

R. ˇReh ˚ uˇ rek and P. Sojka. Software Frame- work for Topic Modelling with Large Cor- pora. In Proceedings of the LREC 2010 Work- shop on New Challenges for NLP Frameworks , pages 45–50, Valletta, Malta, May 2010. ELRA. http://is.muni.cz/publication/884893/en

work page 2010
[25]

Ristoski and H

P. Ristoski and H. Paulheim. Rdf2vec: Rdf graph embeddings for data mining. In International Se- mantic Web Conference , pages 498–514. Springer, 2016

work page 2016
[26]

Robertson, H

S. Robertson, H. Zaragoza, and M. Taylor. Sim- ple bm25 extension to multiple weighted ﬁelds. In Proceedings of the thirteenth ACM international conference on Information and knowledge man- agement, pages 42–49. ACM, 2004

work page 2004
[27]

Serdyukov and A

P. Serdyukov and A. De Vries. Delft university at the trec 2009 entity track: Ranking wikipedia entities. Technical report, DELFT UNIV OF TECHNOLOGY (NETHERLANDS), 2009

work page 2009
[28]

Q. Wang, J. Kamps, G. R. Camps, M. Marx, A. Schuth, M. Theobald, S. Gurajada, and A. Mishra. Overview of the inex 2012 linked data track. In CLEF (Online Working Notes/Labs/Workshop), 2012

work page 2012
[29]

Xiong, J

C. Xiong, J. Callan, and T.-Y. Liu. Word-entity duet representations for document ranking. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in In- formation Retrieval , pages 763–772. ACM, 2017

work page 2017
[30]

Xiong, R

C. Xiong, R. Power, and J. Callan. Explicit se- mantic ranking for academic search via knowledge graph embedding. In Proceedings of the 26th in- ternational conference on world wide web , pages 1271–1279. International World Wide Web Con- ferences Steering Committee, 2017

work page 2017
[31]

Embedding Entities and Relations for Learning and Inference in Knowledge Bases

B. Yang, W.-t. Yih, X. He, J. Gao, and L. Deng. Embedding entities and relations for learning and inference in knowledge bases. arXiv preprint arXiv:1412.6575, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[32]

Zhiltsov, A

N. Zhiltsov, A. Kotov, and F. Nikolaev. Fielded sequential dependence model for ad-hoc entity re- trieval in the web of data. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Re- trieval, pages 253–262. ACM, 2015

work page 2015
[33]

Zwicklbauer, C

S. Zwicklbauer, C. Seifert, and M. Granitzer. Ro- bust and collective entity disambiguation through semantic embeddings. In Proceedings of the 6 39th International ACM SIGIR conference on Re- search and Development in Information Retrieval , pages 425–434. ACM, 2016. 7

work page 2016

[1] [1]

Balog, D

K. Balog, D. Carmel, and P. Arjen. de vries, daniel m. herzig, peter mika, haggai roitman, ralf schenkel, pavel serdyukov, thanh tran duc. In The ﬁrst joint international workshop on entity- oriented and semantic search (JIWES), ACM SI- GIR Forum, 2012

work page 2012

[2] [2]

Balog and R

K. Balog and R. Neumayer. A test collection for entity search in dbpedia. In Proceedings of the 36th international ACM SIGIR conference on Re- search and development in information retrieval , pages 737–740. ACM, 2013

work page 2013

[3] [3]

Balog, P

K. Balog, P. Serdyukov, and A. P. d. Vries. Overview of the trec 2010 entity track. Techni- cal report, NOR WEGIAN UNIV OF SCIENCE AND TECHNOLOGY TRONDHEIM, 2010. 4 Table 2: Overall accuracy on each query group as well as all queries. The semantic and FSDM score are lin- early combined using Coordinate Ascent algorithm. † indicates signiﬁcant (p ¡ 0.05) i...

work page 2010

[4] [4]

Blanco, H

R. Blanco, H. Halpin, D. M. Herzig, P. Mika, J. Pound, H. S. Thompson, and T. T. Duc. En- tity search evaluation over structured web data. In Proceedings of the 1st international workshop on entity-oriented search workshop (SIGIR 2011), ACM, New York , 2011

work page 2011

[5] [5]

Bordes, J

A. Bordes, J. Weston, and N. Usunier. Open question answering with weakly supervised em- bedding models. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases , pages 165–180. Springer, 2014

work page 2014

[6] [6]

J. Chen, C. Xiong, and J. Callan. An empiri- cal study of learning to rank for entity search. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Infor- mation Retrieval , pages 737–740. ACM, 2016

work page 2016

[7] [7]

R.-C. Chen. Coordinateascent. https://github.com/rueycheng/CoordinateAscent, 2018

work page 2018

[8] [8]

Demartini, T

G. Demartini, T. Iofciu, and A. P. De Vries. Overview of the inex 2009 entity ranking track. In International Workshop of the Initiative for the Evaluation of XML Retrieval , pages 254–264. Springer, 2009

work page 2009

[9] [9]

Ferragina and U

P. Ferragina and U. Scaiella. Fast and accurate annotation of short texts with wikipedia pages. IEEE software , 29(1):70–75, 2012

work page 2012

[10] [10]

J. Guo, G. Xu, X. Cheng, and H. Li. Named entity recognition in query. In Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval , pages 267–274. ACM, 2009

work page 2009

[11] [11]

Halpin, D

H. Halpin, D. M. Herzig, P. Mika, R. Blanco, J. Pound, H. Thompon, and D. T. Tran. Evalu- ating ad-hoc object retrieval. In IWEST@ ISWC , 2010

work page 2010

[12] [12]

Hasibi, F

F. Hasibi, F. Nikolaev, C. Xiong, K. Balog, S. E. Bratsberg, A. Kotov, and J. Callan. Dbpedia- entity v2: A test collection for entity search. In Proceedings of the 40th International ACM SI- GIR Conference on Research and Development in Information Retrieval , pages 1265–1268. ACM, 2017

work page 2017

[13] [13]

K. Y. Itakura and C. L. Clarke. A framework for bm25f-based xml retrieval. In Proceedings of the 33rd international ACM SIGIR conference on Re- search and development in information retrieval , pages 843–844. ACM, 2010. 5

work page 2010

[14] [14]

Y. Lin, Z. Liu, M. Sun, Y. Liu, and X. Zhu. Learn- ing entity and relation embeddings for knowledge graph completion. In AAAI, volume 15, pages 2181–2187, 2015

work page 2015

[15] [15]

Lopez, C

V. Lopez, C. Unger, P. Cimiano, and E. Motta. Evaluating question answering over linked data. Web Semantics: Science, Services and Agents on the World Wide Web , 21:3–13, 2013

work page 2013

[16] [16]

C. Lu, W. Lam, and Y. Liao. Entity retrieval via entity factoid hierarchy. In Proceedings of the 53rd Annual Meeting of the Association for Computa- tional Linguistics and the 7th International Joint Conference on Natural Language Processing (Vol- ume 1: Long Papers) , volume 1, pages 514–523, 2015

work page 2015

[17] [17]

Metzler and W

D. Metzler and W. B. Croft. Linear feature-based models for information retrieval. Information Re- trieval, 10(3):257–274, 2007

work page 2007

[18] [18]

Mikolov, I

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In Ad- vances in neural information processing systems , pages 3111–3119, 2013

work page 2013

[19] [19]

Nanni, B

F. Nanni, B. Mitra, M. Magnusson, and L. Dietz. Benchmark for complex answer retrieval. In Pro- ceedings of the ACM SIGIR International Confer- ence on Theory of Information Retrieval , pages 293–296. ACM, 2017

work page 2017

[20] [20]

Y. Ni, Q. K. Xu, F. Cao, Y. Mass, D. Sheinwald, H. J. Zhu, and S. S. Cao. Semantic documents re- latedness using concept graph representation. In Proceedings of the Ninth ACM International Con- ference on Web Search and Data Mining , pages 635–644. ACM, 2016

work page 2016

[21] [21]

Pennington, R

J. Pennington, R. Socher, and C. Manning. Glove: Global vectors for word representation. In Pro- ceedings of the 2014 conference on empirical meth- ods in natural language processing (EMNLP) , pages 1532–1543, 2014

work page 2014

[22] [22]

J. R. P´ erez-Ag¨ uera, J. Arroyo, J. Greenberg, J. P. Iglesias, and V. Fresno. Using bm25f for semantic search. In Proceedings of the 3rd international semantic search workshop , page 2. ACM, 2010

work page 2010

[23] [23]

Pound, P

J. Pound, P. Mika, and H. Zaragoza. Ad-hoc ob- ject retrieval in the web of data. In Proceedings of the 19th international conference on World wide web, pages 771–780. ACM, 2010

work page 2010

[24] [24]

ˇReh ˚ uˇ rek and P

R. ˇReh ˚ uˇ rek and P. Sojka. Software Frame- work for Topic Modelling with Large Cor- pora. In Proceedings of the LREC 2010 Work- shop on New Challenges for NLP Frameworks , pages 45–50, Valletta, Malta, May 2010. ELRA. http://is.muni.cz/publication/884893/en

work page 2010

[25] [25]

Ristoski and H

P. Ristoski and H. Paulheim. Rdf2vec: Rdf graph embeddings for data mining. In International Se- mantic Web Conference , pages 498–514. Springer, 2016

work page 2016

[26] [26]

Robertson, H

S. Robertson, H. Zaragoza, and M. Taylor. Sim- ple bm25 extension to multiple weighted ﬁelds. In Proceedings of the thirteenth ACM international conference on Information and knowledge man- agement, pages 42–49. ACM, 2004

work page 2004

[27] [27]

Serdyukov and A

P. Serdyukov and A. De Vries. Delft university at the trec 2009 entity track: Ranking wikipedia entities. Technical report, DELFT UNIV OF TECHNOLOGY (NETHERLANDS), 2009

work page 2009

[28] [28]

Q. Wang, J. Kamps, G. R. Camps, M. Marx, A. Schuth, M. Theobald, S. Gurajada, and A. Mishra. Overview of the inex 2012 linked data track. In CLEF (Online Working Notes/Labs/Workshop), 2012

work page 2012

[29] [29]

Xiong, J

C. Xiong, J. Callan, and T.-Y. Liu. Word-entity duet representations for document ranking. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in In- formation Retrieval , pages 763–772. ACM, 2017

work page 2017

[30] [30]

Xiong, R

C. Xiong, R. Power, and J. Callan. Explicit se- mantic ranking for academic search via knowledge graph embedding. In Proceedings of the 26th in- ternational conference on world wide web , pages 1271–1279. International World Wide Web Con- ferences Steering Committee, 2017

work page 2017

[31] [31]

Embedding Entities and Relations for Learning and Inference in Knowledge Bases

B. Yang, W.-t. Yih, X. He, J. Gao, and L. Deng. Embedding entities and relations for learning and inference in knowledge bases. arXiv preprint arXiv:1412.6575, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[32] [32]

Zhiltsov, A

N. Zhiltsov, A. Kotov, and F. Nikolaev. Fielded sequential dependence model for ad-hoc entity re- trieval in the web of data. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Re- trieval, pages 253–262. ACM, 2015

work page 2015

[33] [33]

Zwicklbauer, C

S. Zwicklbauer, C. Seifert, and M. Granitzer. Ro- bust and collective entity disambiguation through semantic embeddings. In Proceedings of the 6 39th International ACM SIGIR conference on Re- search and Development in Information Retrieval , pages 425–434. ACM, 2016. 7

work page 2016