Knowledge Graph
Knowledge Graph
Arijit Khan
Aalborg University
[email protected]
arXiv:2305.14485v1 [cs.DB] 23 May 2023
prediction, graph embedding (R-GCN), and KG embed- data. We conclude with a discussion about future work
ding (TransE, DistMult, RotatE). on KG querying.
(8) GraphDB (https://www.ontotext.com/products/graphdb) is • Vector data management and querying. With the
an RDF database using SPARQL query language. It prevalence of KG embedding based query processing,
supports faceted search and third-party tools, such as managing and querying of vector data is critical. Data
metaphactory, for interactive visualization. management community can contribute in this domain
with high-dimensional data indexing, join, querying, and
(9) TigerGraph [37] is a native graph database with
geometric data processing.
property graph data model and GSQL language. Tiger-
Graph GraphStudio provides a graphical interface for • Scalable embedding learning. Scaling knowledge
interactive visualization and exploration. TigerGraph’s graphs embedding is challenging [88, 74, 183]. The
ML Workbench is a Jupyter-based Python development problem gets exacerbated when combined with more com-
framework that is inter-operable with popular deep learn- plex data, such as KG+query embedding and multi-modal
ing frameworks such as PyTorch Geometric, DGL, and KG embedding. Advanced techniques are required for
supports graph embedding (Node2Vec, Fast Random Pro- scalable embedding learning of multi-modal KGs, e.g.,
jection, and Weisfeiler-Lehman). with language models, and conversational KG-QA with
sequence-to-sequence models.
(10) FaunaDB (https://fauna.com) is a document-relational
database with property graph model and GraphQL API. • Graph databases support for KG embedding. Cur-
rent graph DBMS support for ML-based KG querying
Summary. The top-10 commercial graph databases sup- is limited. In future, they can incorporate more KG
port various languages for querying of KGs – as RDF embedding models, vector data management and query
triples or property graphs. Besides, many of them also processing techniques, as well as enable multi-modal
provide interactive interfaces for visualization, query- KG storage and query, more interactive means of KG
ing, and exploration of KGs. Their support for in-built querying such as NLQs and dialogues.
ML-based KG querying is limited. Only Amazon Nep-
• Usability of KG querying methods. Besides SPARQL,
tune provides a few popular KG embedding methods
a number of KG querying approaches exist, e.g., query
such as TransE, DistMult, and RotatE. While many of
languages, keyword search, query-by-example, faceted
these graph databases (e.g., AllegroGraph, ArangoDB,
search, visual query, natural language questions, and
OrientDB) are multi-model, supporting multiple data mod-
conversational QA. It would be interesting to holisti-
els against a single backend, none of them has in-house
cally compare them, understand their user-friendliness,
system for storage and querying of multi-modal data,
and categorize what is applicable in which domains.
such as KGs with text, images, and multimedia.
• Suitability of KG embedding models. A number of
5 Future Directions KG embedding models exist, such as translation-based
models (TransE, TransD, TransH) and semantic match-
Knowledge graphs can support a holistic integration so- ing models (RESCAL, DistMult, ConvE). Different mod-
lution for multi-modal data arriving from heterogeneous els preserve various types of relation properties, e.g.,
sources. For instance, nodes and edges in a KG can have symmetry, antisymmetry, inversion, composition, com-
arbitrary number of properties of different types, e.g., plex mapping properties, etc. [140]. One can analyze
tabular, key-value pairs, text, images, and multimedia. which properties are important for what queries, lead-
Therefore, KGs can be a unified data model for complex ing to a realization of which KG embedding models are
data lake problems, to model cross-domain and diverse suitable for different KGs and queries.
• Explainability, interoperability, and multi-lingual [18] K. D. Bollacker, C. Evans, P. K. Paritosh, T. Sturge, and
KG querying. There is an increasing focus on inter- J. Taylor. Freebase: a collaboratively created graph database
for structuring human knowledge. In SIGMOD, 2008.
pretability of deep learning methods over graph-structured [19] A. Bonifati, R. Ciucanu, and A. Lemay. Learning path queries
data. In this context, explainability in knowledge graph on graph databases. In EDBT, 2015.
embeddings is also important, for instance, what is be- [20] A. Bonifati, G. H. L. Fletcher, H. Voigt, and N. Yakovets.
Querying graphs. Synthesis Lectures on Data Management.
ing learned in knowledge graph embedding and KG-QA Morgan & Claypool Publishers, 2018.
with explanatory evidences. Interoperability between [21] A. Bonifati, W. Martens, and T. Timm. An analytical study of
KGs and supporting multi-lingual KGs [106] and queries large SPARQL query logs. VLDB J., 29(2-3):655–679, 2020.
are other interesting future directions. [22] A. Bordes, N. Usunier, S. Chopra, and J. Weston. Large-scale
simple question answering with memory networks. CoRR,
abs/1506.02075, 2015.
6 Acknowledgement [23] A. Bordes, N. Usunier, A. García-Durán, J. Weston, and
O. Yakhnenko. Translating embeddings for modeling
Arijit Khan acknowledges support from the Novo Nordisk multi-relational data. In NeurIPS, 2013.
Foundation grant NNF22OC0072415. [24] S. Borgwardt, İ. İ. Ceylan, and T. Lukasiewicz. Recent
advances in querying probabilistic knowledge bases. In IJCAI,
7 References 2018.
[25] S. Bouhenni, S. Yahiaoui, N. Nouali-Taboudjemat, and
H. Kheddouci. A survey on distributed graph pattern matching
[1] D. J. Abadi, A. Marcus, S. Madden, and K. Hollenbach. in massive graphs. ACM Comput. Surv., 54(2):36:1–36:35,
Sw-store: a vertically partitioned DBMS for semantic web 2022.
data management. VLDB J., 18(2):385–406, 2009. [26] J. Cai, Z. Zhang, F. Wu, and J. Wang. Deep cognitive
[2] I. Abdelaziz, R. Harbi, Z. Khayyat, and P. Kalnis. A survey reasoning network for multi-hop question answering over
and experimental comparison of distributed SPARQL engines knowledge graphs. In ACL/IJCNLP, 2021.
for very large RDF data. PVLDB, 10(13):2049–2060, 2017. [27] D. V. Camarda, S. Mazzini, and A. Antonuccio. Lodlive,
[3] B. Abu-Salih. Domain-specific knowledge graphs: a survey. J. exploring the web of data. In Semantic Systems, 2012.
Netw. Comput. Appl., 185:103076, 2021. [28] N. Chakraborty, D. Lukovnikov, G. Maheshwari, P. Trivedi,
[4] A. Abujabal, R. S. Roy, M. Yahya, and G. Weikum. J. Lehmann, and A. Fischer. Introduction to neural
Never-ending learning for open-domain question answering network-based question answering over knowledge graphs.
over knowledge bases. In WWW, 2018. WIREs Data Mining Knowl. Discov., 11(3), 2021.
[5] M. Ali, M. Berrendorf, C. T. Hoyt, L. Vermue, M. Galkin, [29] M. Chen, Y. Tian, K. Chang, S. Skiena, and C. Zaniolo.
S. Sharifzadeh, A. Fischer, V. Tresp, and J. Lehmann. Bringing Co-training embeddings of knowledge graphs and entity
light into the dark: a large-scale evaluation of knowledge descriptions for cross-lingual entity alignment. In IJCAI, 2018.
graph embedding models under a unified framework. IEEE [30] X. Chen, Z. Hu, and Y. Sun. Fuzzy logic based logical query
Trans. Pattern Anal. Mach. Intell., 44(12):8825–8845, 2022. answering on knowledge graphs. In AAAI, 2022.
[6] W. Ali, M. Saleem, B. Yao, A. Hogan, and A. N. Ngomo. A [31] E. I. Chong, S. Das, G. Eadon, and J. Srinivasan. An efficient
survey of RDF stores & SPARQL engines for querying sql-based RDF querying scheme. In VLDB, 2005.
knowledge graphs. VLDB J., 31(3):1–26, 2022. [32] P. Christmann, R. S. Roy, A. Abujabal, J. Singh, and
[7] R. Angles, M. Arenas, P. Barceló, P. A. Boncz, G. H. L. G. Weikum. Look before you hop: conversational question
Fletcher, C. Gutierrez, T. Lindaaker, M. Paradies, S. Plantikow, answering over knowledge graphs using judicious context
J. F. Sequeda, O. van Rest, and H. Voigt. G-CORE: a core for expansion. In CIKM, 2019.
future graph query languages. In SIGMOD, 2018. [33] F. Darari, R. E. Prasojo, and W. Nutt. CORNER: a
[8] R. Angles, M. Arenas, P. Barceló, A. Hogan, J. L. Reutter, and completeness reasoner for SPARQL queries over RDF data
D. Vrgoc. Foundations of modern query languages for graph sources. In The Semantic Web: ESWC Satellite Events, 2014.
databases. ACM Comput. Surv., 50(5):68:1–68:40, 2017. [34] T. Dettmers, P. Minervini, P. Stenetorp, and S. Riedel.
[9] E. Arakelyan, D. Daza, P. Minervini, and M. Cochez. Complex Convolutional 2d knowledge graph embeddings. In AAAI,
query answering with neural link predictors. In ICLR, 2021. 2018.
[10] A. Atserias, M. Grohe, and D. Marx. Size bounds and query [35] A. Deutsch, N. Francis, A. Green, K. Hare, B. Li, L. Libkin,
plans for relational joins. CoRR, abs/1711.03860, 2017. T. Lindaaker, V. Marsault, W. Martens, J. Michels, F. Murlak,
[11] I. Balazevic, C. Allen, and T. M. Hospedales. Tucker: tensor S. Plantikow, P. Selmer, O. van Rest, H. Voigt, D. Vrgoc,
factorization for knowledge graph completion. In M. Wu, and F. Zemke. Graph pattern matching in GQL and
EMNLP-IJCNLP, 2019. SQL/PGQ. In SIGMOD, 2022.
[12] K. Balog and T. Kenter. Personal knowledge graphs: a [36] A. Deutsch, Y. Xu, and M. Wu. Seamless syntactic and
research agenda. In SIGIR, 2019. semantic integration of query primitives over relational and
[13] L. Bellomarini, E. Sallinger, and G. Gottlob. The vadalog graph data in gsql, 2018.
system: datalog-based reasoning for knowledge graphs. [37] A. Deutsch, Y. Xu, M. Wu, and V. E. Lee. Tigergraph: a native
PVLDB, 11(9):975–987, 2018. MPP graph database. CoRR, abs/1901.08248, 2019.
[14] J. Berant, A. Chou, R. Frostig, and P. Liang. Semantic parsing [38] B. Ding, Q. Wang, B. Wang, and L. Guo. Improving
on freebase from question-answer pairs. In EMNLP, 2013. knowledge graph embedding using simple constraints. In ACL,
[15] M. Besta, E. Peter, R. Gerstenberger, M. Fischer, 2018.
M. Podstawski, C. Barthels, G. Alonso, and T. Hoefler. [39] Y. Ding, J. Yu, B. Liu, Y. Hu, M. Cui, and Q. Wu. Mukea:
Demystifying graph databases: analysis and taxonomy of data multimodal knowledge extraction and accumulation for
organization, system designs, and graph queries. CoRR, 2019. knowledge-based visual question answering. In CVPR, 2022.
[16] S. S. Bhowmick and B. Choi. Data-driven visual query [40] H. Du, Z. Le, H. Wang, Y. Chen, and J. Yu. COKG-QA:
interfaces for graphs: past, present, and (near) future. In multi-hop question answering over COVID-19 knowledge
SIGMOD, 2022. graphs. Data Intell., 4(3):471–492, 2022.
[17] S. S. Bhowmick, B. Choi, and C. Li. Graph querying meets [41] K. Elbedweihy, S. N. Wrigley, and F. Ciravegna. Evaluating
HCI: state of the art and future directions. In SIGMOD, 2017. semantic search query approaches with expert and casual
users. In ISWC, 2012.
[42] W. Fan, J. Li, S. Ma, N. Tang, and Y. Wu. Adding regular [68] A. Khan, N. Li, X. Yan, Z. Guan, S. Chakraborty, and S. Tao.
expressions to graph reachability and pattern queries. In ICDE, Neighborhood based fast graph search in large networks. In
2011. SIGMOD, 2011.
[43] W. Fan, J. Li, S. Ma, H. Wang, and Y. Wu. Graph [69] A. Khan, G. Segovia, and D. Kossmann. On smart query
homomorphism revisited for graph matching. PVLDB, routing: for distributed graph querying with decoupled
3(1):1161–1172, 2010. storage. In USENIX ATC, 2018.
[44] W. Fan, X. Wang, and Y. Wu. Answering graph pattern queries [70] A. Khan, Y. Wu, C. C. Aggarwal, and X. Yan. Nema: fast
using views. In ICDE, 2014. graph search with label similarity. PVLDB, 6(3):181–192,
[45] Q. Fang, X. Zhang, J. Hu, X. Wu, , and C. Xu. Contrastive 2013.
multi-modal knowledge graph representation learning. IEEE [71] A. Khan, Y. Ye, and L. Chen. On uncertain graphs. Synthesis
Trans. Knowl. Data Eng., 2022. Lectures on Data Management. Morgan & Claypool
[46] N. Francis, A. Green, P. Guagliardo, L. Libkin, T. Lindaaker, Publishers, 2018.
V. Marsault, S. Plantikow, M. Rydberg, P. Selmer, and [72] J. Lee, W. Han, R. Kasperovics, and J. Lee. An in-depth
A. Taylor. Cypher: an evolving query language for property comparison of subgraph isomorphism algorithms in graph
graphs. In SIGMOD, 2018. databases. Proc. VLDB Endow., 6(2):133–144, 2012.
[47] M. Galkin, Z. Zhu, H. Ren, and J. Tang. Inductive logical [73] J. Lehmann, R. Isele, M. Jakob, A. Jentzsch, D. Kontokostas,
query answering in knowledge graphs. In NeurIPS, 2022. P. N. Mendes, S. Hellmann, M. Morsey, P. v. Kleef, S. Auer,
[48] M. R. Garey and D. S. Johnson. Computers and intractability: and C. Bizer. Dbpedia - a large-scale, multilingual knowledge
a guide to the theory of NP-completeness. W. H. Freeman, base extracted from wikipedia. Semantic Web, 6(2):167–195,
1979. 2015.
[49] G. A. Gesese, R. Biswas, M. Alam, and H. Sack. A survey on [74] A. Lerer, L. Wu, J. Shen, T. Lacroix, L. Wehrstedt, A. Bose,
knowledge graph embeddings with literals: Which model links and A. Peysakhovich. Pytorch-biggraph: a large scale graph
better literal-ly? Semantic Web, 12(4):617–647, 2021. embedding system. In MLSys, 2019.
[50] Y. Guo, Z. Pan, and J. Heflin. Lubm: a benchmark for owl [75] Y. Li, T. Ge, and C. X. Chen. Online indices for predictive
knowledge base systems. J. Web Semant., 3(2-3):158–182, top-k entity and aggregate queries on knowledge graphs. In
2005. ICDE, 2020.
[51] W. L. Hamilton, P. Bajaj, M. Zitnik, D. Jurafsky, and [76] M. Lissandrini, M. Brugnara, and Y. Velegrakis. Beyond
J. Leskovec. Embedding logical queries on knowledge graphs. macrobenchmarks: microbenchmark-based graph database
In NeurIPS, 2018. evaluation. PVLDB, 12(4):390–403, 2018.
[52] L. Han, T. Finin, and A. Joshi. Gorelations: an intuitive query [77] M. Lissandrini, D. Mottin, T. Palpanas, and Y. Velegrakis.
system for dbpedia. In JIST, 2011. Graph-query suggestions for knowledge graph exploration. In
[53] W.-S. Han, J. Lee, M.-D. Pham, and J. X. Yu. iGraph: a WWW, 2020.
framework for comparisons of disk-based graph indexing [78] L. Liu, B. Du, H. Ji, C. Zhai, and H. Tong. Neural-answering
techniques. PVLDB, 3(1–2):449–459, 2010. logical queries on knowledge graphs. In KDD, 2021.
[54] Z. Hu, Y. Xu, W. Yu, S. Wang, Z. Yang, C. Zhu, K. Chang, and [79] L. Liu, B. Du, J. Xu, Y. Xia, and H. Tong. Joint knowledge
Y. Sun. Empowering language models with knowledge graph graph completion and question answering. In KDD, 2022.
reasoning for open-domain question answering. In EMNLP, [80] W. Liu, P. Zhou, Z. Zhao, Z. Wang, Q. Ju, H. Deng, and
2022. P. Wang. K-BERT: enabling language representation with
[55] X. Huang, J. Zhang, D. Li, and P. Li. Knowledge graph knowledge graph. In AAAI, 2020.
embedding based question answering. In WSDM, 2019. [81] X. Liu, S. Zhao, K. Su, Y. Cen, J. Qiu, M. Zhang, W. Wu,
[56] Z. Huang, M. Chiang, and W. Lee. Line: logical query Y. Dong, and J. Tang. Mask and reason: pre-training
reasoning over hierarchical knowledge graphs. In KDD, 2022. knowledge graph transformers for complex logical queries. In
[57] D. Ibragimov, K. Hose, T. B. Pedersen, and E. Zimányi. KDD, 2022.
Optimizing aggregate SPARQL queries using materialized [82] X. Long, L. Zhuang, L. Aodi, S. Wang, and H. Li.
RDF views. In ISWC, 2016. Neural-based mixture probabilistic query embedding for
[58] F. Ilievski, P. A. Szekely, and B. Zhang. CSKG: the answering FOL queries on knowledge graphs. In EMNLP,
commonsense knowledge graph. In ESWC, 2021. 2022.
[59] M. S. Islam, C. Liu, and J. Li. Efficient answering of why-not [83] H. Ma, M. A. Langouri, Y. Wu, F. Chiang, and J. Pi.
questions in similar graph matching. IEEE Trans. Knowl. Data Ontology-based entity matching in attributed graphs. PVLDB,
Eng., 27(10):2672–2686, 2015. 12(10):1195–1207, 2019.
[60] N. Jayaram, A. Khan, C. Li, X. Yan, and R. Elmasri. Querying [84] S. Ma, Y. Cao, W. Fan, J. Huai, and T. Wo. Strong simulation:
knowledge graphs by example entity tuples. IEEE Trans. capturing topology in graph pattern matching. ACM Trans.
Knowl. Data Eng., 27(10):2797–2811, 2015. Database Syst., 39(1):4:1–4:46, 2014.
[61] G. Ji, S. He, L. Xu, K. Liu, and J. Zhao. Knowledge graph [85] A. Mhedhbi and S. Salihoglu. Modern techniques for querying
embedding via dynamic mapping matrix. In ACL, 2015. graph-structured relations: foundations, system
[62] X. Jin, Z. Yang, X. Lin, S. Yang, L. Qin, and Y. Peng. FAST: implementations, and open challenges. PVLDB,
fpga-based subgraph matching on massive graphs. In ICDE, 15(12):3762–3765, 2022.
2021. [86] Microsoft. Sql graph architecture.
[63] J. Kalo, L. Fichtel, P. Ehler, and W. Balke. Knowlybert - https://learn.microsoft.com/en-us/sql/
hybrid query answering over language models and knowledge relational-databases/graphs/
graphs. In ISWC, 2020. sql-graph-architecture?view=sql-server-ver16,
[64] Z. Kaoudi and I. Manolescu. RDF in the clouds: a survey. 2022.
VLDB J., 24(1):67–91, 2015. [87] T. M. Mitchell, W. W. Cohen, E. R. H. Jr., P. P. Talukdar,
[65] F. Katsarou, N. Ntarmos, and P. Triantafillou. Performance and B. Yang, J. Betteridge, A. Carlson, B. D. Mishra, M. Gardner,
scalability of indexed subgraph query processing methods. B. Kisiel, J. Krishnamurthy, N. Lao, K. Mazaitis, T. Mohamed,
Proc. VLDB Endow., 8(12):1566–1577, 2015. N. Nakashole, E. A. Platanios, A. Ritter, M. Samadi,
[66] V. Kepuska and G. Bohouta. Next-generation of virtual B. Settles, R. C. Wang, D. Wijaya, A. Gupta, X. Chen,
personal assistants (microsoft cortana, apple siri, amazon alexa A. Saparov, M. Greaves, and J. Welling. Never-ending
and google home). In CCWC, 2018. learning. Commun. ACM, 61(5):103–115, 2018.
[67] A. Khan and S. Elnikety. Systems for big-graphs. Proc. VLDB [88] J. Mohoney, R. Waleffe, H. Xu, T. Rekatsinas, and
Endow., 7(13):1709–1710, 2014.
S. Venkataraman. Marius: learning massive graph embeddings 2022.
on a single machine. In OSDI, 2021. [115] H. Ren, W. Hu, and J. Leskovec. Query2box: reasoning over
[89] D. Mottin, M. Lissandrini, Y. Velegrakis, and T. Palpanas. knowledge graphs in vector space using box embeddings. In
Exemplar queries: give me an example of what you need. ICLR, 2020.
PVLDB, 7(5):365–376, 2014. [116] H. Ren and J. Leskovec. Beta embeddings for multi-hop
[90] D. Nathani, J. Chauhan, C. Sharma, and M. Kaul. Learning logical reasoning in knowledge graphs. In NeurIPS, 2020.
attention-based embeddings for relation prediction in [117] X. Ren and J. Wang. Multi-query optimization for subgraph
knowledge graphs. In ACL, 2019. isomorphism search. PVLDB, 10(3):121–132, 2016.
[91] R. Navigli and S. P. Ponzetto. Babelnet: building a very large [118] M. A. Rodriguez. The gremlin graph traversal machine and
multilingual semantic network. In ACL, 2010. language (invited talk). In DBPL, 2015.
[92] Neo4J. Why graph databases? [119] R. S. Roy and A. Anand. Question answering over curated and
https://neo4j.com/why-graph-databases/, 2016. open web sources. In SIGIR, 2020.
[93] T. Neumann and G. Weikum. Scalable join processing on very [120] T. Sagi, M. Lissandrini, T. B. Pedersen, and K. Hose. A design
large RDF graphs. In SIGMOD, 2009. space for RDF data representations. VLDB J., 31(2):347–373,
[94] T. Neumann and G. Weikum. The RDF-3X engine for scalable 2022.
management of RDF data. VLDB J., 19(1):91–113, 2010. [121] S. Sahu, A. Mhedhbi, S. Salihoglu, J. Lin, and M. T. Özsu. The
[95] H. Q. Ngo. Worst-case optimal join algorithms: techniques, ubiquity of large graphs and surprising challenges of graph
results, and open problems. In PODS, 2018. processing: extended survey. VLDB J., 29(2-3):595–618, 2020.
[96] D. Q. Nguyen, T. D. Nguyen, D. Q. Nguyen, and D. Q. Phung. [122] S. Sakr, S. Elnikety, and Y. He. G-SPARQL: a hybrid engine
A novel embedding model for knowledge base completion for querying large attributed graphs. In CIKM, 2012.
based on convolutional neural network. In NAACL-HLT, 2018. [123] M. Sarwat, S. Elnikety, Y. He, and M. F. Mokbel. Horton+: a
[97] J. Ni, V. Pandelea, T. Young, H. Zhou, and E. Cambria. Hitkg: distributed system for processing declarative reachability
towards goal-oriented conversations via multi-hierarchy queries over partitioned graphs. PVLDB, 6(14):1918–1929,
learning. In AAAI, 2022. 2013.
[98] M. Nickel, V. Tresp, and H. Kriegel. A three-way model for [124] A. Saxena, A. Kochsiek, and R. Gemulla.
collective learning on multi-relational data. In ICML, 2011. Sequence-to-sequence knowledge graph completion and
[99] M. Nickel, V. Tresp, and H. Kriegel. Factorizing YAGO: question answering. In ACL, 2022.
scalable machine learning for linked data. In WWW, 2012. [125] A. Saxena, A. Tripathi, and P. P. Talukdar. Improving
[100] C. Nikolaou and M. Koubarakis. Querying incomplete multi-hop question answering over knowledge graphs using
information in RDF with SPARQL. Artif. Intell., 237:138–171, knowledge base embeddings. In ACL, 2020.
2016. [126] M. S. Schlichtkrull, T. N. Kipf, P. Bloem, R. v. d. Berg,
[101] N. F. Noy, Y. Gao, A. Jain, A. Narayanan, A. Patterson, and I. Titov, and M. Welling. Modeling relational data with graph
J. Taylor. Industry-scale knowledge graphs: lessons and convolutional networks. In ESWC, 2018.
challenges. Commun. ACM, 62(8):36–43, 2019. [127] M. Schmidt, T. Hornung, G. Lausen, and C. Pinkel. Sp2bench:
[102] Oracle. Pgql 1.5 specification. a sparql performance benchmark. In ICDE, 2009.
https://pgql-lang.org/spec/1.5/, 2022. [128] C. Shi, Y. Li, J. Zhang, Y. Sun, and P. S. Yu. A survey of
[103] A. Pacaci and M. T. Özsu. Experimental analysis of streaming heterogeneous information network analysis. IEEE Trans.
algorithms for graph partitioning. In SIGMOD, 2019. Knowl. Data Eng., 29(1):17–37, 2017.
[104] X. Pan, T. Ye, D. Han, S. Song, and G. Huang. Contrastive [129] A. Singhal. Introducing the knowledge graph: things, not
language-image pre-training with knowledge graphs. In strings. https://blog.google/products/search/
NeurIPS, 2022. introducing-knowledge-graph-things-not/, 2012.
[105] N. Papailiou, D. Tsoumakos, P. Karras, and N. Koziris. [130] C. Sommer. Shortest-path queries in static networks. ACM
Graph-aware, workload-adaptive SPARQL query caching. In Comput. Surv., 46(4):45:1–45:31, 2014.
SIGMOD, 2015. [131] Q. Song, M. H. Namaki, P. Lin, and Y. Wu. Answering
[106] S. Pei, L. Yu, G. Yu, and X. Zhang. Rea: robust cross-lingual why-questions for subgraph queries. IEEE Trans. Knowl. Data
entity alignment between knowledge graphs. In KDD, 2020. Eng., 34(10):4636–4649, 2022.
[107] P. Peng, Q. Ge, L. Zou, M. T. Özsu, Z. Xu, and D. Zhao. [132] R. Speer and C. Havasi. Representing general relational
Optimizing multi-query evaluation in federated RDF systems. knowledge in ConceptNet 5. In LREC, 2012.
IEEE Trans. Knowl. Data Eng., 33(4):1692–1707, 2021. [133] A. Styperek, M. Ciesielczyk, A. Szwabe, and P. Misiorek.
[108] P. Pezeshkpour, L. Chen, and S. Singh. Embedding Evaluation of sparql-compliant semantic search user
multimodal relational data for knowledge base completion. In interfaces. Vietnam. J. Comput. Sci., 2(3):191–199, 2015.
EMNLP, 2018. [134] Y. Su, S. Yang, H. Sun, M. Srivatsa, S. Kase, M. Vanni, and
[109] D. L. Phuoc, H. N. M. Quoc, Q. H. Ngo, T. T. Nhat, and X. Yan. Exploiting relevance feedback in knowledge graph
M. Hauswirth. The graph of things: a step towards the live search. In KDD, 2015.
knowledge graph of connected things. J. Web Semant., [135] F. M. Suchanek, G. Kasneci, and G. Weikum. Yago: a core of
37-38:25–35, 2016. semantic knowledge. In WWW, 2007.
[110] A. Poggi, D. Lembo, D. Calvanese, G. D. Giacomo, [136] H. Sun, A. O. Arnold, T. Bedrax-Weiss, F. Pereira, and W. W.
M. Lenzerini, and R. Rosati. Linking data to ontologies. J. Cohen. Faithful embeddings for knowledge base queries. In
Data Semant., 10:133–173, 2008. NeurIPS, 2020.
[111] A. Quamar, V. Efthymiou, C. Lei, and F. Özcan. Natural [137] R. Sun, X. Cao, Y. Zhao, J. Wan, K. Zhou, F. Zhang, Z. Wang,
language interfaces to data. Found. Trends Databases, and K. Zheng. Multi-modal knowledge graphs for
11(4):319–414, 2022. recommender systems. In CIKM, 2020.
[112] E. Rahm and P. A. Bernstein. A survey of approaches to [138] S. Sun and Q. Luo. In-memory subgraph matching: an
automatic schema matching. VLDB J., 10(4):334–350, 2001. in-depth study. In SIGMOD, page 1083–1098, 2020.
[113] H. Ren, H. Dai, B. Dai, X. Chen, M. Yasunaga, H. Sun, [139] W. Sun, A. Fokoue, K. Srinivas, A. Kementsietsidis, G. Hu,
D. Schuurmans, J. Leskovec, and D. Zhou. LEGO: latent and G. T. Xie. Sqlgraph: an efficient relational-based property
execution-guided reasoning for multi-hop question answering graph store. In SIGMOD, 2015.
on knowledge graphs. In ICML, 2021. [140] Z. Sun, Z. Deng, J. Nie, and J. Tang. Rotate: knowledge graph
[114] H. Ren, H. Dai, B. Dai, X. Chen, D. Zhou, J. Leskovec, and embedding by relational rotation in complex space. In ICLR,
D. Schuurmans. SMORE: knowledge graph completion and 2019.
multi-hop reasoning in massive knowledge graphs. In KDD,
[141] G. Szárnyas, A. Prat-Pérez, A. Averbuch, J. Marton, AAAI, 2016.
M. Paradies, M. Kaufmann, O. Erling, P. A. Boncz, [166] R. Xie, Z. Liu, H. Luan, and M. Sun. Image-embodied
V. Haprian, and J. B. Antal. An early look at the LDBC social knowledge representation learning. In IJCAI, 2017.
network benchmark’s business intelligence workload. In [167] D. Xu, C. Ruan, E. Körpeoglu, S. Kumar, and K. Achan.
GRADES and NDA, 2018. Product knowledge graph embedding for e-commerce. In
[142] A. Talmor and J. Berant. The web as a knowledge-base for WSDM, 2020.
answering complex questions. In NAACL-HLT, 2018. [168] B. Yang, W. Yih, X. He, J. Gao, and L. Deng. Embedding
[143] A. Tchechmedjiev, P. Fafalios, K. Boland, M. Gasquet, entities and relations for learning and inference in knowledge
M. Zloch, B. Zapilko, S. Dietze, and K. Todorov. Claimskg: a bases. In ICLR, 2015.
knowledge graph of fact-checked claims. In ISWC, 2019. [169] D. Yang, P. Qing, Y. Li, H. Lu, and X. Lin. Gammae: gamma
[144] A. Termehchy, M. Winslett, Y. Chodpathumwan, and embeddings for logical queries on knowledge graphs. In
A. Gibbons. Design independent query interfaces. IEEE Trans. EMNLP, 2022.
Knowl. Data Eng., 24(10):1819–1832, 2012. [170] J. Yang, W. Yao, and W. Zhang. Keyword search on large
[145] Y. Tian. The world of graph databases from an industry graphs: a survey. Data Sci. Eng., 6(2):142–162, 2021.
perspective. SIGMOD Rec., 51(4):60–67, 2022. [171] S. Yang, Y. Wu, H. Sun, and X. Yan. Schemaless and
[146] P. Trivedi, G. Maheshwari, M. Dubey, and J. Lehmann. structureless graph querying. PVLDB, 7(7):565–576, 2014.
Lc-quad: a corpus for complex question answering over [172] Z. Yang, P. Qi, S. Zhang, Y. Bengio, W. W. Cohen,
knowledge graphs. In ISWC, 2017. R. Salakhutdinov, and C. D. Manning. Hotpotqa: a dataset for
[147] Y. Tuan, Y. Chen, and H. Lee. Dykgchat: benchmarking diverse, explainable multi-hop question answering. In
dialogue generation grounding on dynamic knowledge graphs. EMNLP, 2018.
In EMNLP-IJCNLP, 2019. [173] J. Yao, B. Cui, L. Hua, and Y. Huang. Keyword query
[148] Y. Tzitzikas, N. Manolis, and P. Papadakos. Faceted reformulation on structured data. In ICDE, 2012.
exploration of RDF/S datasets: a survey. J. Intell. Inf. Syst., [174] M. Yasunaga, A. Bosselut, H. Ren, X. Zhang, C. D. Manning,
48(2):329–364, 2017. P. Liang, and J. Leskovec. Deep bidirectional
[149] S. Vashishth, S. Sanyal, V. Nitin, and P. P. Talukdar. language-knowledge graph pretraining. In NeurIPS, 2022.
Composition-based multi-relational graph convolutional [175] D. Yu, C. Zhu, Y. Yang, and M. Zeng. Jaket: joint pre-training
networks. In ICLR, 2020. of knowledge graph and language understanding. In AAAI,
[150] E. Vasilyeva, M. Thiele, C. Bornhövd, and W. Lehner. 2022.
Answering "why empty?" and "why so many?" queries in [176] M. Zaib, W. E. Zhang, Q. Z. Sheng, A. Mahmood, and
graph databases. J. Comput. Syst. Sci., 82(1):3–22, 2016. Y. Zhang. Conversational question answering: a survey.
[151] D. Vrandečić and M. Krötzsch. Wikidata: a free collaborative Knowl. Inf. Syst., 64(12):3151–3195, 2022.
knowledgebase. Commun. ACM, 57(10):78–85, 2014. [177] L. Zeng, L. Zou, M. T. Özsu, L. Hu, and F. Zhang. GSI:
[152] L. Wang, W. Zhao, Z. Wei, and J. Liu. Simkgc: simple gpu-friendly subgraph isomorphism. In ICDE, 2020.
contrastive knowledge graph completion with pre-trained [178] M. Zhang, R. Dai, M. Dong, and T. He. Drlk: dynamic
language models. In ACL, 2022. hierarchical reasoning with language model and knowledge
[153] M. Wang, S. Wang, H. Yang, Z. Zhang, X. Chen, and G. Qi. Is graph for question answering. In EMNLP, 2022.
visual context really helpful for knowledge graph? a [179] W. Zhang, J. Chen, J. Li, Z. Xu, J. Z. Pan, and H. Chen.
representation learning perspective. In MM, 2021. Knowledge graph reasoning with logics and embeddings:
[154] X. Wang, T. Gao, Z. Zhu, Z. Zhang, Z. Liu, J. Li, and J. Tang. survey and perspective. CoRR, abs/2202.07412, 2022.
Kepler: a unified model for knowledge embedding and [180] W. Zhang, B. Paudel, L. Wang, J. Chen, H. Zhu, W. Zhang,
pre-trained language representation. Trans. Assoc. Comput. A. Bernstein, and H. Chen. Iteratively learning embeddings
Linguistics, 9:176–194, 2021. and rules for knowledge graph reasoning. In WWW, 2019.
[155] Y. Wang, A. Khan, T. Wu, J. Jin, and H. Yan. Semantic guided [181] X. Zhang, L. Chen, Y. Tong, and M. Wang. EAGRE: towards
and response times bounded top-k similarity search over scalable I/O efficient SPARQL query evaluation on the cloud.
knowledge graphs. In ICDE, 2020. In ICDE, 2013.
[156] Y. Wang, A. Khan, X. Xu, J. Jin, Q. Hong, and T. Fu. [182] Z. Zhang, J. Wang, J. Chen, S. Ji, and F. Wu. Cone: cone
Aggregate queries on knowledge graphs: fast approximation embeddings for multi-hop reasoning over knowledge graphs.
with semantic-aware sampling. In ICDE, 2022. In NeurIPS, 2021.
[157] Y. Wang, A. Khan, X. Xu, S. Ye, S. Pan, and Y. Zhou. [183] D. Zheng, X. Song, C. Ma, Z. Tan, Z. Ye, J. Dong, H. Xiong,
Approximate and interactive processing of aggregate queries Z. Zhang, and G. Karypis. Dgl-ke: training knowledge graph
on knowledge graphs: a demonstration. In CIKM, 2022. embeddings at scale. In SIGIR, 2020.
[158] Y. Wang, Y. Li, J. Fan, C. Ye, and M. Chai. A survey of typical [184] W. Zheng, J. X. Yu, L. Zou, and H. Cheng. Question
attributed graph queries. World Wide Web, 24(1):297–346, answering over knowledge graphs: question understanding via
2021. template decomposition. Proc. VLDB Endow.,
[159] Z. Wang, L. Li, Q. Li, and D. Zeng. Multimodal data enhanced 11(11):1373–1386, 2018.
representation learning for knowledge graphs. In IJCNN, 2019. [185] W. Zheng, L. Zou, W. Peng, X. Yan, S. Song, and D. Zhao.
[160] Z. Wang, J. Zhang, J. Feng, and Z. Chen. Knowledge graph Semantic SPARQL similarity search over RDF knowledge
embedding by translating on hyperplanes. In AAAI, 2014. graphs. PVLDB, 9(11):840–851, 2016.
[161] G. Weikum, X. L. Dong, S. Razniewski, and F. M. Suchanek. [186] H. Zhou, T. Young, M. Huang, H. Zhao, J. Xu, and X. Zhu.
Machine knowledge: creation and curation of comprehensive Commonsense knowledge aware conversation generation with
knowledge bases. Found. Trends Databases, 10(2-4):108–490, graph attention. In IJCAI, 2018.
2021. [187] Q. Zhou, C. Wang, M. Xiong, H. Wang, and Y. Yu. SPARK:
[162] K. Wilkinson, C. Sayers, H. A. Kuno, and D. Reynolds. adapting keyword query to semantic search. In ISWC.
Efficient RDF storage and retrieval in jena2. In SWDB, 2003. Springer, 2007.
[163] P. T. Wood. Query languages for graph databases. SIGMOD [188] Z. Zhu, M. Galkin, Z. Zhang, and J. Tang. Neural-symbolic
Rec., 41(1):50–60, 2012. models for logical queries on knowledge graphs. In ICML,
[164] M. Wylot, M. Hauswirth, P. Cudré-Mauroux, and S. Sakr. Rdf 2022.
data storage and query processing schemes: a survey. ACM [189] L. Zou, R. Huang, H. Wang, J. X. Yu, W. He, and D. Zhao.
Comput. Surv., 51(4), 2018. Natural language question answering over RDF: a graph data
[165] R. Xie, Z. Liu, J. Jia, H. Luan, and M. Sun. Representation driven approach. In SIGMOD, 2014.
learning of knowledge graphs with entity descriptions. In