In this paper we introduce GraphDBLP, a tool that models the DBLP bibliography as a graph, and enriches the DBLP data through semantic keyword similarities computed via word-embedding. GraphDBLP has been implemented on top of the Neo4j graph-database, and it can be queried through the Cypher query language. We also provide three meaningful queries for exploring the DBLP community to (i) investigate author profiles by analysing their publication records; (ii) identify the most prolific authors on a given topic,and (iii) perform social network analyses over the whole community. GraphDBLP is available on Github. To date, it contains 5+ million nodes and 24+ million relationships, enabling users to explore the DBLP data by referencing more than 3.3 million publications, 1.7 million authors and more than 5 thousand publication venues. Thanks to the use of word-embedding, more than 7.5 thousand keywords and related similarity values were collected.

GraphDBLP Released: Querying the Computer Scientists Network as a Graph / Cesarini, Mirko; Mercorio, Fabio; Mezzanzanica, Mario; Moscato, Vincenzo; Picariello, Antonio. - 2161:(2018). (Intervento presentato al convegno 26th Italian Symposium on Advanced Database Systems, SEBD 2018 tenutosi a Castellaneta Marina (Taranto, Italia) nel 24-27, June 2018).

GraphDBLP Released: Querying the Computer Scientists Network as a Graph

Mezzanzanica, Mario;Moscato, Vincenzo;Picariello, Antonio
2018

Abstract

In this paper we introduce GraphDBLP, a tool that models the DBLP bibliography as a graph, and enriches the DBLP data through semantic keyword similarities computed via word-embedding. GraphDBLP has been implemented on top of the Neo4j graph-database, and it can be queried through the Cypher query language. We also provide three meaningful queries for exploring the DBLP community to (i) investigate author profiles by analysing their publication records; (ii) identify the most prolific authors on a given topic,and (iii) perform social network analyses over the whole community. GraphDBLP is available on Github. To date, it contains 5+ million nodes and 24+ million relationships, enabling users to explore the DBLP data by referencing more than 3.3 million publications, 1.7 million authors and more than 5 thousand publication venues. Thanks to the use of word-embedding, more than 7.5 thousand keywords and related similarity values were collected.
2018
GraphDBLP Released: Querying the Computer Scientists Network as a Graph / Cesarini, Mirko; Mercorio, Fabio; Mezzanzanica, Mario; Moscato, Vincenzo; Picariello, Antonio. - 2161:(2018). (Intervento presentato al convegno 26th Italian Symposium on Advanced Database Systems, SEBD 2018 tenutosi a Castellaneta Marina (Taranto, Italia) nel 24-27, June 2018).
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/748572
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact