DAKKAR

DAta BenchmarK for Keyword-based Access and Retrieval


DAKKAR:
A deeper look into the project


DAKKAR is a Starting Grants project sponsored by the University of Padua and the Fondazione Cassa di Risparmio di Padova e di Rovigo.

The project aims to promote the development and evolution of user-oriented, keyword-based search systems for structured data by defining and implementing large, open, public, and sustainable evaluation activities.

Keyword Search (KWS) Systems Code

We propose the design of a novel unified framework that supports both the reproduction of existing graph-based keyword search systems over relational data and the implementation of new ones. The design of the framework is based on the pioneering systems BANKS, BANKS-II, and DPBF. Along with the framework, we provide an open-source Java implementation of a core module.

We have chosen the JGraphT library to implement the graph data structure. JGraphT is designed to scale to millions of vertices and edges.

The current implementation works with PostgreSQL databases, but the code can be easily extended to handle other relational DBMSs.
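
To give a feel for what a graph-based keyword search over relational data does, here is a minimal sketch in plain Java. It is not the DAKKAR code and it does not use JGraphT or PostgreSQL: tuples become nodes, foreign-key links become undirected, unweighted edges, and a breadth-first expansion from the tuples matching each keyword finds the tuple that connects all keywords at the smallest total distance. This only loosely follows the backward-expansion idea of BANKS, which works with weighted, directed edges. The class name, method names, and toy data are all hypothetical.

```java
import java.util.*;

/** Simplified, BANKS-inspired keyword search over a tuple graph (illustrative only). */
class KeywordSearchSketch {

    // Tuple id -> lower-cased text of the tuple (hypothetical toy data).
    private final Map<String, String> text = new LinkedHashMap<>();
    // Adjacency list; foreign-key links are stored in both directions (an assumption).
    private final Map<String, Set<String>> adj = new LinkedHashMap<>();

    void addTuple(String id, String content) {
        text.put(id, content.toLowerCase(Locale.ROOT));
        adj.computeIfAbsent(id, k -> new LinkedHashSet<>());
    }

    void addForeignKey(String from, String to) {
        adj.computeIfAbsent(from, k -> new LinkedHashSet<>()).add(to);
        adj.computeIfAbsent(to, k -> new LinkedHashSet<>()).add(from);
    }

    /** Breadth-first distances from every tuple whose text contains the keyword. */
    Map<String, Integer> distancesFrom(String keyword) {
        String kw = keyword.toLowerCase(Locale.ROOT);
        Map<String, Integer> dist = new HashMap<>();
        Deque<String> queue = new ArrayDeque<>();
        for (Map.Entry<String, String> e : text.entrySet()) {
            if (e.getValue().contains(kw)) {
                dist.put(e.getKey(), 0);
                queue.add(e.getKey());
            }
        }
        while (!queue.isEmpty()) {
            String u = queue.poll();
            for (String v : adj.get(u)) {
                if (!dist.containsKey(v)) {
                    dist.put(v, dist.get(u) + 1);
                    queue.add(v);
                }
            }
        }
        return dist;
    }

    /** Tuple reachable from all keywords with the smallest summed distance, if any. */
    Optional<String> bestRoot(List<String> keywords) {
        List<Map<String, Integer>> all = new ArrayList<>();
        for (String k : keywords) {
            all.add(distancesFrom(k));
        }
        String best = null;
        int bestCost = Integer.MAX_VALUE;
        for (String node : text.keySet()) {
            int cost = 0;
            boolean reachable = true;
            for (Map<String, Integer> d : all) {
                Integer x = d.get(node);
                if (x == null) { reachable = false; break; }
                cost += x;
            }
            // Ties are broken by insertion order of the tuples.
            if (reachable && cost < bestCost) { bestCost = cost; best = node; }
        }
        return Optional.ofNullable(best);
    }

    /** Toy database: two authors, two papers, and the join tuples linking them. */
    static KeywordSearchSketch toyGraph() {
        KeywordSearchSketch g = new KeywordSearchSketch();
        g.addTuple("a1", "Alice Rossi");
        g.addTuple("a2", "Bob Bianchi");
        g.addTuple("p1", "Keyword search over relational data");
        g.addTuple("p2", "Graph indexing");
        g.addTuple("w1", "");
        g.addTuple("w2", "");
        g.addTuple("w3", "");
        g.addForeignKey("w1", "a1"); g.addForeignKey("w1", "p1");
        g.addForeignKey("w2", "a2"); g.addForeignKey("w2", "p1");
        g.addForeignKey("w3", "a1"); g.addForeignKey("w3", "p2");
        return g;
    }

    public static void main(String[] args) {
        // prints Optional[p1]: the paper co-authored by both matching authors
        System.out.println(toyGraph().bestRoot(List.of("rossi", "bianchi", "keyword")));
    }
}
```

In a fuller implementation the answer would be the whole tree connecting the root to one matching tuple per keyword, and the plain maps used here would be replaced by a graph library and a database loader.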

Go to the GIT repository




Project reports

  • V. Cozza. Graph-based Keyword Search Systems: the design of a unified framework. Submitted for publication to SIGMOD Record; currently under review.
  • V. Cozza. Graph-based Keyword Search Systems: open-source implementation of a unified framework. Technical Report.

DAKKAR Publications


    1. A. Badan, L. Benvegnù, M. Biasetton, G. Bonato, A. Brighente, S. Marchesin, A. Minetto, L. Pellegrina, A. Purpura, R. Simionato, M. Tessarotto, A. Tonon, and N. Ferro. Keyword-based access to relational data: To reproduce, or to not reproduce? Proc. 25th Italian Symposium on Advanced Database Systems (SEBD 2017), pages 166-177.
    2. A. Badan, L. Benvegnù, M. Biasetton, G. Bonato, A. Brighente, A. Cenzato, P. Ceron, G. Cogato, S. Marchesin, A. Minetto, L. Pellegrina, A. Purpura, R. Simionato, N. Soleti, M. Tessarotto, A. Tonon, F. Vendramin, and N. Ferro. Towards open-source shared implementations of keyword-based access systems to relational data. Proc. 1st International Workshop on Keyword-Based Access and Ranking at Scale (KARS 2017) - Proc. of the Workshops of the EDBT/ICDT 2017 Joint Conference (EDBT/ICDT 2017). CEUR Workshop Proceedings (CEUR-WS.org), Vol. 1810, ISSN 1613-0073.
    3. Y. E. Ioannidis, J. Stoyanovich, G. Orsi, I. Y. Song, P. Marcel, R. Martoglia, W. Penzo, F. Mandreoli, R. De Virgilio, V. De Antonellis, D. Bianchini, D. Kotzinos, V. Christophides, C. Nikolaou, Y. Theodoridis, N. Ferro, F. Guerra, Z. Ives, G. Silvello, and M. Theobald, editors (2017). Proc. 1st International Workshop on Keyword-Based Access and Ranking at Scale (KARS 2017) - Proc. of the Workshops of the EDBT/ICDT 2017 Joint Conference (EDBT/ICDT 2017). CEUR Workshop Proceedings (CEUR-WS.org), Vol. 1810, ISSN 1613-0073.
    4. C. Grana and L. Baraldi, editors. Proc. Digital Libraries and Archives: 13th Italian Research Conference on Digital Libraries, IRCDL 2017, Modena, Italy, January 26-27, 2017, Revised Selected Papers.
    5. V. Cozza, M. Petrocchi, and A. Spognardi (2018). Mining implicit data association from Tripadvisor hotel reviews. Proc. 2nd International Workshop on Data Analytics solutions for Real-LIfe APplications (DARLI-AP 2018) - Proc. of the Workshops of the EDBT/ICDT 2018 Joint Conference (EDBT/ICDT 2018). CEUR-WS.org, Vol. 2083, ISSN 1613-0073.
    6. N. Ferro and L. Sinico (2018). Graph Databases Benchmarking on the Italian Business Register. In S. Bergamaschi, T. Di Noia, and A. Maurino, editors, Proc. 26th Italian Symposium on Advanced Database Systems (SEBD 2018). CEUR Workshop Proceedings (CEUR-WS.org), Vol. 2161, ISSN 1613-0073.
    7. O. Alonso and G. Silvello. DESIRES: Design of Experimental Search & Information Retrieval Systems. Proceedings of the First Biennial Conference on Design of Experimental Search & Information Retrieval Systems, CEUR Workshop Proceedings 2167. Bertinoro, Italy, August 28-31, 2018.
    8. D. Dosso. Keyword Search on RDF Graphs. Proceedings of the First Biennial Conference on Design of Experimental Search & Information Retrieval Systems, CEUR Workshop Proceedings 2167. Bertinoro, Italy, August 28-31, 2018.
    9. V. Cozza, V. T. Hoang, M. Petrocchi, and R. De Nicola. Transparency in Keyword Faceted Search: an investigation on Google Shopping. In Proc. 15th Italian Research Conference on Digital Libraries (IRCDL 2019). Communications in Computer and Information Science book series (CCIS, volume 988), Springer, Heidelberg, Germany, 2019.
    10. M. Agosti, E. Fabris and G. Silvello. On Synergies between Information Retrieval and Digital Libraries. In Proc. 15th Italian Research Conference on Digital Libraries (IRCDL 2019). Communications in Computer and Information Science book series (CCIS, volume 988), Springer, Heidelberg, Germany, 2019.


    DAKKAR Events


    The DAKKAR team co-organized the Keyword-based Access and Ranking at Scale (KARS) Workshop, co-located with the 20th EDBT/ICDT 2017 Joint Conference, on March 21, 2017, in San Servolo, Venice, Italy.

    Participants


    The DAKKAR project is conducted by Maristella Agosti, Vittoria Cozza, Nicola Ferro, and Gianmaria Silvello of the Information Management Systems Research Group, Department of Information Engineering, University of Padua.

    The DAKKAR project is mainly conducted at the University of Padua, in collaboration with the Department of Engineering "Enzo Ferrari", University of Modena and Reggio Emilia, and the Department of Computer, Control, and Management Engineering "Antonio Ruberti", Sapienza University of Rome.