Our Lab

The Knowledge Discovery and Data Mining Laboratory (KDD Lab) is a joint research initiative of the ISTI Institute of CNR and the Department of Computer Science of the University of Pisa.

The objective of the research unit is the development of theory, techniques and systems for extracting and delivering useful knowledge out of large masses of data.

Today, knowledge discovery and data mining is both a technology that blends data analysis methods with sophisticated algorithms for processing large data sets, and an active research field that aims to develop new data analysis methods for novel forms of data. On one side, classification, clustering and pattern discovery tools are now part of mature data analysis and Business Intelligence systems and have been successfully applied to problems in various commercial and scientific domains. On the other side, the increasing heterogeneity and complexity of new forms of data – such as those arriving from medicine, biology, the Web, Earth observation systems and the mobility data collected from wireless networks – call for new forms of patterns and models, together with new algorithms to discover such patterns and models efficiently.

In this context, the mission of the KDD laboratory is to pursue fundamental research, strategic applications and higher education.

Research

We first approached the data mining research field in 1999. Our exploration of the world of data is still continuing...

We have gained considerable expertise through our involvement in several EU projects, such as GeoPKDD (www.geopkdd.eu) and MODAP (www.modap.eu). A concrete recent achievement is the realization of the M-ATLAS system as a platform to support the mobility knowledge discovery process, from data preprocessing to data mining to semantic enrichment and pattern interpretation.

The main research track of KDDLab in the field of Complex Networks is Multidimensional Network Analysis. Traditionally, Complex Network Analysis has been monodimensional: researchers focused their attention on networks in which a single kind of relation is represented. KDDLab is pushing research towards multidimensional networks, i.e. networks with multiple kinds of relations, since they are a better model of the complexity of reality (transportation, infrastructure and social networks are often multidimensional).

In the era of Big Data, the opportunities for discovering knowledge from social big data grow together with the risks of privacy violation and discrimination. However, big data analytics and fairness are not necessarily enemies. Many practical and impactful services based on big data analytics can be designed in such a way that the quality of results coexists with protection against discrimination and privacy violation. The solution is the application of the privacy-by-design and discrimination-by-design principles.

One of the most pressing and fascinating challenges scientists face today is understanding the complexity of our globally interconnected society. The big data arising from the digital breadcrumbs of human activities promise to let us scrutinize the ground truth of individual and collective behaviour at an unprecedented level of detail and scale. There is an urgent need to harness these opportunities for scientific advancement and for the social good. The main obstacle to this accomplishment, besides the scarcity of data scientists, is the lack of a large-scale open infrastructure where big data and social mining research can be carried out.

Satellite on Quantifying Success at NetSci 2019

Start: 2019/05/27 02:00 Europe/Rome
End: 2019/05/31 02:00 Europe/Rome
Location: Burlington, Vermont, USA
Link: Quantifying Success - Satellite at NetSci 2019
Description:

During the past few years, the increasing availability of large-scale datasets that capture activities in scientific publications, patents, grant proposals, sports, enterprises and social media has created an unprecedented opportunity to explore the patterns underlying success.

BIGSSS Summer Schools in Computational Social Science on Migration

Start: 2019/06/10 02:00 Europe/Rome
End: 2019/06/21 02:00 Europe/Rome
Location: Cagliari, Italy
Link: BIGSSS-CSS Summer School on Migration
Description:

The BIGSSS-CSS Summer School on Migration will take place from June 10 to 21, 2019, and will be hosted by the University of Cagliari in Sardinia (Italy).

Summer School on Large Scale Text and Social Media Analytics with GATE

Start: 2019/06/17 02:00 Europe/Rome
End: 2019/06/21 02:00 Europe/Rome
Location: Sheffield, UK
Link: 12th GATE Training Course: Large Scale Text and Social Media Analytics with GATE
Description:

The GATE training course will be held from 17 to 21 June 2019 at the University of Sheffield, UK. Early bird registration is available at a reduced rate before 1 May.

Lipari School on Computational Complex and Social Systems - Data Science

Start: 2019/07/19 02:00 Europe/Rome
End: 2019/07/25 02:00 Europe/Rome
Location: Lipari, Italy
Link: Lipari School on Computational Complex and Social Systems
Description:

This summer school will provide opportunities to gain experience with modern data analysis, in particular Big Data analytics, including topics on mining data in the Internet of Things. Our main and special guest lecturers, who are recognized authorities in the field, will address these topics with a focus on algorithms, computational models and practical results.

European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases ECML PKDD

Start: 2019/09/16 02:00 Europe/Rome
End: 2019/09/20 02:00 Europe/Rome
Location: Würzburg, Germany
Link: ECML PKDD 2019
Description:

The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases will take place in Würzburg, Germany, from the 16th to the 20th of September 2019.

This event is the premier European machine learning and data mining conference and builds upon over 17 years of successful events and conferences held across Europe.

IEEE International Conference on Data Science and Advanced Analytics DSAA 2019

Start: 2019/10/05 02:00 Europe/Rome
End: 2019/10/08 02:00 Europe/Rome
Location: Washington DC, USA
Link: The 6th IEEE International Conference on Data Science and Advanced Analytics
Description:

The IEEE International Conference on Data Science and Advanced Analytics (DSAA) features strong interdisciplinary synergy between statistics (via ASA), computing and information/intelligence sciences (IEEE and ACM), and cross-domain interactions between academia and business for data science. DSAA sets a high standard for its organizing committee, keynote speeches, and submissions to the main conference and special sessions, and maintains a competitive paper acceptance rate.

IEEE International Conference on Data Mining ICDM 2019

Start: 2019/11/08 01:00 Europe/Rome
End: 2019/11/11 01:00 Europe/Rome
Location: Beijing, China
Link: IEEE International Conference on Data Mining
Description:

Aims and Scope

Publications

Fun Facts


3 gigabytes of data produced by a single person each year
3,100 million Internet users
500 million Tweets sent per day
2,300 gigabytes of Internet traffic per day

Contacts

Need info? Want ideas? Write to us!

Address @ ISTI

KDD Lab
Istituto di Scienza e Tecnologie dell’Informazione
Area della Ricerca CNR
via G. Moruzzi 1
56124 Pisa, Italy

Address @ UniPi

KDD Lab
Dipartimento di Informatica
Università di Pisa
Largo B. Pontecorvo 3
56127 Pisa, Italy

Phone Number

Phone: +39 050 621 3013
Fax: +39 050 315 2040

Email

kddlab-info@isti.cnr.it