Home

Our Lab

The Knowledge Discovery and Data Mining Laboratory (KDD Lab) is a joint research initiative of ISTI Institute of CNR and the Department of Computer Science of the University of Pisa.

The objective of the research unit is the development of theory, techniques and systems for extracting and delivering useful knowledge out of large masses of data.

Today, knowledge discovery and data mining is both a technology that blends data analysis methods with sophisticated algorithms for processing large data sets, and an active research field that aims at developing new data analysis methods for novel forms of data. On one side, classification, clustering and pattern discovery tools are now part of mature data analysis and Business Intelligence systems and have been successfully applied to problems in various commercial and scientific domains. On the other side, the increasing heterogeneity and complexity of the new forms of data – such as those arriving from medicine, biology, the Web, the Earth observation systems, the mobility data arriving from wireless networks – call for new forms of patterns and models, together with new algorithms to discover such patterns and models efficiently.

In this context, the mission of the KDD laboratory is to pursue fundamental research, strategic applications and higher education.

Research

It was 1999 when we approached data mining research field. Our exploration of the world of Data is still continuing...

Excellent expertise has been gained thanks to the involvement in several EU projects, as GeoPKDD(www.geopkdd.eu) and MODAP (www.modap.eu). A concrete recent achievement is the realization of the system M-ATLAS as an platform to support the mobility knowledge discovery process, from data preprocessing, to data mining to semantic enrichment and patterns interpretation.

The main research track of KDDLab in the field of Complex Networks is Multidimensional Network Analysis. Traditionally, Complex Network Analysis has been monodimensional: researchers focused their attention to network with a single kind of relation represented. KDDLab is pushing the research over multidimensional networks, i.e. network with multiple kind of relations, since they are a better model to represent the complexity in reality (transportation, infrastructure and social networks are often multidimensional).

In the era of Big Data the opportunities of discovering knowledge from social big data increase with the risk of privacy and discrimination violation. However, big data analytics and fairness are not necessarily enemies. Sometimes many practical and impactful services based on big data analytics can be designed in such a way that the quality of results can coexist with discrimination and privacy protection. The solution is the application of the privacy-by-design and discrimination-by-design principles.

One of the most pressing and fascinating challenges scientists face today, is understanding the complexity of our globally interconnected society. The big data arising from the digital breadcrumbs of human activities promise to let us scrutinize the ground truth of individual and collective behaviour at an unprecedented detail and scale. There is an urgent need to harness these opportunities for scientific advancement and for the social good. The main obstacle to this accomplishment, besides the scarcity of data scientists, is the lack of a large-scale open infrastructure, where big data and social mining research can be carried out.

DyNo, 3rd International Workshop on Dynamics in and of Networks @ ECML PKDD

Start:
2017/09/18 09:00 Europe/Rome
End:
2017/09/22 18:00 Europe/Rome
Location:
Skopje, Macedonia
Link:
International Workshop on Dynamics in and of Networks
Description:

The purpose of this workshop is to encourage principled research that will lead to the advancement of the social science in time-evolving networks. The workshop will seek top- quality submissions addressing important topics such as: dynamic network modeling, time- aware network mining approaches, social influence spreading, diffusion processes in dynamic networks and forecast of network topology perturbation.

KNOWMe: 1st International Workshop on Knowledge Discovery from Mobility and Transportation Systems

Start:
2017/09/22 09:00 Europe/Rome
End:
2017/09/22 09:00 Europe/Rome
Location:
Alexander Palace, Skopje, Macedonia
Link:
KNOWledge Discovery from Mobility and Transportation Systems
Description:

The recent technological advances on telecommunications create a new reality on mobility sensing. Nowadays, we live in an era where ubiquitous digital devices are able to broadcast rich information about human mobility in real-time and at a high rate. Such fact exponentially increased the availability of large-scale mobility data which has been popularized in the media as the new currency, fueling the future vision of our smart cities that will transform our lives. The reality is that we just began to recognize significant research challenges across a spectrum of topics.

DAPS2017: Data mining for the Analysis of Performance and Success

Start:
2017/11/18 09:00 Europe/Rome
End:
2017/11/18 09:00 Europe/Rome
Location:
New Orleans, USA
Link:
DAPS 2017
Description:

The increasing availability of Big Data, able to capture diverse collective phenomena, provides an unprecedented opportunity to explore the patterns underlying success. From the strategies followed by successful sportsmen to the emergence of runaway videos on YouTube, from popularity in social media to rising starts in the scientific enterprise, from widespread technologies to groundbreaking innovations, there is wealth of data that can be explored to answer common questions: How can we measure performance? What are the common patterns of success?

Data analysis & Social Mining for the Interconnected Society @ 3rd EAI International Conference on Smart Objects and Technologies for Social Good

Start:
2017/11/29 09:00 Europe/Rome
End:
2017/11/30 18:00 Europe/Rome
Location:
Department of Computer Science, University of Pisa, Largo B. Pontecorvo 3, Pisa, Italy
Link:
Data analysis & Social Mining for the Interconnected Society
Description:

The rapid growth of the Internet and the Web, the speed with which global communication and trade now takes place, and the fast spreading around the world of news and information as well as epidemics, trends, financial crises and social: these are all signals that mankind has entered a new era, a new techno-social ecosystem whose inner mechanisms are different from before, and largely unveiled.

Conference on Fairness, Accountability, and Transparency FAT*

Start:
2018/02/23 09:00 Europe/Rome
End:
2018/02/24 18:00 Europe/Rome
Location:
New York City, USA
Link:
FAT* Conference
Description:

Algorithmic systems are being adopted in a growing number of contexts. Fueled by big data, these systems filter, sort, score, recommend, personalize, and otherwise shape human experiences of socio-technical systems. Although these systems bring myriad benefits, they also contain inherent risks, such as codifying and entrenching biases; reducing accountability and hindering due process; and increasing the information assymmetry between data producers and data holders.

Publications

Fun Facts


3
Gigabytes of data produced by a single person each year
3100
Millions of Internet users
500
Millions of Tweets sent per day
2300
Gigabytes of Internet traffic per day

Contacts

Need info? Want ideas? Write us!

Address @ ISTI

KDD Lab
Istituto di Scienza e Tecnologie dell’Informazione
Area della Ricerca CNR
via G. Moruzzi 1
56124 Pisa, Italy

Address @ UniPi

KDD Lab
Dipartimento di Informatica
Università di Pisa
Largo B. Pontecorvo 3
56127 Pisa, Italy

Phone Number

Phone: +39 050 621 3013
Fax: +39 050 315 2040

Email

kddlab-info@isti.cnr.it

Largo B. Pontecorvo 3
56127 Pisa
Italy
via Moruzzi 1
56124 Pisa
Italy