%0 Conference Paper %B Complex Networks & Their Applications X %D 2022 %T From Mean-Field to Complex Topologies: Network Effects on the Algorithmic Bias Model %A Valentina Pansanella %A Giulio Rossetti %A Letizia Milli %B Complex Networks & Their Applications X %G eng %0 Journal Article %J IEEE Intelligent SystemsIEEE Intelligent Systems %D 2021 %T Conformity: a Path-Aware Homophily measure for Node-Attributed Networks %A Giulio Rossetti %A Salvatore Citraro %A Letizia Milli %X Unveil the homophilic/heterophilic behaviors that characterize the wiring patterns of complex networks is an important task in social network analysis, often approached studying the assortative mixing of node attributes. Recent works underlined that a global measure to quantify node homophily necessarily provides a partial, often deceiving, picture of the reality. Moving from such literature, in this work, we propose a novel measure, namely Conformity, designed to overcome such limitation by providing a node-centric quantification of assortative mixing patterns. Differently from the measures proposed so far, Conformity is designed to be path-aware, thus allowing for a more detailed evaluation of the impact that nodes at different degrees of separations have on the homophilic embeddedness of a target. Experimental analysis on synthetic and real data allowed us to observe that Conformity can unveil valuable insights from node-attributed graphs. %B IEEE Intelligent SystemsIEEE Intelligent Systems %P 1 - 1 %8 2021 %@ 1941-1294 %G eng %U https://ieeexplore.ieee.org/document/9321348 %! IEEE Intelligent Systems %R https://doi.org/10.1109/MIS.2021.3051291 %0 Journal Article %J arXiv preprint arXiv:2012.05195 %D 2020 %T Conformity: A Path-Aware Homophily Measure for Node-Attributed Networks %A Giulio Rossetti %A Salvatore Citraro %A Letizia Milli %B arXiv preprint arXiv:2012.05195 %G eng %0 Conference Paper %B International Conference on Complex Networks and Their Applications %D 2020 %T Opinion Dynamic Modeling of Fake News Perception %A Toccaceli, Cecilia %A Letizia Milli %A Giulio Rossetti %X Fake news diffusion represents one of the most pressing issues of our online society. In recent years, fake news has been analyzed from several points of view, primarily to improve our ability to separate them from the legit ones as well as identify their sources. Among such vast literature, a rarely discussed theme is likely to play uttermost importance in our understanding of such a controversial phenomenon: the analysis of fake news’ perception. In this work, we approach such a problem by proposing a family of opinion dynamic models tailored to study how specific social interaction patterns concur to the acceptance, or refusal, of fake news by a population of interacting individuals. To discuss the peculiarities of the proposed models, we tested them on several synthetic network topologies, thus underlying when/how they affect the stable states reached by the performed simulations. %B International Conference on Complex Networks and Their Applications %I Springer %G eng %U https://link.springer.com/chapter/10.1007/978-3-030-65347-7_31 %R https://doi.org/10.1007/978-3-030-65347-7_31 %0 Journal Article %J arXiv preprint arXiv:2011.05606 %D 2020 %T UTLDR: an agent-based framework for modeling infectious diseases and public interventions %A Giulio Rossetti %A Letizia Milli %A Salvatore Citraro %A Morini, Virginia %B arXiv preprint arXiv:2011.05606 %G eng %0 Journal Article %J Applied Network Science %D 2019 %T CDLIB: a python library to extract, compare and evaluate communities from complex networks %A Giulio Rossetti %A Letizia Milli %A Cazabet, Rémy %X Community Discovery is among the most studied problems in complex network analysis. During the last decade, many algorithms have been proposed to address such task; however, only a few of them have been integrated into a common framework, making it hard to use and compare different solutions. To support developers, researchers and practitioners, in this paper we introduce a python library - namely CDlib - designed to serve this need. The aim of CDlib is to allow easy and standardized access to a wide variety of network clustering algorithms, to evaluate and compare the results they provide, and to visualize them. It notably provides the largest available collection of community detection implementations, with a total of 39 algorithms. %B Applied Network Science %V 4 %P 52 %8 2019/07/29 %@ 2364-8228 %G eng %U https://link.springer.com/article/10.1007/s41109-019-0165-9 %! Applied Network Science %R https://doi.org/10.1007/s41109-019-0165-9 %0 Conference Paper %B International Conference on Complex Networks and Their Applications %D 2019 %T Community-Aware Content Diffusion: Embeddednes and Permeability %A Letizia Milli %A Giulio Rossetti %B International Conference on Complex Networks and Their Applications %I Springer %G eng %0 Journal Article %J Applied network science %D 2018 %T Active and passive diffusion processes in complex networks %A Letizia Milli %A Giulio Rossetti %A Dino Pedreschi %A Fosca Giannotti %X Ideas, information, viruses: all of them, with their mechanisms, spread over the complex social information, viruses: all tissues described by our interpersonal relations. Usually, to simulate and understand the unfolding of such complex phenomena are used general mathematical models; these models act agnostically from the object of which they simulate the diffusion, thus considering spreading of virus, ideas and innovations alike. Indeed, such degree of abstraction makes it easier to define a standard set of tools that can be applied to heterogeneous contexts; however, it can also lead to biased, incorrect, simulation outcomes. In this work we introduce the concepts of active and passive diffusion to discriminate the degree in which individuals choice affect the overall spreading of content over a social graph. Moving from the analysis of a well-known passive diffusion schema, the Threshold model (that can be used to model peer-pressure related processes), we introduce two novel approaches whose aim is to provide active and mixed schemas applicable in the context of innovations/ideas diffusion simulation. Our analysis, performed both in synthetic and real-world data, underline that the adoption of exclusively passive/active models leads to conflicting results, thus highlighting the need of mixed approaches to capture the real complexity of the simulated system better. %B Applied network science %V 3 %P 42 %G eng %U https://link.springer.com/article/10.1007/s41109-018-0100-5 %R https://doi.org/10.1007/s41109-018-0100-5 %0 Conference Paper %B International Conference on Complex Networks CompleNet %D 2018 %T Diffusive Phenomena in Dynamic Networks: a data-driven study %A Letizia Milli %A Giulio Rossetti %A Dino Pedreschi %A Fosca Giannotti %X Everyday, ideas, information as well as viruses spread over complex social tissues described by our interpersonal relations. So far, the network contexts upon which diffusive phenomena unfold have usually considered static, composed by a fixed set of nodes and edges. Recent studies describe social networks as rapidly changing topologies. In this work – following a data-driven approach – we compare the behaviors of classical spreading models when used to analyze a given social network whose topological dynamics are observed at different temporal-granularities. Our goal is to shed some light on the impacts that the adoption of a static topology has on spreading simulations as well as to provide an alternative formulation of two classical diffusion models. %B International Conference on Complex Networks CompleNet %I Springer %C Boston March 5-8 2018 %G eng %U https://link.springer.com/chapter/10.1007/978-3-319-73198-8_13 %R 10.1007/978-3-319-73198-8_13 %0 Journal Article %J International Journal of Data Science and Analytics %D 2018 %T NDlib: a python library to model and analyze diffusion processes over complex networks %A Giulio Rossetti %A Letizia Milli %A S Rinzivillo %A Alina Sirbu %A Dino Pedreschi %A Fosca Giannotti %X Nowadays the analysis of dynamics of and on networks represents a hot topic in the social network analysis playground. To support students, teachers, developers and researchers, in this work we introduce a novel framework, namely NDlib, an environment designed to describe diffusion simulations. NDlib is designed to be a multi-level ecosystem that can be fruitfully used by different user segments. For this reason, upon NDlib, we designed a simulation server that allows remote execution of experiments as well as an online visualization tool that abstracts its programmatic interface and makes available the simulation platform to non-technicians. %B International Journal of Data Science and Analytics %V 5 %P 61–79 %G eng %U https://link.springer.com/article/10.1007/s41060-017-0086-6 %R 10.1007/s41060-017-0086-6 %0 Journal Article %J PloS one %D 2017 %T Forecasting success via early adoptions analysis: A data-driven study %A Giulio Rossetti %A Letizia Milli %A Fosca Giannotti %A Dino Pedreschi %X Innovations are continuously launched over markets, such as new products over the retail market or new artists over the music scene. Some innovations become a success; others don’t. Forecasting which innovations will succeed at the beginning of their lifecycle is hard. In this paper, we provide a data-driven, large-scale account of the existence of a special niche among early adopters, individuals that consistently tend to adopt successful innovations before they reach success: we will call them Hit-Savvy. Hit-Savvy can be discovered in very different markets and retain over time their ability to anticipate the success of innovations. As our second contribution, we devise a predictive analytical process, exploiting Hit-Savvy as signals, which achieves high accuracy in the early-stage prediction of successful innovations, far beyond the reach of state-of-the-art time series forecasting models. Indeed, our findings and predictive model can be fruitfully used to support marketing strategies and product placement. %B PloS one %V 12 %P e0189096 %G eng %0 Conference Paper %B International Workshop on Complex Networks and their Applications %D 2017 %T Information diffusion in complex networks: The active/passive conundrum %A Letizia Milli %A Giulio Rossetti %A Dino Pedreschi %A Fosca Giannotti %X Ideas, information, viruses: all of them, with their mechanisms, can spread over the complex social tissues described by our interpersonal relations. Classical spreading models can agnostically from the object of which they simulate the diffusion, thus considering spreading of virus, ideas and innovations alike. Indeed, such simplification makes easier to define a standard set of tools that can be applied to heterogeneous contexts; however, it can also lead to biased, partial, simulation outcomes. In this work we discuss the concepts of active and passive diffusion: moving from analysis of a well-known passive model, the Threshold one, we introduce two novel approaches whose aim is to provide active and mixed schemas applicable in the context of innovations/ideas diffusion simulation. Our data-driven analysis shows how, in such context, the adoption of exclusively passive/active models leads to conflicting results, thus highlighting the need of mixed approaches. %B International Workshop on Complex Networks and their Applications %I Springer %G eng %U https://link.springer.com/chapter/10.1007/978-3-319-72150-7_25 %R 10.1007/978-3-319-72150-7_25 %0 Journal Article %J International Journal of Data Science and Analytics %D 2017 %T NDlib: a python library to model and analyze diffusion processes over complex networks %A Giulio Rossetti %A Letizia Milli %A S Rinzivillo %A Alina Sirbu %A Dino Pedreschi %A Fosca Giannotti %X Nowadays the analysis of dynamics of and on networks represents a hot topic in the social network analysis playground.To support students, teachers, developers and researchers, in this work we introduce a novel framework, namely NDlib, an environment designed to describe diffusion simulations. NDlib is designed to be a multi-level ecosystem that can be fruitfully used by different user segments. For this reason, upon NDlib, we designed a simulation server that allows remote execution of experiments as well as an online visualization tool that abstracts its programmatic interface and makes available the simulation platform to non-technicians. %B International Journal of Data Science and Analytics %P 1–19 %G eng %0 Conference Paper %B IEEE International Conference on Data Science and Advanced Analytics, DSA %D 2017 %T NDlib: Studying Network Diffusion Dynamics %A Giulio Rossetti %A Letizia Milli %A S Rinzivillo %A Alina Sirbu %A Dino Pedreschi %A Fosca Giannotti %X Nowadays the analysis of diffusive phenomena occurring on top of complex networks represents a hot topic in the Social Network Analysis playground. In order to support students, teachers, developers and researchers in this work we introduce a novel simulation framework, ND LIB . ND LIB is designed to be a multi-level ecosystem that can be fruitfully used by different user segments. Upon the diffusion library, we designed a simulation server that allows remote execution of experiments and an online visualization tool that abstract the programmatic interface and makes available the simulation platform to non-technicians. %B IEEE International Conference on Data Science and Advanced Analytics, DSA %C Tokyo %G eng %U https://ieeexplore.ieee.org/abstract/document/8259774 %R https://doi.org/10.1109/DSAA.2017.6 %0 Conference Paper %B International Conference on Data Science and Advanced Analytics (IEEE DSAA'2015) %D 2015 %T Quantification in Social Networks %A Letizia Milli %A Anna Monreale %A Giulio Rossetti %A Dino Pedreschi %A Fosca Giannotti %A Fabrizio Sebastiani %X In many real-world applications there is a need to monitor the distribution of a population across different classes, and to track changes in this distribution over time. As an example, an important task is to monitor the percentage of unemployed adults in a given region. When the membership of an individual in a class cannot be established deterministically, a typical solution is the classification task. However, in the above applications the final goal is not determining which class the individuals belong to, but estimating the prevalence of each class in the unlabeled data. This task is called quantification. Most of the work in the literature addressed the quantification problem considering data presented in conventional attribute format. Since the ever-growing availability of web and social media we have a flourish of network data representing a new important source of information and by using quantification network techniques we could quantify collective behavior, i.e., the number of users that are involved in certain type of activities, preferences, or behaviors. In this paper we exploit the homophily effect observed in many social networks in order to construct a quantifier for networked data. Our experiments show the effectiveness of the proposed approaches and the comparison with the existing state-of-the-art quantification methods shows that they are more accurate. %B International Conference on Data Science and Advanced Analytics (IEEE DSAA'2015) %I IEEE %C Paris, France %G eng %U http://www.giuliorossetti.net/about/wp-content/uploads/2015/12/main_DSAA.pdf %R 10.1109/DSAA.2015.7344845 %0 Conference Paper %B 47th SIS Scientific Meeting of the Italian Statistica Society %D 2014 %T Use of mobile phone data to estimate mobility flows. Measuring urban population and inter-city mobility using big data in an integrated approach %A Barbara Furletti %A Lorenzo Gabrielli %A Fosca Giannotti %A Letizia Milli %A Mirco Nanni %A Dino Pedreschi %? Roberta Vivio %? Giuseppe Garofalo %X The Big Data, originating from the digital breadcrumbs of human activi- ties, sensed as a by-product of the technologies that we use for our daily activities, let us to observe the individual and collective behavior of people at an unprecedented detail. Many dimensions of our social life have big data “proxies”, as the mobile calls data for mobility. In this paper we investigate to what extent such ”big data”, in integration with administrative ones, could be a support in producing reliable and timely estimates of inter-city mobility. The study has been jointly developed by Is- tat, CNR, University of Pisa in the range of interest of the “Commssione di studio avente il compito di orientare le scelte dellIstat sul tema dei Big Data ”. In an on- going project at ISTAT, called “Persons and Places” – based on an integration of administrative data sources, it has been produced a first release of Origin Destina- tion matrix – at municipality level – assuming that the places of residence and that of work (or study) be the terminal points of usual individual mobility for work or study. The coincidence between the city of residence and that of work (or study) – is considered as a proxy of the absence of intercity mobility for a person (we define him a static resident). The opposite case is considered as a proxy of presence of mo- bility (the person is a dynamic resident: commuter or embedded). As administrative data do not contain information on frequency of the mobility, the idea is to specify an estimate method, using calling data as support, to define for each municipality the stock of standing residents, embedded city users and daily city users (commuters) %B 47th SIS Scientific Meeting of the Italian Statistica Society %C Cagliari %8 06/2014 %@ 978-88-8467-874-4 %U http://www.sis2014.it/proceedings/allpapers/3026.pdf %0 Conference Paper %B 2013 IEEE 13th International Conference on Data Mining, Dallas, TX, USA, December 7-10, 2013 %D 2013 %T Quantification Trees %A Letizia Milli %A Anna Monreale %A Giulio Rossetti %A Fosca Giannotti %A Dino Pedreschi %A Fabrizio Sebastiani %X In many applications there is a need to monitor how a population is distributed across different classes, and to track the changes in this distribution that derive from varying circumstances, an example such application is monitoring the percentage (or "prevalence") of unemployed people in a given region, or in a given age range, or at different time periods. When the membership of an individual in a class cannot be established deterministically, this monitoring activity requires classification. However, in the above applications the final goal is not determining which class each individual belongs to, but simply estimating the prevalence of each class in the unlabeled data. This task is called quantification. In a supervised learning framework we may estimate the distribution across the classes in a test set from a training set of labeled individuals. However, this may be sub optimal, since the distribution in the test set may be substantially different from that in the training set (a phenomenon called distribution drift). So far, quantification has mostly been addressed by learning a classifier optimized for individual classification and later adjusting the distribution it computes to compensate for its tendency to either under-or over-estimate the prevalence of the class. In this paper we propose instead to use a type of decision trees (quantification trees) optimized not for individual classification, but directly for quantification. Our experiments show that quantification trees are more accurate than existing state-of-the-art quantification methods, while retaining at the same time the simplicity and understandability of the decision tree framework. %B 2013 IEEE 13th International Conference on Data Mining, Dallas, TX, USA, December 7-10, 2013 %P 528–536 %U http://dx.doi.org/10.1109/ICDM.2013.122 %R 10.1109/ICDM.2013.122