%0 Conference Paper %B Discovery Science %D 2023 %T EXPHLOT: EXplainable Privacy assessment for Human LOcation Trajectories %A Francesca Naretto %A Roberto Pellungrini %A Daniele Fadda %A Salvo Rinzivillo %B Discovery Science %G eng %0 Journal Article %J IEEE Transactions on Intelligent Transportation SystemsIEEE Transactions on Intelligent Transportation Systems %D 2020 %T Modeling Adversarial Behavior Against Mobility Data Privacy %A Roberto Pellungrini %A Luca Pappalardo %A F. Simini %A Anna Monreale %X Privacy risk assessment is a crucial issue in any privacy-aware analysis process. Traditional frameworks for privacy risk assessment systematically generate the assumed knowledge for a potential adversary, evaluating the risk without realistically modelling the collection of the background knowledge used by the adversary when performing the attack. In this work, we propose Simulated Privacy Annealing (SPA), a new adversarial behavior model for privacy risk assessment in mobility data. We model the behavior of an adversary as a mobility trajectory and introduce an optimization approach to find the most effective adversary trajectory in terms of privacy risk produced for the individuals represented in a mobility data set. We use simulated annealing to optimize the movement of the adversary and simulate a possible attack on mobility data. We finally test the effectiveness of our approach on real human mobility data, showing that it can simulate the knowledge gathering process for an adversary in a more realistic way. %B IEEE Transactions on Intelligent Transportation SystemsIEEE Transactions on Intelligent Transportation Systems %P 1 - 14 %8 2020 %@ 1558-0016 %G eng %U https://ieeexplore.ieee.org/abstract/document/9199893 %! IEEE Transactions on Intelligent Transportation Systems %R https://doi.org/10.1109/TITS.2020.3021911 %0 Conference Paper %B Discovery Science %D 2020 %T Predicting and Explaining Privacy Risk Exposure in Mobility Data %A Francesca Naretto %A Roberto Pellungrini %A Anna Monreale %A Nardini, Franco Maria %A Musolesi, Mirco %E Appice, Annalisa %E Tsoumakas, Grigorios %E Manolopoulos, Yannis %E Matwin, Stan %X Mobility data is a proxy of different social dynamics and its analysis enables a wide range of user services. Unfortunately, mobility data are very sensitive because the sharing of people’s whereabouts may arise serious privacy concerns. Existing frameworks for privacy risk assessment provide tools to identify and measure privacy risks, but they often (i) have high computational complexity; and (ii) are not able to provide users with a justification of the reported risks. In this paper, we propose expert, a new framework for the prediction and explanation of privacy risk on mobility data. We empirically evaluate privacy risk on real data, simulating a privacy attack with a state-of-the-art privacy risk assessment framework. We then extract individual mobility profiles from the data for predicting their risk. We compare the performance of several machine learning algorithms in order to identify the best approach for our task. Finally, we show how it is possible to explain privacy risk prediction on real data, using two algorithms: Shap, a feature importance-based method and Lore, a rule-based method. Overall, expert is able to provide a user with the privacy risk and an explanation of the risk itself. The experiments show excellent performance for the prediction task. %B Discovery Science %I Springer International Publishing %C Cham %8 2020// %@ 978-3-030-61527-7 %G eng %U https://link.springer.com/chapter/10.1007/978-3-030-61527-7_27 %R https://doi.org/10.1007/978-3-030-61527-7_27 %0 Conference Paper %B ECML PKDD 2020 Workshops %D 2020 %T Prediction and Explanation of Privacy Risk on Mobility Data with Neural Networks %A Francesca Naretto %A Roberto Pellungrini %A Nardini, Franco Maria %A Fosca Giannotti %E Koprinska, Irena %E Kamp, Michael %E Appice, Annalisa %E Loglisci, Corrado %E Antonie, Luiza %E Zimmermann, Albrecht %E Riccardo Guidotti %E Özgöbek, Özlem %E Ribeiro, Rita P. %E Gavaldà, Ricard %E Gama, João %E Adilova, Linara %E Krishnamurthy, Yamuna %E Ferreira, Pedro M. %E Malerba, Donato %E Medeiros, Ibéria %E Ceci, Michelangelo %E Manco, Giuseppe %E Masciari, Elio %E Ras, Zbigniew W. %E Christen, Peter %E Ntoutsi, Eirini %E Schubert, Erich %E Zimek, Arthur %E Anna Monreale %E Biecek, Przemyslaw %E S Rinzivillo %E Kille, Benjamin %E Lommatzsch, Andreas %E Gulla, Jon Atle %X The analysis of privacy risk for mobility data is a fundamental part of any privacy-aware process based on such data. Mobility data are highly sensitive. Therefore, the correct identification of the privacy risk before releasing the data to the public is of utmost importance. However, existing privacy risk assessment frameworks have high computational complexity. To tackle these issues, some recent work proposed a solution based on classification approaches to predict privacy risk using mobility features extracted from the data. In this paper, we propose an improvement of this approach by applying long short-term memory (LSTM) neural networks to predict the privacy risk directly from original mobility data. We empirically evaluate privacy risk on real data by applying our LSTM-based approach. Results show that our proposed method based on a LSTM network is effective in predicting the privacy risk with results in terms of F1 of up to 0.91. Moreover, to explain the predictions of our model, we employ a state-of-the-art explanation algorithm, Shap. We explore the resulting explanation, showing how it is possible to provide effective predictions while explaining them to the end-user. %B ECML PKDD 2020 Workshops %I Springer International Publishing %C Cham %8 2020// %@ 978-3-030-65965-3 %G eng %U https://link.springer.com/chapter/10.1007/978-3-030-65965-3_34 %R https://doi.org/10.1007/978-3-030-65965-3_34 %0 Conference Paper %B Companion of The 2019 World Wide Web Conference, WWW 2019, San Francisco, CA, USA, May 13-17, 2019. %D 2019 %T Human Mobility from theory to practice: Data, Models and Applications %A Luca Pappalardo %A Gianni Barlacchi %A Roberto Pellungrini %A Filippo Simini %X The inclusion of tracking technologies in personal devices opened the doors to the analysis of large sets of mobility data like GPS traces and call detail records. This tutorial presents an overview of both modeling principles of human mobility and machine learning models applicable to specific problems. We review the state of the art of five main aspects in human mobility: (1) human mobility data landscape; (2) key measures of individual and collective mobility; (3) generative models at the level of individual, population and mixture of the two; (4) next location prediction algorithms; (5) applications for social good. For each aspect, we show experiments and simulations using the Python library ”scikit-mobility” developed by the presenters of the tutorial. %B Companion of The 2019 World Wide Web Conference, WWW 2019, San Francisco, CA, USA, May 13-17, 2019. %G eng %U https://doi.org/10.1145/3308560.3320099 %R 10.1145/3308560.3320099 %0 Conference Paper %B ECML PKDD 2018 Workshops %D 2019 %T Privacy Risk for Individual Basket Patterns %A Roberto Pellungrini %A Anna Monreale %A Riccardo Guidotti %E Alzate, Carlos %E Anna Monreale %E Bioglio, Livio %E Bitetta, Valerio %E Bordino, Ilaria %E Caldarelli, Guido %E Ferretti, Andrea %E Riccardo Guidotti %E Gullo, Francesco %E Pascolutti, Stefano %E Pensa, Ruggero G. %E Robardet, Céline %E Squartini, Tiziano %X Retail data are of fundamental importance for businesses and enterprises that want to understand the purchasing behaviour of their customers. Such data is also useful to develop analytical services and for marketing purposes, often based on individual purchasing patterns. However, retail data and extracted models may also provide very sensitive information to possible malicious third parties. Therefore, in this paper we propose a methodology for empirically assessing privacy risk in the releasing of individual purchasing data. The experiments on real-world retail data show that although individual patterns describe a summary of the customer activity, they may be successful used for the customer re-identifiation. %B ECML PKDD 2018 Workshops %I Springer International Publishing %C Cham %8 2019// %@ 978-3-030-13463-1 %G eng %U https://link.springer.com/chapter/10.1007/978-3-030-13463-1_11 %R https://doi.org/10.1007/978-3-030-13463-1_11 %0 Conference Paper %B Software Technologies: Applications and Foundations - STAF 2018 Collocated Workshops, Toulouse, France, June 25-29, 2018, Revised Selected Papers %D 2018 %T Analyzing Privacy Risk in Human Mobility Data %A Roberto Pellungrini %A Luca Pappalardo %A Francesca Pratesi %A Anna Monreale %X Mobility data are of fundamental importance for understanding the patterns of human movements, developing analytical services and modeling human dynamics. Unfortunately, mobility data also contain individual sensitive information, making it necessary an accurate privacy risk assessment for the individuals involved. In this paper, we propose a methodology for assessing privacy risk in human mobility data. Given a set of individual and collective mobility features, we define the minimum data format necessary for the computation of each feature and we define a set of possible attacks on these data formats. We perform experiments computing the empirical risk in a real-world mobility dataset, and show how the distributions of the considered mobility features are affected by the removal of individuals with different levels of privacy risk. %B Software Technologies: Applications and Foundations - STAF 2018 Collocated Workshops, Toulouse, France, June 25-29, 2018, Revised Selected Papers %G eng %U https://doi.org/10.1007/978-3-030-04771-9_10 %R 10.1007/978-3-030-04771-9_10 %0 Conference Paper %B Personal Analytics and Privacy. An Individual and Collective Perspective - First International Workshop, PAP 2017, Held in Conjunction with ECML PKDD 2017, Skopje, Macedonia, September 18, 2017, Revised Selected Papers %D 2017 %T Assessing Privacy Risk in Retail Data %A Roberto Pellungrini %A Francesca Pratesi %A Luca Pappalardo %X Retail data are one of the most requested commodities by commercial companies. Unfortunately, from this data it is possible to retrieve highly sensitive information about individuals. Thus, there exists the need for accurate individual privacy risk evaluation. In this paper, we propose a methodology for assessing privacy risk in retail data. We define the data formats for representing retail data, the privacy framework for calculating privacy risk and some possible privacy attacks for this kind of data. We perform experiments in a real-world retail dataset, and show the distribution of privacy risk for the various attacks. %B Personal Analytics and Privacy. An Individual and Collective Perspective - First International Workshop, PAP 2017, Held in Conjunction with ECML PKDD 2017, Skopje, Macedonia, September 18, 2017, Revised Selected Papers %G eng %U https://doi.org/10.1007/978-3-319-71970-2_3 %R 10.1007/978-3-319-71970-2_3 %0 Journal Article %J ACM Trans. Intell. Syst. Technol. %D 2017 %T A Data Mining Approach to Assess Privacy Risk in Human Mobility Data %A Roberto Pellungrini %A Luca Pappalardo %A Francesca Pratesi %A Anna Monreale %X Human mobility data are an important proxy to understand human mobility dynamics, develop analytical services, and design mathematical models for simulation and what-if analysis. Unfortunately mobility data are very sensitive since they may enable the re-identification of individuals in a database. Existing frameworks for privacy risk assessment provide data providers with tools to control and mitigate privacy risks, but they suffer two main shortcomings: (i) they have a high computational complexity; (ii) the privacy risk must be recomputed every time new data records become available and for every selection of individuals, geographic areas, or time windows. In this article, we propose a fast and flexible approach to estimate privacy risk in human mobility data. The idea is to train classifiers to capture the relation between individual mobility patterns and the level of privacy risk of individuals. We show the effectiveness of our approach by an extensive experiment on real-world GPS data in two urban areas and investigate the relations between human mobility patterns and the privacy risk of individuals. %B ACM Trans. Intell. Syst. Technol. %V 9 %P 31:1–31:27 %G eng %U http://doi.acm.org/10.1145/3106774 %R 10.1145/3106774 %0 Generic %D 2017 %T Fast Estimation of Privacy Risk in Human Mobility Data %A Roberto Pellungrini %A Luca Pappalardo %A Francesca Pratesi %A Anna Monreale %X Mobility data are an important proxy to understand the patterns of human movements, develop analytical services and design models for simulation and prediction of human dynamics. Unfortunately mobility data are also very sensitive, since they may contain personal information about the individuals involved. Existing frameworks for privacy risk assessment enable the data providers to quantify and mitigate privacy risks, but they suffer two main limitations: (i) they have a high computational complexity; (ii) the privacy risk must be re-computed for each new set of individuals, geographic areas or time windows. In this paper we explore a fast and flexible solution to estimate privacy risk in human mobility data, using predictive models to capture the relation between an individual’s mobility patterns and her privacy risk. We show the effectiveness of our approach by experimentation on a real-world GPS dataset and provide a comparison with traditional methods. %@ 978-3-319-66283-1 %G eng %R 10.1007/978-3-319-66284-8_35