Ride Sharing & Data Privacy: How Data Handling Affects the Willingness to Disclose Personal Information

Findings Digital services like ride sharing rely heavily on personal data, e


Questions
Privacy is important when an individual interacts with others as their social circles and boundaries affect the perceived privacy (Page, Kobsa, and Knijnenburg 2012;Squicciarini et al. 2014;Jagadish 2020). In the sharing economy -a term for (digital) peer-to-peer marketplaces where individuals share their private property with others -communicating and meeting with unknown persons is a common interaction. Individuals have varying motivations for sharing (Schor et al. 2016;Milanova and Maas 2017) and there are numerous marketplaces which offer services for sharing, for example, apartment rentals and shared rides. Privacy is a significant factor when individuals decide about participating in such a shared marketplace (Ranzini et al. 2017). Despite the importance of information privacy and its regulation, e.g. (General Data Protection Regulation 2016), providers of such marketplaces rarely provide information to the participants regarding the privacy of their personal data, which is a central issue for any digital service. We investigate how this lack of transparency could affect user's participation. Therefore, we previously analyzed twelve ride sharing services offered in Germany, Austria, and Switzerland to identify their data practices in regards to data privacy and privacy-related features (Hesselmann, Gertheiss, and Müller 2021). Table 1 shows the collected and exposed (towards other users) data attributes, e.g. name or address, across the analyzed ride sharing services, illustrating the diversity of the data practices and lack of standardization  Number of data attributes collected and exposed by ride sharing services: (1) bessermitfahren.de, (2) blablacar.de, (3) clickapoint.com, (4) e-carpooling.ch, (5) fahrgemeinschaft.de, (6) foahstmit.at, (7) greendrive.at, (8) mifaz.de, (9) mitfahrangebot.de, (10) mitfahrportal.de, (11) pendlerportal.de, (12)  among services. The results of the analysis served as basis for the online survey in which we asked participants to state their willingness to disclose personal information under varying privacy conditions. We defined the following research questions for our survey:

Methods
The survey was advertised within the Clausthal University of Technology as well as topic-and field-related mail channels in Germany and includes 127 complete responses. Due to the chosen recruitment methods for our study, participants are predominantly young adults which also form the most common age group to participate in shared rides (Schoesslers 2016; European Federation for Transport and Environment 2020). Most participants live in Germany (89%). 61% of participants are male and 72% report prior experience with ride sharing.
We included a set of 15 data attributes, mixed with common and rare data from our prior analysis of ride sharing services, and combined this with four conditions: • To what degree does the exposure of personal information towards other users influence an individual's willingness to disclose it?
• How strongly is the willingness to disclose personal information influenced by the availability of privacy settings (e.g. change exposure of certain information to other users)?
• To what extent does the validation of the authenticity of personal information influence an individual's willingness to disclose personal information?
1 Instead of focusing on privacy concerns (which are often reported to be high but rarely influence the actual disclosure behavior of an individual significantly (Lutz et al. 2018)), we asked participants to rate fairness and sensitivity of data disclosure in the context of ride sharing, as these factors are good predictors of actual disclosure behavior (Malheiros, Preibusch, and Sasse 2013). We conducted this as a within-subject study, i.e. all participants rated every condition. Based on Malheiros, Preibusch, and Sasse (2013), fairness was measured with ratings ranging from very unfair (0) to very fair (+5) and sensitivity was measured as the tendency of data disclosure from very unlikely (0) to very likely (+5). To account for ordering effects, the sequence in which the questions were presented to the participant was randomized. An example of a survey question is displayed in Figure 1.

Findings
We used the condition (i.) personal information is visible for other users as a baseline to compare with the other conditions as it was the most common in our analysis of ride sharing services (Hesselmann, Gertheiss, and Müller 2021). Thus, we conducted three comparisons A, B, and C: The p-values are obtained from paired (two-sided) t-tests. To account for multiple testing, we used the Bonferroni-Holm-Method (Holm 1979 in Table 2, with positive sign indicating a favorable rating for the latter condition of the respective comparison. The corresponding p-value is given in parentheses below the difference in mean rating.
The differences in comparison A indicate that users prefer their information to be made accessible to other users, e.g. for the data attributes profile picture, car type/model, and car picture, and are more inclined to disclose the information. However, for the remaining data attributes users are equally or less inclined to disclose their information as mean ratings are equal or in favor of information being not visible. In contrast to this, the results of comparison B reveal a onesided preference in favor of profile settings. A look at the mean ratings shows that profile settings increase perceived fairness and/or the likelihood of data disclosure for all 15 data attributes. Comparison C displays that information validation has varying effects on the user. It affects the perceived fairness and likelihood in a positive way for bank details and in a negative way for first name.
When looking at the data attributes across the conducted comparisons, gender and employer show only minimal differences in mean ratings. The attribute social media is an interesting result as participants strongly prefer to validate their profile but are reluctant to share it with other users. The opposite effect is seen for first name; the mean values are almost identical across comparison A and B but change strongly for C, which shows that participants have little concern to disclose their name but are reluctant to validate it via a copy of their ID. A look at the p-values shows that 54 hypotheses were rejected, indicating significant differences in 60% of the conducted comparisons.
Overall the results of this preliminary study are promising for the direction in which our work is heading and we will continue our efforts and improve upon the small data set with additional experiments ideally seconding our approach.