PRIVACY POLICY INFERENCE OF USER-UPLOADED
IMAGES ON CONTENT SHARING SITES
Abstract—With the increasing volume of images users share through social sites,
maintaining privacy has become a major problem, as demonstrated by a recent
wave of publicized incidents where users inadvertently shared personal
information. In light of these incidents, the need of tools to help users control
access to their shared content is apparent. Toward addressing this need, we propose
an Adaptive Privacy Policy Prediction (A3P) system to help users compose privacy
settings for their images. We examine the role of social context, image content, and
metadata as possible indicators of users’ privacy preferences. We propose a two-
level framework which according to the user’s available history on the site,
determines the best available privacy policy for the user’s images being uploaded.
Our solution relies on an image classification framework for image categories
which may be associated with similar policies, and on a policy prediction
algorithm to automatically generate a policy for each newly uploaded image, also
according to users’ social features. Over time, the generated policies will follow
the evolution of users’ privacy attitude. We provide the results of our extensive
evaluation over 5,000 policies, which demonstrate the effectiveness of our system,
with prediction accuracies over 90 percent.
EXISTING SYSTEM:
Several recent works have studied how to automate the task of privacy settings
Bonneau et al. proposed the concept of privacy suites which recommend to users a
suite of privacy settings that n“expert” users or other trusted friends have already
set, so that normal users can either directly choose a setting or only need to do
minor modification. Similarly, Danezis proposed a machine-learning based
approach to automatically extract privacy settings from the social context within
which the data is produced. Parallel to the work of Danezis, Adu-Oppong et al.
develop privacy settings based on a concept of “Social Circles” which consist of
clusters of friends formed by partitioning users’ friend lists. Ravichandran et al.
studied how to predict a user’s privacy preferences for location-based data (i.e.,
share her location or not) based on location and time of day. Fang et al. proposed a
privacy wizard to help users grant privileges to their friends. The wizard asks users
to first assign privacy labels to selected friends, and then uses this as input to
construct a classifier which classifies friends based on their profiles and
automatically assign privacy labels to the unlabeled friends. More recently,
Klemperer et al. studied whether the keywords and captions with which users tag
their photos can be used to help users more intuitively create and maintain access-
control policies. Their findings are inline with our approach: tags created for
organizational purposes can be repurposed to help create reasonably accurate
access-controlrules.
PROPOSED SYSTEM:
We propose an Adaptive Privacy Policy Prediction (A3P) system which aims to
provide users a hassle free privacy settings experience by automatically generating
personalized policies. The A3P system handles user uploaded images, and factors
in the following criteria that influence one’s privacy settings of images: The
impact of social environment and personal characteristics. Social context of users,
such as their profile information and relationships with others may provide useful
information regarding users’ privacy preferences. For example, users interested in
photography may like to share their photos with other amateur photographers.
Users who have several family members among their social contacts may share
with them pictures related to family events. However, using common policies
across all users or across users with similar traits may be too simplistic and not
satisfy individual preferences. Users may have drastically different opinions even
on the same type of images. For example, a privacy adverse person may be willing
to share all his personal images while a more conservative person may just want to
share personal images with his family members. In light of these considerations, it
is important to find the balancing point between the impact of social environment
and users’ individual characteristics in order to predict the policies that match each
individual’s needs. Moreover, individuals may change their overall attitude toward
privacy as time passes. In order to develop a personalized policy recommendation
system, such changes on privacy opinions should be carefully considered. The role
of image’s content and metadata. In general, similar images often incur similar
privacy preferences, especially when people appear in the images. For example,
one may upload several photos of his kids and specify that only his family
members are allowed to see these photos. He may upload some other photos of
landscapes which he took as a hobby and for these photos, he may set privacy
preference allowing anyone to view and comment the photos. Analyzing the visual
content may not be sufficient to capture users’ privacy preferences. Tags and other
metadata are indicative of the social context of the image, including where it was
taken and why , and also provide a synthetic description of images, complementing
the information obtained from visual content analysis.
Module 1
A3P-CORE
There are two major components in A3P-core: (i) Image classification and (ii)
Adaptive policy prediction. For each user, his/her images are first classified based
on content and metadata. Then, privacy policies of each category of images are
analyzed for the policy prediction. Adopting a two-stage approach is more suitable
for policy recommendation than applying the common one-stage data mining
approaches to mine both image features and policies together. Recall that when a
user uploads a new image, the user is waiting for a recommended policy. The two-
stage approach allows the system to employ the first stage to classify the new
image and find the candidate sets of images for the subsequent policy
recommendation. As for the one-stage mining approach, it would not be able to
locate the right class of the new image because its classification criteria needs both
image features and policies whereas the policies of the new image are not available
yet. Moreover, combining both image features and policies into a single classifier
would lead to a system which is very dependent to the specific syntax of the
policy. If a change in the supported policies were to be introduced, the whole
learning model would need to change.
Module 2
Image Classification
To obtain groups of images that may be associated with similar privacy
preferences, we propose a hierarchical image classification which classifies images
first based on their contents and then refine each category into subcategories based
on their metadata. Images that do not have metadata will be grouped only by
content. Such a hierarchical classification gives a higher priority to image content
and minimizes the influence of missing tags. Note that it is possible that some
images are included in multiple categories as long as they contain the typical
content features or metadata of those categories. The content-based classification
creates two categories: “landscape” and “kid”. Images C, D, E and F are included
in both categories as they show kids playing outdoor which satisfy the two themes:
“landscape” and “kid”. These two categories are further divided into subcategories
based on tags associated with the images. As a result, we obtain two subcategories
under each theme respectively. Notice that image G is not shown in any
subcategory as it does not have any tag; image A shows up in both subcategories
because it has tags indicating both “beach” and “wood”.
Module 3
PolicyMining
We propose a hierarchical mining approach for policy mining. Our approach
leverages association rule mining techniques to discover popular patterns in
policies. Policy mining is carried out within the same category of the new image
because images in the same category are more likely under the similar level of
privacy protection. The basic idea of the hierarchical mining is to follow a natural
order in which a user defines a policy. Given an image, a user usually first decides
who can access the image, then thinks about what specific access rights (e.g., view
only or download) should be given, and finally refine the access conditions such as
setting the expiration date. Correspondingly, the hierarchical mining first look for
popular subjects defined by the user, then look for popular actions in the policies
containing the popular subjects, and finally for popular conditions in the policies
containing bothpopular subjects and conditions.
_ Step 1: In the same category of the new image, conduct association rule mining
on the subject component of polices. Let S1, S2; . . ., denote the subjects occurring
in policies. Each resultant rule is an implication of the form X ) Y, where X, Y _
fS1, S2; . . . ; g, and X  Y ¼ ;. Among the obtained rules, we select the best rules
according to one of the interestingness measures, i.e., the generality of the rule,
defined using support and confidence as introduced in [16]. The selected rules
indicate the most popular subjects (i.e., single subject) or subject combinations
(i.e., multiple subjects) in policies. In the subsequent steps, we consider policies
which contain at least one subject in the selected rules. For clarity, we denote the
set of such policies as Gsub i corresponding to a selected rule Rsub i .
_ Step 2: In each policy set Gsub i , we now conduct association rule mining on the
action component. The result will be a set of association rules in the form of X ) Y,
where X, Y _fopen, comment, tag, downloadg, and X  Y ¼ ;. Similar to the first
step, we will select the best rules according to the generality interestingness. This
time, the selected rules indicate the most popular combination of actions in policies
with respect to each particular subject or subject combination. Policies which do
not contain any action included in the selected rules will be removed. Given a
selected rule Ract j we denote the set of remaining policies as Gact j , and note that
Gact j _ Gsub
_ Step 3: We proceed to mine the condition component in each policy set Gact j .
Let attr1, attr2, ..., attrn denote the distinct attributes in the condition component of
the policies in Gact j . The association rules are in the same format of X ) Y but
with X, Y _fattr1, attr2; . . . ; attrng. Once the rules are obtained, we again select
the best rules using the generality interestingness measure. The selected rules give
us a set of attributes which often appear in policies. Similarly, we denote the
policies containing at least one attribute in the selected rule Rcon k as Gcon k and
Gconk _ Gact j
_ Step 4: This step is to generate candidate policies. Given Gcon k _ Gact j _ Gsub
i , we consider each corresponding series of best rules: Rcon kx , Ract jy and Rsub
iz . Candidate policies are required to possess all elements in Rcon kx , Ract jy and
Rsub iz Note that candidate policies may be different from the policies as result of
Step 3. This is because Step 3 will keep policies as long as they have one of the
attributes in the selected rules.
Module 4
A3P-SOCIAL
The A3P-social employs a multi-criteria inference mechanism that generates
representative policies by leveraging key information related to the user’s social
context and his general attitude toward privacy. As mentioned earlier, A3Psocial
will be invoked by the A3P-core in two scenarios. One is when the user is a newbie
of a site, and does not have enough images stored for the A3P-core to infer
meaningful and customized policies. The other is when the system notices
significant changes of privacy trend in the user’s social circle, which may be of
interest for the user to possibly adjust his/her privacy settings accordingly. In what
follows, we first present the types of social context considered by A3P-Social, and
then present the policy recommendation process.
CONCLUSION:
We have proposed an Adaptive Privacy Policy Prediction (A3P) system that helps
users automate the privacy policy
settings for their uploaded images. The A3P system provides a comprehensive
framework to infer privacy preferences based on the information available for a
given user. We also effectively tackled the issue of cold-start, leveraging social
context information. Our experimental study proves that our A3P is a practical tool
that offers significant improvements over current approaches to privacy.
REFERENCES
[1] A. Acquisti and R. Gross, “Imagined communities: Awareness, information
sharing, and privacy on the facebook,” in Proc. 6th Int. Conf. Privacy Enhancing
Technol. Workshop, 2006, pp. 36–58.
[2] R. Agrawal and R. Srikant,“Fast algorithms for mining association rules in
large databases,” in Proc. 20th Int. Conf. Very Large Data Bases, 1994, pp. 487–
499.
[3] S. Ahern, D. Eckles, N. S. Good, S. King, M. Naaman, and R. Nair, “Over-
exposed?: Privacy patterns and considerations in online and mobile photo sharing,”
in Proc. Conf. Human Factors Comput. Syst., 2007, pp. 357–366.
[4] M. Ames and M. Naaman, “Why we tag: Motivations for annotation in mobile
and online media,” in Proc. Conf. Human Factors Comput. Syst., 2007, pp. 971–
980.
[5] A. Besmer and H. Lipford, “Tagged photos: Concerns, perceptions, and
protections,” in Proc. 27th Int. Conf. Extended Abstracts Human Factors Comput.
Syst., 2009, pp. 4585–4590.
[6] D. G. Altman and J. M. Bland ,“Multiple significance tests: The bonferroni
method,” Brit. Med. J., vol. 310, no. 6973, 1995.
[7] J. Bonneau, J. Anderson, and L. Church, “Privacy suites: Shared privacy for
social networks,” in Proc. Symp. Usable Privacy Security, 2009.
[8] J. Bonneau, J. Anderson, and G. Danezis, “Prying data out of a social network,”
in Proc. Int. Conf. Adv. Soc. Netw. Anal. Mining., 2009, pp.249–254.
[9] H.-M. Chen, M.-H. Chang, P.-C. Chang, M.-C. Tien, W. H. Hsu, and J.-L. Wu,
“Sheepdog: Group and tag recommendation for flickr photos by automatic search-
based learning,” in Proc. 16th ACM Int. Conf. Multimedia, 2008, pp. 737–740.
[10] M. D. Choudhury, H. Sundaram, Y.-R. Lin, A. John, and D. D. Seligmann,
“Connecting content to community in social media via image content, user tags
and user communication,” in Proc. IEEE Int. Conf. Multimedia Expo, 2009,
pp.1238–1241.
[11] L. Church, J. Anderson, J. Bonneau, and F. Stajano, “Privacy stories:
Confidence on privacy behaviors through end user programming,” in Proc. 5th
Symp. Usable Privacy Security, 2009.
[12] R. da Silva Torres and A. Falc~ao, “Content-based image retrieval: Theory
and applications,” Revista de Inform_atica Te_orica e Aplicada, vol. 2, no. 13, pp.
161–185, 2006

More Related Content

DOCX
Privacy policy inference of user uploaded
PDF
Privacy policy inference of user uploaded images on content sharing sites
PPT
Privacy presentation
DOCX
Privacy policy inference of user uploaded
DOCX
IEEE Projects 2015 | Privacy policy inference of user uploaded images on cont...
PDF
Ijricit 01-008 confidentiality strategy deduction of user-uploaded pictures o...
PDF
iaetsd Adaptive privacy policy prediction for user uploaded images on
PDF
Privacy Policy Inference of User-Uploaded Images on Content Sharing Sites
Privacy policy inference of user uploaded
Privacy policy inference of user uploaded images on content sharing sites
Privacy presentation
Privacy policy inference of user uploaded
IEEE Projects 2015 | Privacy policy inference of user uploaded images on cont...
Ijricit 01-008 confidentiality strategy deduction of user-uploaded pictures o...
iaetsd Adaptive privacy policy prediction for user uploaded images on
Privacy Policy Inference of User-Uploaded Images on Content Sharing Sites

What's hot (18)

PDF
Adaptive Privacy Policy Prediction of User Uploaded Images on Content sharing...
DOCX
PRIVACY POLICY INFERENCE OF USER-UPLOADED IMAGES ON CONTENT SHARING SITES
PDF
Detecting Spam Tags Against Collaborative Unfair Through Trust Modelling
PDF
Predicting Privacy Policy Automatically To the User Uploaded Images on Conten...
PDF
Tag based image retrieval (tbir) using automatic image annotation
DOCX
Privacy policy inference of user uploaded
DOCX
TAG BASED IMAGE SEARCH BY SOCIAL RE-RANKING
PDF
A Survey On Privacy Policy Inference for Social Images
PDF
Privacy Management of Multi User Environment in Online Social Networks (OSNs)
PDF
Identification of inference attacks on private Information from Social Networks
PDF
International Journal of Engineering Research and Development (IJERD)
PDF
YOURPRIVACYPROTECTOR: A RECOMMENDER SYSTEM FOR PRIVACY SETTINGS IN SOCIAL NET...
PDF
IJSRED-V2I2P09
PDF
FIND MY VENUE: Content & Review Based Location Recommendation System
PDF
oneslider_template_ACS [532226]
DOCX
Preventing private information inference attacks on social networks
PPTX
Social Media Mining - Chapter 8 (Influence and Homophily)
DOCX
Service rating prediction by exploring social mobile users’ geographical loca...
Adaptive Privacy Policy Prediction of User Uploaded Images on Content sharing...
PRIVACY POLICY INFERENCE OF USER-UPLOADED IMAGES ON CONTENT SHARING SITES
Detecting Spam Tags Against Collaborative Unfair Through Trust Modelling
Predicting Privacy Policy Automatically To the User Uploaded Images on Conten...
Tag based image retrieval (tbir) using automatic image annotation
Privacy policy inference of user uploaded
TAG BASED IMAGE SEARCH BY SOCIAL RE-RANKING
A Survey On Privacy Policy Inference for Social Images
Privacy Management of Multi User Environment in Online Social Networks (OSNs)
Identification of inference attacks on private Information from Social Networks
International Journal of Engineering Research and Development (IJERD)
YOURPRIVACYPROTECTOR: A RECOMMENDER SYSTEM FOR PRIVACY SETTINGS IN SOCIAL NET...
IJSRED-V2I2P09
FIND MY VENUE: Content & Review Based Location Recommendation System
oneslider_template_ACS [532226]
Preventing private information inference attacks on social networks
Social Media Mining - Chapter 8 (Influence and Homophily)
Service rating prediction by exploring social mobile users’ geographical loca...
Ad

Similar to Privacy policy inference of user uploaded (20)

PDF
Privacy Policy Inference of User-Uploaded Images on Content Sharing Sites
PDF
Implementation of Privacy Policy Specification System for User Uploaded Image...
PDF
Protect Social Connection Using Privacy Predictive Algorithm
PDF
Privacy Recommendations and Ranking of User Images on Content Sharing Sites
PDF
Socially Shared Images with Automated Annotation Process by Using Improved Us...
PDF
My Privacy My decision: Control of Photo Sharing on Online Social Networks
PDF
A Dynamic Intelligent Policies Analysis Mechanism for Personal Data Processin...
PDF
Intelligent access control policies for Social network site
PDF
IRJET- An Analysis of Personal Data Shared to Third Parties by Web Services
PDF
Building Open Data Markets Using Sensing as a Service Model
DOC
1.supporting privacy protection in personalized web search..9440480873 ,proje...
PDF
Harnessing AI for Data Privacy through a Multidimensional Framework
PDF
Harnessing AI for Data Privacy through a Multidimensional Framework
PDF
HARNESSING AI FOR DATA PRIVACY THROUGH A MULTIDIMENSIONAL FRAMEWORK
PDF
HARNESSING AI FOR DATA PRIVACY THROUGH A MULTIDIMENSIONAL FRAMEWORK
PDF
Mit csail-tr-2007-034
PDF
Literature survey andrei_manta_0
PDF
Information Disclosure Profiles for Segmentation and Recommendation
PPTX
SUPPORTING PRIVACY PROTECTION IN PERSONALIZED WEB SEARCH
Privacy Policy Inference of User-Uploaded Images on Content Sharing Sites
Implementation of Privacy Policy Specification System for User Uploaded Image...
Protect Social Connection Using Privacy Predictive Algorithm
Privacy Recommendations and Ranking of User Images on Content Sharing Sites
Socially Shared Images with Automated Annotation Process by Using Improved Us...
My Privacy My decision: Control of Photo Sharing on Online Social Networks
A Dynamic Intelligent Policies Analysis Mechanism for Personal Data Processin...
Intelligent access control policies for Social network site
IRJET- An Analysis of Personal Data Shared to Third Parties by Web Services
Building Open Data Markets Using Sensing as a Service Model
1.supporting privacy protection in personalized web search..9440480873 ,proje...
Harnessing AI for Data Privacy through a Multidimensional Framework
Harnessing AI for Data Privacy through a Multidimensional Framework
HARNESSING AI FOR DATA PRIVACY THROUGH A MULTIDIMENSIONAL FRAMEWORK
HARNESSING AI FOR DATA PRIVACY THROUGH A MULTIDIMENSIONAL FRAMEWORK
Mit csail-tr-2007-034
Literature survey andrei_manta_0
Information Disclosure Profiles for Segmentation and Recommendation
SUPPORTING PRIVACY PROTECTION IN PERSONALIZED WEB SEARCH
Ad

More from nexgentech15 (20)

DOCX
Subgraph matching with set similarity in a
DOCX
Rule based method for entity resolution
DOCX
Discovery of ranking fraud for mobile apps
DOCX
Secure auditing and deduplicating data in cloud
DOCX
Provable multicopy dynamic data possession
DOCX
Orchestrating bulk data transfers across
DOCX
New algorithms for secure outsourcing of
DOCX
Identity based encryption with outsourced
DOCX
Cost effective authentic and anonymous
DOCX
Control cloud data access privilege and
DOCX
A trusted iaa s environment
DOCX
A profit maximization scheme with guaranteed
DOCX
User defined privacy grid system
DOCX
Learning to rank image tags with limited
DOCX
Detecting malicious facebook applications
DOCX
Collusion tolerable privacy-preserving sum
DOCX
Automatic face naming by learning discriminative
DOCX
A computational dynamic trust model
DOCX
Space efficient verifiable secret sharing
DOCX
Query aware determinization of uncertain
Subgraph matching with set similarity in a
Rule based method for entity resolution
Discovery of ranking fraud for mobile apps
Secure auditing and deduplicating data in cloud
Provable multicopy dynamic data possession
Orchestrating bulk data transfers across
New algorithms for secure outsourcing of
Identity based encryption with outsourced
Cost effective authentic and anonymous
Control cloud data access privilege and
A trusted iaa s environment
A profit maximization scheme with guaranteed
User defined privacy grid system
Learning to rank image tags with limited
Detecting malicious facebook applications
Collusion tolerable privacy-preserving sum
Automatic face naming by learning discriminative
A computational dynamic trust model
Space efficient verifiable secret sharing
Query aware determinization of uncertain

Recently uploaded (20)

PPTX
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
PDF
International_Financial_Reporting_Standa.pdf
PDF
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
PDF
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
PPTX
Introduction to pro and eukaryotes and differences.pptx
PPTX
Unit 4 Computer Architecture Multicore Processor.pptx
PDF
Hazard Identification & Risk Assessment .pdf
PDF
advance database management system book.pdf
PDF
Empowerment Technology for Senior High School Guide
PPTX
Share_Module_2_Power_conflict_and_negotiation.pptx
PPTX
Computer Architecture Input Output Memory.pptx
PDF
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
PPTX
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
PPTX
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
PDF
AI-driven educational solutions for real-life interventions in the Philippine...
PDF
Weekly quiz Compilation Jan -July 25.pdf
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PDF
What if we spent less time fighting change, and more time building what’s rig...
PDF
My India Quiz Book_20210205121199924.pdf
PDF
Uderstanding digital marketing and marketing stratergie for engaging the digi...
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
International_Financial_Reporting_Standa.pdf
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
Introduction to pro and eukaryotes and differences.pptx
Unit 4 Computer Architecture Multicore Processor.pptx
Hazard Identification & Risk Assessment .pdf
advance database management system book.pdf
Empowerment Technology for Senior High School Guide
Share_Module_2_Power_conflict_and_negotiation.pptx
Computer Architecture Input Output Memory.pptx
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
AI-driven educational solutions for real-life interventions in the Philippine...
Weekly quiz Compilation Jan -July 25.pdf
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
What if we spent less time fighting change, and more time building what’s rig...
My India Quiz Book_20210205121199924.pdf
Uderstanding digital marketing and marketing stratergie for engaging the digi...

Privacy policy inference of user uploaded

  • 1. PRIVACY POLICY INFERENCE OF USER-UPLOADED IMAGES ON CONTENT SHARING SITES Abstract—With the increasing volume of images users share through social sites, maintaining privacy has become a major problem, as demonstrated by a recent wave of publicized incidents where users inadvertently shared personal information. In light of these incidents, the need of tools to help users control access to their shared content is apparent. Toward addressing this need, we propose an Adaptive Privacy Policy Prediction (A3P) system to help users compose privacy settings for their images. We examine the role of social context, image content, and metadata as possible indicators of users’ privacy preferences. We propose a two- level framework which according to the user’s available history on the site, determines the best available privacy policy for the user’s images being uploaded. Our solution relies on an image classification framework for image categories which may be associated with similar policies, and on a policy prediction algorithm to automatically generate a policy for each newly uploaded image, also according to users’ social features. Over time, the generated policies will follow the evolution of users’ privacy attitude. We provide the results of our extensive evaluation over 5,000 policies, which demonstrate the effectiveness of our system, with prediction accuracies over 90 percent.
  • 2. EXISTING SYSTEM: Several recent works have studied how to automate the task of privacy settings Bonneau et al. proposed the concept of privacy suites which recommend to users a suite of privacy settings that n“expert” users or other trusted friends have already set, so that normal users can either directly choose a setting or only need to do minor modification. Similarly, Danezis proposed a machine-learning based approach to automatically extract privacy settings from the social context within which the data is produced. Parallel to the work of Danezis, Adu-Oppong et al. develop privacy settings based on a concept of “Social Circles” which consist of clusters of friends formed by partitioning users’ friend lists. Ravichandran et al. studied how to predict a user’s privacy preferences for location-based data (i.e., share her location or not) based on location and time of day. Fang et al. proposed a privacy wizard to help users grant privileges to their friends. The wizard asks users to first assign privacy labels to selected friends, and then uses this as input to construct a classifier which classifies friends based on their profiles and automatically assign privacy labels to the unlabeled friends. More recently, Klemperer et al. studied whether the keywords and captions with which users tag their photos can be used to help users more intuitively create and maintain access-
  • 3. control policies. Their findings are inline with our approach: tags created for organizational purposes can be repurposed to help create reasonably accurate access-controlrules. PROPOSED SYSTEM: We propose an Adaptive Privacy Policy Prediction (A3P) system which aims to provide users a hassle free privacy settings experience by automatically generating personalized policies. The A3P system handles user uploaded images, and factors in the following criteria that influence one’s privacy settings of images: The impact of social environment and personal characteristics. Social context of users, such as their profile information and relationships with others may provide useful information regarding users’ privacy preferences. For example, users interested in photography may like to share their photos with other amateur photographers. Users who have several family members among their social contacts may share with them pictures related to family events. However, using common policies across all users or across users with similar traits may be too simplistic and not satisfy individual preferences. Users may have drastically different opinions even on the same type of images. For example, a privacy adverse person may be willing to share all his personal images while a more conservative person may just want to
  • 4. share personal images with his family members. In light of these considerations, it is important to find the balancing point between the impact of social environment and users’ individual characteristics in order to predict the policies that match each individual’s needs. Moreover, individuals may change their overall attitude toward privacy as time passes. In order to develop a personalized policy recommendation system, such changes on privacy opinions should be carefully considered. The role of image’s content and metadata. In general, similar images often incur similar privacy preferences, especially when people appear in the images. For example, one may upload several photos of his kids and specify that only his family members are allowed to see these photos. He may upload some other photos of landscapes which he took as a hobby and for these photos, he may set privacy preference allowing anyone to view and comment the photos. Analyzing the visual content may not be sufficient to capture users’ privacy preferences. Tags and other metadata are indicative of the social context of the image, including where it was taken and why , and also provide a synthetic description of images, complementing the information obtained from visual content analysis.
  • 5. Module 1 A3P-CORE There are two major components in A3P-core: (i) Image classification and (ii) Adaptive policy prediction. For each user, his/her images are first classified based on content and metadata. Then, privacy policies of each category of images are analyzed for the policy prediction. Adopting a two-stage approach is more suitable for policy recommendation than applying the common one-stage data mining approaches to mine both image features and policies together. Recall that when a user uploads a new image, the user is waiting for a recommended policy. The two- stage approach allows the system to employ the first stage to classify the new image and find the candidate sets of images for the subsequent policy recommendation. As for the one-stage mining approach, it would not be able to locate the right class of the new image because its classification criteria needs both image features and policies whereas the policies of the new image are not available yet. Moreover, combining both image features and policies into a single classifier would lead to a system which is very dependent to the specific syntax of the policy. If a change in the supported policies were to be introduced, the whole learning model would need to change. Module 2
  • 6. Image Classification To obtain groups of images that may be associated with similar privacy preferences, we propose a hierarchical image classification which classifies images first based on their contents and then refine each category into subcategories based on their metadata. Images that do not have metadata will be grouped only by content. Such a hierarchical classification gives a higher priority to image content and minimizes the influence of missing tags. Note that it is possible that some images are included in multiple categories as long as they contain the typical content features or metadata of those categories. The content-based classification creates two categories: “landscape” and “kid”. Images C, D, E and F are included in both categories as they show kids playing outdoor which satisfy the two themes: “landscape” and “kid”. These two categories are further divided into subcategories based on tags associated with the images. As a result, we obtain two subcategories under each theme respectively. Notice that image G is not shown in any subcategory as it does not have any tag; image A shows up in both subcategories because it has tags indicating both “beach” and “wood”. Module 3 PolicyMining
  • 7. We propose a hierarchical mining approach for policy mining. Our approach leverages association rule mining techniques to discover popular patterns in policies. Policy mining is carried out within the same category of the new image because images in the same category are more likely under the similar level of privacy protection. The basic idea of the hierarchical mining is to follow a natural order in which a user defines a policy. Given an image, a user usually first decides who can access the image, then thinks about what specific access rights (e.g., view only or download) should be given, and finally refine the access conditions such as setting the expiration date. Correspondingly, the hierarchical mining first look for popular subjects defined by the user, then look for popular actions in the policies containing the popular subjects, and finally for popular conditions in the policies containing bothpopular subjects and conditions. _ Step 1: In the same category of the new image, conduct association rule mining on the subject component of polices. Let S1, S2; . . ., denote the subjects occurring in policies. Each resultant rule is an implication of the form X ) Y, where X, Y _ fS1, S2; . . . ; g, and X Y ¼ ;. Among the obtained rules, we select the best rules according to one of the interestingness measures, i.e., the generality of the rule, defined using support and confidence as introduced in [16]. The selected rules indicate the most popular subjects (i.e., single subject) or subject combinations (i.e., multiple subjects) in policies. In the subsequent steps, we consider policies
  • 8. which contain at least one subject in the selected rules. For clarity, we denote the set of such policies as Gsub i corresponding to a selected rule Rsub i . _ Step 2: In each policy set Gsub i , we now conduct association rule mining on the action component. The result will be a set of association rules in the form of X ) Y, where X, Y _fopen, comment, tag, downloadg, and X Y ¼ ;. Similar to the first step, we will select the best rules according to the generality interestingness. This time, the selected rules indicate the most popular combination of actions in policies with respect to each particular subject or subject combination. Policies which do not contain any action included in the selected rules will be removed. Given a selected rule Ract j we denote the set of remaining policies as Gact j , and note that Gact j _ Gsub _ Step 3: We proceed to mine the condition component in each policy set Gact j . Let attr1, attr2, ..., attrn denote the distinct attributes in the condition component of the policies in Gact j . The association rules are in the same format of X ) Y but with X, Y _fattr1, attr2; . . . ; attrng. Once the rules are obtained, we again select the best rules using the generality interestingness measure. The selected rules give us a set of attributes which often appear in policies. Similarly, we denote the policies containing at least one attribute in the selected rule Rcon k as Gcon k and Gconk _ Gact j
  • 9. _ Step 4: This step is to generate candidate policies. Given Gcon k _ Gact j _ Gsub i , we consider each corresponding series of best rules: Rcon kx , Ract jy and Rsub iz . Candidate policies are required to possess all elements in Rcon kx , Ract jy and Rsub iz Note that candidate policies may be different from the policies as result of Step 3. This is because Step 3 will keep policies as long as they have one of the attributes in the selected rules. Module 4 A3P-SOCIAL The A3P-social employs a multi-criteria inference mechanism that generates representative policies by leveraging key information related to the user’s social context and his general attitude toward privacy. As mentioned earlier, A3Psocial will be invoked by the A3P-core in two scenarios. One is when the user is a newbie of a site, and does not have enough images stored for the A3P-core to infer meaningful and customized policies. The other is when the system notices significant changes of privacy trend in the user’s social circle, which may be of interest for the user to possibly adjust his/her privacy settings accordingly. In what follows, we first present the types of social context considered by A3P-Social, and then present the policy recommendation process. CONCLUSION:
  • 10. We have proposed an Adaptive Privacy Policy Prediction (A3P) system that helps users automate the privacy policy settings for their uploaded images. The A3P system provides a comprehensive framework to infer privacy preferences based on the information available for a given user. We also effectively tackled the issue of cold-start, leveraging social context information. Our experimental study proves that our A3P is a practical tool that offers significant improvements over current approaches to privacy. REFERENCES [1] A. Acquisti and R. Gross, “Imagined communities: Awareness, information sharing, and privacy on the facebook,” in Proc. 6th Int. Conf. Privacy Enhancing Technol. Workshop, 2006, pp. 36–58. [2] R. Agrawal and R. Srikant,“Fast algorithms for mining association rules in large databases,” in Proc. 20th Int. Conf. Very Large Data Bases, 1994, pp. 487– 499.
  • 11. [3] S. Ahern, D. Eckles, N. S. Good, S. King, M. Naaman, and R. Nair, “Over- exposed?: Privacy patterns and considerations in online and mobile photo sharing,” in Proc. Conf. Human Factors Comput. Syst., 2007, pp. 357–366. [4] M. Ames and M. Naaman, “Why we tag: Motivations for annotation in mobile and online media,” in Proc. Conf. Human Factors Comput. Syst., 2007, pp. 971– 980. [5] A. Besmer and H. Lipford, “Tagged photos: Concerns, perceptions, and protections,” in Proc. 27th Int. Conf. Extended Abstracts Human Factors Comput. Syst., 2009, pp. 4585–4590. [6] D. G. Altman and J. M. Bland ,“Multiple significance tests: The bonferroni method,” Brit. Med. J., vol. 310, no. 6973, 1995. [7] J. Bonneau, J. Anderson, and L. Church, “Privacy suites: Shared privacy for social networks,” in Proc. Symp. Usable Privacy Security, 2009. [8] J. Bonneau, J. Anderson, and G. Danezis, “Prying data out of a social network,” in Proc. Int. Conf. Adv. Soc. Netw. Anal. Mining., 2009, pp.249–254. [9] H.-M. Chen, M.-H. Chang, P.-C. Chang, M.-C. Tien, W. H. Hsu, and J.-L. Wu, “Sheepdog: Group and tag recommendation for flickr photos by automatic search- based learning,” in Proc. 16th ACM Int. Conf. Multimedia, 2008, pp. 737–740. [10] M. D. Choudhury, H. Sundaram, Y.-R. Lin, A. John, and D. D. Seligmann, “Connecting content to community in social media via image content, user tags
  • 12. and user communication,” in Proc. IEEE Int. Conf. Multimedia Expo, 2009, pp.1238–1241. [11] L. Church, J. Anderson, J. Bonneau, and F. Stajano, “Privacy stories: Confidence on privacy behaviors through end user programming,” in Proc. 5th Symp. Usable Privacy Security, 2009. [12] R. da Silva Torres and A. Falc~ao, “Content-based image retrieval: Theory and applications,” Revista de Inform_atica Te_orica e Aplicada, vol. 2, no. 13, pp. 161–185, 2006