• This is an article about ethical practice on big data gathering.

./20161105-0139-cet-code-of-ethical-practice-1.png

  • There is this startup named Color.
  • They want to use usable sensors in common smart phone for big data gathering.

./20161105-0139-cet-code-of-ethical-practice-2.png

  • Color make us realized that companies are always gaining new ways to capture and gather information about everything.

./20161105-0139-cet-code-of-ethical-practice-3.png

./20161105-0139-cet-code-of-ethical-practice-4.png

  • There are these technology that is commonly used for big data gathering.
    • Language processing.
    • Machine learning.
    • Software architecture like Hadoop which can handle multiple simultaneous search query.

./20161105-0139-cet-code-of-ethical-practice-5.png

  • Back then gathered data was arranged very messy and unstructured.
  • Now everything is stored in data warehouses.
  • The target of data mining could be anything, although it is usually comes from social networking site.

./20161105-0139-cet-code-of-ethical-practice-6.png

  • Data gathered is reached into staggering amount of 8.000 billion gigabytes.
  • By 2020, there would be total of 35 zetabytes of information stored globally.

./20161105-0139-cet-code-of-ethical-practice-7.png

  • Some privacy measures by American government.
    • John McCain and John Kerry proposed Consumer Privacy Bill Of Right Act in 2011.
    • Senator Jay Rockfeller has proposed Do - Not - Track Online Act also in 2011.
  • In Europe, there is The European Union Article 29 Working Group that addressing similar content.

./20161105-0139-cet-code-of-ethical-practice-8.png

  • Aside from government there is also a precaution dropped by Digital Advertising Alliance.
  • They introduced rule - making privacy framework to assure security and safety of customer information.
  • There is also Self - Regulatory Program For Online Behavior Adverting as well.

./20161105-0139-cet-code-of-ethical-practice-9.png

  • From Wall Street Journal it is known that average price of untargeted advertisement was at some point around 1.98 US Dollars per thousands view. While average price of targeted advertisement could go double at rate of 4.12 US Dollars.

./20161105-0139-cet-code-of-ethical-practice-10.png

./20161105-0139-cet-code-of-ethical-practice-11.png

  • Big company started to acquire smaller company that run around big data and data mining.

./20161105-0139-cet-code-of-ethical-practice-12.png

  • David Moore said, "It ceases to be an ad, it becomes important information.".

./20161105-0139-cet-code-of-ethical-practice-13.png

  • Facebook, Google, and Zynga start to aggregate data of billions user informations.

./20161105-0139-cet-code-of-ethical-practice-14.png

  • Example of security breach happened to Apple and Sony's PlayStation.
  • For around 100 million customers information.

./20161105-0139-cet-code-of-ethical-practice-15.png

  • Opportunities.
    • Data exchange.
    • Data market.
    • Predictive analytic market.
  • Below are some good practices on data mining.
  • In big line.
    • Clarity of practices.
    • Simplicity of settings.
    • Privacy by design (security).
    • Exchange values.

./20161105-0139-cet-code-of-ethical-practice-16.png

  • Clarity of practices means that users need to be able to see what kind of data that are being collected during their activity online.

./20161105-0139-cet-code-of-ethical-practice-17.png

  • Simplicity of settings means that user need to be able to set every possible aspect of their privacy.
  • Facebook and Google do this practice really well.

./20161105-0139-cet-code-of-ethical-practice-18.png

  • For example Facebook privacy settings can have an option up to 170 options.

./20161105-0139-cet-code-of-ethical-practice-19.png

  • Privacy by design means that the organization need to also protect their data from malicious attack.
  • It is said by Ann Cavaoukin a privacy comisioner from Ontario, Canada that options and transparency is not enough if unauthorized people can access the data.
  • So security need to be in mind when the whole data gathering infrastructure is made.

./20161105-0139-cet-code-of-ethical-practice-20.png

  • Exchange value means that the data gatherer needs to provide back to user in form of convenience.
  • For example some web application records how their user use their web application. In result there would be a recommended contents that is curated specifically for each of their users.

./20161105-0139-cet-code-of-ethical-practice-21.png

  • Clear example of exchange value is Netflix.
  • Netflix can show movie/serial/television program that might interest their user based on what kind of content that the user opened/searched before.

./20161105-0139-cet-code-of-ethical-practice-22.png

  • These points are made to make easy companies on defining their product's principles.

./20161105-0139-cet-code-of-ethical-practice-23.png

./20161105-0139-cet-code-of-ethical-practice-24.png