Google Has Your Data, But How Do They Use It?
By David Magerman, PhD
TL;DR - Our data is everywhere. We can’t stop companies from having it. But we have a right to know how it is used. By encrypting all human behavioral data and creating an audit trail when it is decrypted and used, we can achieve enforceable data privacy policies.
[Note: During the drafting of this blog post, California real estate developer Alistair Mactaggart, one of the authors of California’s CCPA law, succeeded in getting a new privacy law on the ballot for the coming November elections (described here). The proposed measure would require the enforcement of “purpose limitation,” effectively requiring companies to allow users to limit the ways in which their behavioral data is used in machine learning algorithms. The law would require implementation of some variant of what is proposed here].
When it comes to data privacy, those of us who have been tilting at the windmill of protecting people from the abuse of their data have been getting it all wrong. We have been overly focused on the ubiquity of human behavioral data and on people’s right to be anonymized and forgotten, when we should be concentrating our efforts on clarifying and constraining how the data is used, shining a light on the machine learning algorithms that weaponize it.
Digital human behavioral data is everywhere. Our browsers track our movements around the internet. Retailers, credit card companies, and banks collect our spending behavior. Cell phone carriers and app developers track our physical location using the GPS on our phones. Alexa and Siri hear virtually all of our conversations, in our homes and whenever we are near our phones, tablets, or computers. Social media companies track every aspect of our lives, via our posts and the posts in which we are tagged. Cameras everywhere use facial recognition to find us out in the world. Our DNA is collected by various parties, whether because we give it to 23AndMe, because we have to provide it to an employer, or because someone collects it without our knowledge. The list goes on and on.
The technology exists to capture nearly every second of our lives no matter where we are. (Do you bring your cell phone into the bathroom?) Any desire we might have to prevent people from collecting, storing, and aggregating our human behavioral data is a hopeless cause. By deploying devices that can track our every move everywhere in society, including in our own ears (earbuds now ship with Alexa built in!), we have opened Pandora’s box of data promiscuity, and we will never be able to close it again.
Two years ago, the European Union’s General Data Protection Regulation (GDPR) went into effect. California followed earlier this year with its California Consumer Privacy Act (CCPA). Other jurisdictions have added their own attempts to regulate human behavioral data. One of the main focuses of these regulations is the “right to be forgotten”. Another is the anonymization of personally identifiable information (PII). These are all attempts to regulate the availability of human behavioral data, to limit companies’ ability to use it against the wishes of its presumed owners, the human beings who generated it. I have been a proponent of these efforts, and I still support further development of these regulations. However, I think all of these regulatory efforts miss a key point: enforcement.
Putting it bluntly, all of these regulatory efforts are effectively operating on the honor system when it comes to enforcement. We are trusting that companies are abiding by data collection rules. We are assuming they are deleting all copies of data they aren’t allowed to keep (and not dropping secret backups in data vaults hidden in the Rocky Mountains). We trust that they are anonymizing data before they use it, and anonymizing it in ways that truly hide the identities of the humans behind the data, rather than applying some transformation that allows the valuable parts of a person’s identity to be easily recovered by algorithms. And while there are ways of auditing compliance with these regulations, it is extraordinarily hard, if not impossible, to prove compliance, and violations are easily attributed to human error, software bugs, or other incidental mishaps. For all of the fanfare around their announcement and implementation, GDPR, CCPA, and their relatives are largely nuisances that can be easily worked around.
As I embraced this demoralizing view and started to confront the reality that the current approaches to data privacy aren’t going to work, I had an epiphany that has transformed my view of how to approach this problem going forward. Ultimately, the data isn’t the problem. The data exists. Frankly, the data exists whether we collect it or not. We have DNA. We have spending histories. We have locations. There are images of us everywhere. We exist, and our data is just the residue of our existence.
The problem is how we USE that data: the algorithms. Machine learning has become so powerful and accurate that it can use the data we feed it to model us in ways that can harm us. Facial recognition algorithms can build models that identify us when we don’t want to be found, to people we don’t want finding us. Marketing analytics tools can build models that predict how companies can convince us to buy things we don’t want or to pay more for things than we should. Social media companies deploy machine learning to decide what information to show us to keep us glued to their platforms. Machine learning algorithms can take data from a relatively small sample of people and extrapolate from it to predict the behavior of whole swaths of society.
It isn’t the data itself that is the problem. It’s the way the data is used that can harm us. And that observation points toward a way to protect ourselves from the misuse of human behavioral data, by good and bad actors alike. The answer is counterintuitive. Right now, there is a movement to anonymize data, to disconnect it from its source. I think the beginning of the solution is the opposite: attach real identity to every piece of data that is collected anywhere, on any computer system. Then force everyone to ask permission to use it, based on how they plan to use it. Here is what I mean.
Right now, everyone is being asked, by various regulations, to identify human behavioral data and to strip it of identity, to anonymize it. But let’s say we did the opposite. We require everyone who holds even a scrap of human behavioral data to attach the identity of the person who could be identified by that data. Then we encrypt that data with a key associated with that person. Everyone has a key. And every time an algorithm needs to use that key to decrypt the information, it has to get permission from that human being first.
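To make this concrete, here is a minimal sketch in Python of what identity-tagged, permission-gated data might look like. The EncryptedRecord structure, the PermissionService interface, the purpose strings, and the store/use functions are my own illustrative names, not an existing standard, and Fernet stands in for whatever per-person key scheme a real system would use.

```python
# A minimal sketch of identity-tagged, permission-gated decryption.
# EncryptedRecord, PermissionService, store(), and use() are hypothetical
# names chosen for illustration; Fernet is a stand-in for a real
# per-person key management scheme.
from dataclasses import dataclass
from cryptography.fernet import Fernet  # pip install cryptography


@dataclass
class EncryptedRecord:
    subject_id: str      # the person identifiable from this data
    ciphertext: bytes    # behavioral data encrypted with that person's key


class PermissionService:
    """Hypothetical consent broker; a real one would consult the data
    subject's stated preferences (see the responder sketch below)."""
    def ask(self, subject_id: str, purpose: str) -> bool:
        raise NotImplementedError


def store(subject_id: str, key: bytes, raw: bytes) -> EncryptedRecord:
    """Attach identity to the data and encrypt it under that person's key."""
    return EncryptedRecord(subject_id, Fernet(key).encrypt(raw))


def use(record: EncryptedRecord, key: bytes, purpose: str,
        permissions: PermissionService) -> bytes:
    """Decrypt only after the data subject has approved this purpose."""
    if not permissions.ask(record.subject_id, purpose):
        raise PermissionError(f"{record.subject_id} denied use for: {purpose}")
    return Fernet(key).decrypt(record.ciphertext)
```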
[Note: In order to be impactful, this permissioning needs to extend to derivative uses of data as well. If a person’s data is decrypted and used in an algorithm that produces summary statistics, and those summary statistics are later used in another machine learning algorithm, that use needs to be permissioned as well. The permissioning rules need to tag along with all derivative data sets, and the auditing needs to include those uses; otherwise, companies could easily work around these safeguards the way they do now with weak attempts at anonymizing data].
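One way permissions could “tag along” is for every derived data set to carry the identities and allowed purposes of its sources, so any later use can be re-checked against the most restrictive source. The structure and field names below are assumptions for illustration, not an existing standard.

```python
# A sketch of permissioning rules tagging along with derivative data.
# PermissionedData and its fields are illustrative assumptions.
from dataclasses import dataclass, field


@dataclass
class PermissionedData:
    values: list                                         # raw records or summary stats
    source_subjects: set = field(default_factory=set)    # whose data is inside
    allowed_purposes: set = field(default_factory=set)   # purposes all sources permit


def derive(summaries: list, sources: list) -> PermissionedData:
    """Summary statistics inherit the constraints of everything they summarize."""
    subjects = set().union(*(s.source_subjects for s in sources))
    # A derivative may only be used for purposes that every source permits.
    purposes = set.intersection(*(s.allowed_purposes for s in sources))
    return PermissionedData(summaries, subjects, purposes)
```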
This isn’t as arduous or infeasible as it might sound. People could give blanket permission for certain kinds of uses of their data. All the company needs to do is ask. Companies could build automated or semi-automated tools to respond to these requests based on an individual’s preferences. Companies are already building such tools to respond to CCPA and GDPR requests, and they are doing it quite effectively. The key feature of this solution is that the “user” of the data is an algorithm: the company deploying each algorithm that uses the data would have to describe the algorithm’s purpose and get permission from the human represented by the data before using it.
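As a sketch of such tooling, a semi-automated responder could apply a person’s blanket rules per purpose and escalate anything unmatched for a manual decision. The class name and purpose labels here are illustrative assumptions; something like this could play the role of the PermissionService in the earlier sketch.

```python
# A sketch of a semi-automated consent responder: blanket rules answer
# routine requests, and unmatched requests are escalated and denied by
# default. Purpose labels and the class name are illustrative assumptions.
class PreferenceResponder:
    def __init__(self, blanket_rules: dict):
        self.blanket_rules = blanket_rules   # purpose -> True (allow) / False (deny)
        self.pending = []                    # requests awaiting a human decision

    def ask(self, subject_id: str, purpose: str) -> bool:
        if purpose in self.blanket_rules:
            return self.blanket_rules[purpose]
        self.pending.append((subject_id, purpose))  # escalate; default to deny
        return False


# Example: allow medical research, deny insurance pricing, review the rest.
responder = PreferenceResponder({"medical-research": True,
                                 "insurance-pricing": False})
```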
You might be happy to have your DNA used for medical research. You might not be as happy if it is used to investigate crimes or to build models for pricing health insurance. You might be willing to have your spending data used by banks that want to build risk models for making loans, if they compensate you for providing the information. You might want retailers to use your shopping data to help you find products you might want to buy. Or you might not.
Today, if you sell your data or simply let people use it, you can’t control HOW it is used. Regaining that control is the key to protecting ourselves from the abuse of human behavioral data in machine learning algorithms. It’s the algorithms that use the data that matter, and having the ability to control which algorithms are allowed to use our data, and what goals those algorithms achieve for the companies that deploy them, is the key to defusing the dangers of the ubiquity of human behavioral data.
And there’s one more significant advantage to the decryption-by-use approach to protecting human behavioral data: accountability. By forcing everyone to ask for permission every time they use data, you create an audit trail that systematically catalogs every use of every piece of data by every algorithm. In real time, the audit trail is as boring as a box of accounting receipts. But if a suspected bad actor is ever accused of misusing human behavioral data, the audit trail of how that data was used by algorithms within the company would create a roadmap to accountability. Retrospectively, a company could be forced to justify each use, to explain each algorithm, to prove the validity and permissibility of each use of a person’s data.
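Here is a minimal sketch of what such an audit trail could look like: one append-only, hash-chained entry per decryption, recording whose data was used, for what purpose, and by which algorithm. The field set is an assumption about what an auditor would need, not an existing format.

```python
# A sketch of a decryption audit trail: one append-only entry per use,
# hash-chained so that retrospective tampering is detectable. The fields
# are an assumption about what an auditor would need, not a standard.
import hashlib
import json
from datetime import datetime, timezone


class AuditTrail:
    def __init__(self):
        self.entries = []
        self._last_hash = "0" * 64   # genesis value for the hash chain

    def record(self, subject_id: str, purpose: str, algorithm: str) -> dict:
        entry = {
            "subject": subject_id,
            "purpose": purpose,
            "algorithm": algorithm,
            "timestamp": datetime.now(timezone.utc).isoformat(),
            "prev": self._last_hash,
        }
        self._last_hash = hashlib.sha256(
            json.dumps(entry, sort_keys=True).encode()).hexdigest()
        entry["hash"] = self._last_hash
        self.entries.append(entry)
        return entry
```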
If we could get to the point where we had an audit trail for every decryption of every piece of human behavioral data used by machine learning algorithms, then, for the first time, we could create an enforcement system for protecting human behavioral data that has some teeth. The onus would be on the user of the data to prove that their use was valid and permitted. We would no longer be operating on the honor system. Wouldn’t that be a better world to live in?