Join the Community

23,369

Expert opinions

42,409

Total members

306

New members (last 30 days)

181

New opinions (last 30 days)

29,102

Total comments

Join Sign in

Maintaining Enterprise Data Privacy in a Data-Hungry World

04 November 2024 3 comments

Erica Andersen

Marketing

smartR AI

The rise of large language models (LLMs) and other AI systems has created an insatiable demand for data. While these models offer incredible potential, their hunger for data presents a significant challenge for enterprises striving to maintain data privacy.

We also need to see data privacy within the whole data is power context to guarantee a long-term rent seeking income. The control of our data and AI is finding itself in the hands of a small number of people within the big tech cartel. Their increasing power seeks to rent seek at the expense of free market competition, thus stifling the very competition and innovation that got us out of the caves in the first place.

Anyway, time is money, I will get off my soap box and plunge straight into an article that explores the key strategies and considerations for safeguarding sensitive information in our data-driven era.

Understanding the Risks

Data Extraction and Exposure: LLMs can inadvertently memorize and reproduce sensitive information from training datasets, potentially leading to data breaches.
Inference Attacks: Even without direct access to data, attackers can use clever prompts to extract sensitive information or infer patterns from model outputs.
Unintended Biases: If training data contains biases, the resulting models may perpetuate or even amplify those biases, leading to discriminatory outcomes.

Strategies for Protecting Enterprise Data

Data Minimization: Collect and retain only the data that is absolutely necessary for business purposes. Implement data retention policies and securely dispose of outdated information.
De-identification and Anonymization: Remove or obfuscate personally identifiable information (PII) before using data for training or other purposes. Techniques like differential privacy can add noise to data while preserving its statistical properties.
Federated Learning: Train models on decentralized datasets without directly accessing or sharing sensitive information. This approach allows multiple parties to collaborate on model development while maintaining data privacy.
Homomorphic Encryption: Perform computations on encrypted data without decrypting it. This technique enables secure data sharing and analysis without compromising privacy.
Privacy-Preserving Machine Learning: Utilize algorithms and techniques specifically designed to protect privacy during model training and deployment. This includes methods like secure multi-party computation and differential privacy.
Robust Access Controls: Implement strict access controls to limit who can access sensitive data and how it can be used. Regularly review and update access permissions to ensure data security.
Data Governance Framework: Establish a comprehensive data governance framework that includes policies, procedures, and guidelines for data privacy. This framework should address data collection, storage, usage, sharing, and disposal.
Employee Training and Awareness: Educate employees about data privacy best practices and the importance of protecting sensitive information. Foster a culture of data privacy within the organization.
Vendor Due Diligence: When working with third-party vendors, carefully assess their data privacy practices and ensure they align with your organization’s standards. Include data protection clauses in contracts.
Regular Audits and Monitoring: Conduct regular audits and monitoring to assess the effectiveness of data privacy controls. Identify and address any vulnerabilities or gaps in security measures.

Maintaining enterprise data privacy in a data-hungry world requires a multi-faceted approach. While implementing robust data protection strategies like de-identification, access controls, and federated learning are crucial, organizations can further enhance privacy by exploring the use of private Small Language Models (SLMs).

These SLMs, trained exclusively on premise with 100% permissioned data, offer a compelling solution. By leveraging internal data sources and maintaining complete control over the training process, companies can achieve a high degree of privacy and mitigate the risks associated with sharing data with public LLMs. This approach empowers organizations to harness the power of AI while upholding the confidentiality and security of sensitive information.

As the data landscape continues to evolve, embracing strategies like private SLMs will be vital for ensuring data privacy remains a top priority.

External

This content is provided by an external author without editing by Finextra. It expresses the views and opinions of the author.

11825

Report

Channels

/artificial intelligence /security

Artificial Intelligence and Financial Services

Join group

456 opinions 126 members 27 June 2025

Comments: (3)

Bo Harald Chairman/Founding member, board member at Trust Infra for Real Time Economy Prgrm & MyData,

07 November 2024

I have asked if there should be a GDPR also for enterprises. Protecting their Mydata and making art 20 like access to it not only mandatory - but also possible now when the EUDI general purpose organisation wallets/agents will have to be deployed for citizens, data issuers and service providers (verifiers). Or will this so much better controle and security delivered by deployment take care of it. 3-6% GDP-growth promised by McKinsey will in any case materialise mostly via enterprises and more effiecient and ethical AI - when verified and holder-permitted data should be a choice for it - wihtout legislation..

1 Like

Report

Oliver King-Smith CEO at smartR AI

07 November 2024

Thank you for your thoughtful comment, Bo. You raise some important points about the need to protect enterprise data and the potential role of GDPR-like regulations in that effort.

You make a good observation that as the EUDI digital wallet infrastructure is deployed for citizens, data issuers, and service providers, it could provide a more secure and controlled environment for data access and usage. This improved infrastructure may help address some of the data privacy challenges enterprises face with the rise of large language models and other data-hungry AI systems.

At the same time, you highlight the valid concern that enterprises may still need specific data privacy legislation, similar to GDPR, to fully protect their "MyData" and ensure transparent, user-controlled access to it. The potential 3-6% GDP growth promised by the increased efficiency and ethical use of AI does seem to hinge on enterprises being able to leverage verified, holder-permitted data. Robust data privacy laws could help enable this.

It's an interesting question whether the security and control provided by the EUDI ecosystem will be sufficient, or if additional enterprise-focused data regulation is still warranted. There are good arguments on both sides, and it's an area that will likely require ongoing policy discussion and debate.

We appreciate you taking the time to share your perspective on this important issue. We'll continue to monitor developments in this space and welcome further input from knowledgeable voices like yours. Please let us know if you have any other thoughts to share.

1 Like

Report

Bo Harald Chairman/Founding member, board member at Trust Infra for Real Time Economy Prgrm & MyData,

07 November 2024

Thank you Oliver for your insightful comment. As it is public here I take the liberty to use it in my almost daily LinkedIn posts..

1 Like

Report