Privacy-Preserving Analytics Pave Way for Secure Information Sharing

Person in front of the projector screen in room full of people, presentation up on projector with the info of their project, Flags to their left and standing banners to their right.

In an era where data fuels decision-making, the delicate balance between utility and privacy has never been more critical. Privacy-Preserving Analytics for Non-IID Data, led by Dr. Jingrui He at the University of Illinois Urbana-Champaign, alongside Dr. Ross Maciejewski of Arizona State University and Dr. Hanghang Tong at Illinois, is at the forefront of addressing these challenges. Their work not only strengthens the Homeland Security Enterprise (HSE) but also promises a paradigm shift in how sensitive information is utilized.


The Challenge: Non-IID Data in Homeland Security

HSE applications often rely on vast amounts of sensitive data, from TSA’s airport body scans to financial fraud detection patterns. This data, however, does not conform to the assumption of independent and identically distributed (IID) data, a foundation for many existing privacy-enhancing technologies. As Dr. He explained, “Most of the work is done on IID data, meaning it’s uniformly distributed, which isn’t the case for DHS applications. For example, body scan images differ significantly from Honolulu to Anchorage.”

The project’s focus on non-IID data represents a groundbreaking step. By addressing interdependencies within datasets, the team aims to enhance privacy while ensuring effective analysis—a necessity in safeguarding critical information across DHS components.


Innovative Solutions for Data Privacy

The project operates on three core research goals:

  1. Synthetic Data Generation: Faithfully generating synthetic data preserves privacy while retaining analytical utility. Dr. He highlighted, “This approach allows parties to contribute without sharing raw data, offering a secure way to advance joint decision-making.”
  2. Federated Learning for Non-IID Data: The team developed algorithms to facilitate collaborative learning without centralized data sharing. These methods ensure robust defenses against adversarial attacks while maintaining data confidentiality.
  3. Visual Analytic Tools: Transparency is key to building trust. The project’s visual tools bridge the gap between complex algorithms and end users, empowering subject matter experts to interpret model outputs.

Science Meets Impact

With the potential to revolutionize privacy-preserving analytics, the project’s outcomes include novel deep-learning techniques, synthetic datasets, and practical tools for DHS applications. These innovations directly address pressing challenges, such as de-identifying Software Bills of Materials (SBOMs) used in cybersecurity.

“We’ve proposed a generation technique that protects privacy while maintaining utility,” said Dr. He. “Our federated learning adaptations and visual analytic systems are equally groundbreaking.”

The team’s accomplishments include two journal publications, six conference presentations, three open-source tools, and one completed Ph.D. thesis. Stakeholder engagement remains strong, with DHS components like TSA and I&A expressing enthusiasm for the project’s applicability.


Personal Motivations and Broader Implications

For Dr. He, this work is deeply personal. “I can’t count how many times I’ve received alerts about my personal information being compromised,” she said. “These experiences fueled my commitment to privacy-preserving analytics.”

The stakes extend beyond individual privacy. By addressing data disparities across DHS operations, the project ensures that critical decisions are informed without compromising security. As HSE evolves, these innovations provide a blueprint for safeguarding sensitive information in increasingly complex data landscapes.

Looking ahead, the team plans to expand their methodologies and refine their tools, with a continued focus on stakeholder collaboration. Their work exemplifies how science and technology can drive practical, impactful solutions for the nation’s most pressing security challenges.

This project seeks to achieve the existence of privacy and progress, demonstrating the power of innovation to transform how sensitive data is managed and utilized. This endeavor, as Dr. He noted, is about “making the impossible possible”—a mission that resonates far beyond the boundaries of academia.