Abstract
The adoption of centralized machine learning in finance is limited by data privacy concerns and regulations. This study evaluates the impact of federated learning, synthetic data generation, and anonymization on classification performance using the Default of Credit Card Clients (DCCC) dataset. Experiments with four classification models assess the effects of privacy-preserving techniques. Results show that federated learning and synthetic data generation outperform anonymization in accuracy. Notably, models trained on synthetic data achieve performance comparable to or exceeding centrally trained models (highest accuracy: 80.2%, highest F1-score: 65.87%, Support Vector Machine model). These findings highlight federated learning and synthetic data as effective, privacy-preserving solutions for financial applications.
| Translated title of the contribution | The Impact of Data Privacy Methods on Credit Risk Classification |
|---|---|
| Original language | Turkish |
| Title of host publication | 33rd IEEE Conference on Signal Processing and Communications Applications, SIU 2025 - Proceedings |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9798331566555 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 33rd IEEE Conference on Signal Processing and Communications Applications, SIU 2025 - Istanbul, Turkey Duration: 25 Jun 2025 → 28 Jun 2025 |
Publication series
| Name | 33rd IEEE Conference on Signal Processing and Communications Applications, SIU 2025 - Proceedings |
|---|
Conference
| Conference | 33rd IEEE Conference on Signal Processing and Communications Applications, SIU 2025 |
|---|---|
| Country/Territory | Turkey |
| City | Istanbul |
| Period | 25/06/25 → 28/06/25 |
Bibliographical note
Publisher Copyright:© 2025 IEEE.