Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

WeekTaskStatusComments
20-MayStudy Work: State of art on the models, optimization and EvaluationDoneLook for optimization techniques, how they evaluate anonymization models.
27-MayFinalizing Dataset and Libraries to use -- suppression/rename/ .. etc.DoneKubernetes logs/Metrics, Openstack logs/metrics .. any data that has PII information
3-June

Anonymization Impact on the Model's utility

Done
10-JuneDone
17-JuneContaineration and the APIsDone
24-JuneAutomation using PythonDone
1-JulyTesting of the containerized ArchitectureDone
8-July

NLP Model for anonymizing Telco Data



15-July

22-July

29-July

5-AugEvaluation of the Model

12-AugIntegration of the developed model with the architecture

19-AugDocumentation and release of the code.

26-Aug[BUFFER]

...

  1. Metrics like precision, recall, and F1-score can be used to assess how well the method identifies sensitive information.
  2. However, the impact on models requires domain-specific evaluation. Some approaches that I will follow are:
    1. Compare model performance: Train and test models on original and anonymized data to see the accuracy drop.
    2. Evaluate information loss: Measure how much relevant information is lost due to anonymization.

Anonymization Impact on the Model's utility

The work has been updated on the personal page to prevent exposure of undergoing progress.