sreelekh/LAD

LAD Anomaly Detection Model

Implementation of the LAD model described in: Sreelekha Guggilam, Varun Chandola, and Abani Patra, "Anomaly Detection for High-Dimensional Data Using Large Deviations Principle" (in preparation, 2021).

Image

Image

Top 5 anomalous counties identified by the proposed LAD algorithm based on the daily multivariate time series, consisting of cumulative COVID-19 per-capita infections and deaths. At each time instance, the algorithm analyzes the bivariate time series for all the counties to identify anomalies. The time series for the non-anomalous counties are plotted (light gray) in the background for reference. For the counties in North Dakota (Burleigh and Grand Forks), the number of confirmed cases (top), and in particular its sharp rise in November 2020, is the primary cause of the anomaly. Wayne County in Michigan, on the other hand, was identified as anomalous primarily because of its abnormally high death rate, especially when compared to its relatively moderate confirmed infection rate.

File Descriptions

  1. Run import_libraries.ipynb, import_functions.ipynb, and import_global_params.ipynb (optional) to import the required libraries and functions.
  2. Run LDP_paper_results_8-Evaluation Small, large.ipynb to run the LAD model on the datasets.
  3. Run LDP_paper_results_8-COVID TS plots only 50k population lower limit.ipynb to generate the plots for the COVID-19 data for US counties.
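
The COVID-19 notebook works on cumulative per-capita infections and deaths per county, and its file name mentions a 50k population lower limit. The sketch below shows one way such a bivariate series could be built. It is not the repository's code: the function name per_capita_series, the dictionary layout, the inclusive treatment of the threshold, and all numbers are assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Assumed threshold, read off the notebook name ("50k population lower limit").
MIN_POPULATION = 50_000


def per_capita_series(
    population: Dict[str, int],
    cum_cases: Dict[str, List[int]],
    cum_deaths: Dict[str, List[int]],
    min_population: int = MIN_POPULATION,
) -> Dict[str, List[Tuple[float, float]]]:
    """Return, per county, a list of (cases per capita, deaths per capita) pairs.

    Counties with fewer than `min_population` residents are dropped
    (assumption: the lower limit itself is kept).
    """
    series: Dict[str, List[Tuple[float, float]]] = {}
    for county, pop in population.items():
        if pop < min_population:
            continue  # skip counties below the population lower limit
        series[county] = [
            (cases / pop, deaths / pop)
            for cases, deaths in zip(cum_cases[county], cum_deaths[county])
        ]
    return series


# Made-up counties and counts, for illustration only.
population = {"County A": 100_000, "County B": 20_000}
cum_cases = {"County A": [100, 250], "County B": [10, 30]}
cum_deaths = {"County A": [1, 5], "County B": [0, 1]}

result = per_capita_series(population, cum_cases, cum_deaths)
```

Each county that passes the filter ends up with one (infections, deaths) pair per day, which matches the bivariate time series described in the figure caption. How the notebooks actually load and filter the data may differ.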

Data Sources
