Notes from 13/7–20/7
COVID-19
How McKinsey is making 100 million advising on the Government’s bumbling pandemic response
I’m not sure how to feel about this at all
AI
Full Stack Deep Learning
Brilliantly done, just uses Gitbook and Youtube videos.
Deep Learning for Anomaly Detection: A Comprehensive Survey
One of the most challenging issues is the difficulty to achieve high anomaly detection recall rate (Challenge #1). Since anomalies are highly rare and heterogeneous, it is difficult to identify all of the anomalies. Many normal instances are wrongly reported as anomalies while true yet sophisticated anomalies are missed.
Anomaly detection in high-dimensional and/or not-independent data (Challenge #2) is also a significant challenge. Anomalies often exhibit evident abnormal characteristics in a low-dimensional space yet become hidden and unnoticeable in a high-dimensional space. High-dimensional anomaly detection has been a long-standing problem. Subspace/feature selection-based methods may be a straightforward solution. However, identifying intricate (e.g., high-order, nonlinear and heterogeneous) feature interactions and couplings may be essential in high-dimensional data yet remains a major challenge for anomaly detection.
Due to the difficulty and cost of collecting large-scale labeled anomaly data, it is important to have data-efficient learning of normality/abnormality (Challenge #3). wo major challenges are how to learn expressive normality/abnormality representations with a small amount of labeled anomaly data and how to learn detection models that are generalized to novel anomalies uncovered by the given labeled anomaly data.
Many weakly/semi-supervised anomaly detection methods assume the given labeled training data is clean, which can be highly vulnerable to noisy instances that are mistakenly labeled as an opposite class label. One main challenge here is how to develop noise-resilient anomaly detection (Challenge #4).
Most of existing methods are for point anomalies, which cannot be used for conditional anomaly and group anomaly since they exhibit completely different behaviors from point anomalies. One main challenge here is to incorporate the concept of conditional/group anomalies into anomaly measures/models for the detection of those complex anomalies (Challenge #5).
In many critical domains there may be some major risks if anomaly detection models are directly used as black-box models. For example, the rare data instances reported as anomalies may lead to possible algorithmic bias against the minority groups presented in the data, such as under-represented groups in fraud detection and crime detection systems. An effective approach to mitigate this type of risk is to have anomaly explanation (Challenge #6) algorithms that provide straightforward clues about why a specific data instance is identified as anomaly. Providing such explanation can be as important as detection accuracy in some applications. To derive anomaly explanation from specific detection methods is still a largely unsolved problem, especially for complex models. Developing inherently interpretable anomaly detection models is also crucial, but it remains a main challenge to well balance the model’s interpretability and effectiveness.
A Practical Guide to building Enterprise Knowledge Graphs for Investment Analysis
I wonder if anyone has done this for Temasek/ Singapore
Blockchain
The Coded Income Model
Basically employment contracts made smart?
Development
The Docker Handbook
Brilliant and useful, will keep around
Education and Training
Lessonly enterprise training
This existed since 2014? It looks absolutely brilliant!
Microsoft help 25 million acquire digital skills plan
Will they actually pull this off though?
Product
The Perfect Formula for FinTech Products: CX = ML + UX
Covers quite a few fintech startups i’ve not actually come across before
Tools
4 Github products that will change the way you code
An operating system for development
mmhmm Video presentation tool
It’s a video on zoom? interesting idea
Loopy for thinking in systems
I’ve a feeling this will be useful when explaining effects