Carefully selected. Rigorously Tested. Enterprise Ready. Original algorithms for data scientists to read, use and share.
AI systems tend to operate in the darkness of black boxes. Input data is transformed into decisions without much human-readable justification. The journey from data to insights requires a crucial data comprehension stage, so the direction of an AI/ML model is fully explainable. Omitting this stage may produce a result that is not actionable, causing the AI project to dead-end.
AI-Rover™ is the full implementation of an explainable AI algorithm on a virtualized notebook, in concert with Zeblok's GPU-powered data lake, Edge connectivity and other components. AI-Rover™ discovers and visually explains patterns in complex datasets as well as the causal relationships underlying those patterns, supporting data analysts in the construction of trust-able decision-making AI models. Zeblok's accelerated data lake enables rapid querying of multiple disparate data sources. AI-Rover™ enables the crucial data comprehension phase to properly target the direction of an AI project.
Say goodbye to tedious data exploration and instead gain quick access to the hidden information in your data within an easy to understand visual interface that enables users to intuitively derive subpopulations of data attributes, leading to better predictive insights.
Computer Vision analytics to detect object-person configuration and crowd counting from Drones
We present a computer vision method for processing images from a drone-mounted camera to detect foreign objects in a scene or to detect specific person-object configurations of interest—computer vision method based on MaskRCNN, the state-of-the-art appearance-based object detector framework. We will further extend MaskRCNN with an attention mechanism to incorporate contextual cues in the detection process. It uses feature similarity and spatial relationships between semantically related entities. This approach performs well for detecting humans and human hands in images, and the advantages of this approach are explainable. In essence, a region is more likely to be a hand if there are other regions with similar skin tones, and the location of a hand is inferred by the presence of other semantically related body parts such as wrist and elbow. Crowd counting is a natural extension for this algorithm, and we've seen good results.
Computer Vision-based Real-Time Data Analytics Using Edge Computing Infrastructure with Application for Smart City
Augment Smart City Applications
Usually, traditional computer vision algorithms, such as Support Vector Machines (SVM) or edge detection, can be applied to tackle such kinds of tasks in computer vision. Given the development of deep learning and 5G networks, the exploration of more advanced machine learning, such as Variational Auto Encoders (VAE) and Generative Adversarial Networks (GAN) has become applicable for the same objectives.
This algorithm utilizes current deep learning techniques to inspect road surfaces and monitor road conditions in real-time. Previous works involved in such topics usually apply traditional computer vision algorithms, such as edge detection, color space segmentation, or support vector machine (SVM), which are designed to work in a centralized computing environment. This algorithm creates a novel deep learning structure, such as Variational Auto-encoder (VAE) and Generative Adversarial Networks (GANs) to detect road potholes and cracks as anomalies.
Deep Learning and Adversarial Learning in Credit Card Fraud
Augment FinTech Operations
Credit card fraud is a problem that can cost banks billions of dollars annually, leading to increased incentives among financial institutions for the development of fast, effective, and dynamic fraud detection systems. This research project addresses credit card fraud detection through a semi-supervised approach, in which clusters of account profiles are created and used for modeling classifiers. Accounts are profiled based on their behavioral trends and clustered into similar groups. Groups are further identified as distinct customer segments based on purchase characteristics such as amount, frequency, or distance. Random forest and XGBoost classifiers are trained on an entire sample and compared against classifiers trained at the transaction level across each cluster.
This research concludes that the overall weighted performance of classifiers trained at the cluster level does not significantly outperform classifiers trained on the full sample. However, this research finds that clustering can be used to find meaningful groups of account holders that also have varying fraud rates across each cluster. Additionally, some classifiers trained on specific clusters yield significant improvements in performance over the baseline, whereas classifiers for other clusters do not perform as well as the baseline. This research also concludes that the optimal classifier for a given cluster varies by cluster, highlighting the potential for further development of new classifiers, which may perform well on clusters that currently exhibit under performing models.