ML technologies enable the creation of intelligent systems that can adapt to new types of attacks and learn from past incidents. This improves the efficiency of security teams, enables rapid threat response, and minimizes potential risks. However, cybercriminals also actively use machine learning, which requires appropriate defense methods.


Machine learning in cybersecurity
Machine learning has revolutionized cybersecurity. In the past, cybersecurity relied on rule-based protection systems and analysts. However, with the advent of machine learning, security incident detection and response have become much more effective. By analyzing vast amounts of data and learning from it, ML algorithms can identify patterns and anomalies that indicate potential threats and take measures to prevent or mitigate them.
ML at Positive Technologies
We strive to ensure our products automatically prevent, detect, and respond to threats. ML models in Positive Technologies products continuously learn from our expertise and from user data, including through self-learning. Thanks to machine learning, security teams are freed from repetitive tasks, analysts gain valuable insights for threat hunting, and managers can effectively prioritize fixing infrastructure weaknesses.
We have developed ML models that detect hackers' most dangerous tactics:
Why we use ML technologies in products
Protection systems begin by collecting raw data, such as logs, traffic, and executable files. This information must be standardized to detect attacks, identify security incidents, and conduct investigations. Machine learning should be applied at every stage, from working with raw data to creating incident reports.
Key vectors of ML development at Positive Technologies
ML in Positive Technologies products
MaxPatrol SIEM
Expert rules in SIEM systems help track suspicious behavior. However, many attack scenarios cannot be described or detected this way. ML models handle this task effectively.
The BAD (Behavioral Anomaly Detection) module in MaxPatrol SIEM acts as a "second-opinion" system that improves attack detection effectiveness through alternative event analysis methods and by assessing each trigger’s reliability on a 100-point scale. BAD also independently detects targeted attacks, serving as a second layer of defense.
The 49 ML models are divided into several types and subtypes:
- Process activity:
  - Process execution activity
  - Network process activity
  - Process access to local pipes
  - Relationships between processes on different hosts
- Access activity:
  - Network share access
  - Network pipe access
The BAD module incorporates ML model verdicts and correlation rules, enabling security teams to make prompt and accurate decisions when analyzing triggers.
In MaxPatrol SIEM, the BAD module:
- Independently detects targeted attacks and previously unknown anomalies.
- Collects event and user data, assigns risk scores, and provides alternative assessments based on its algorithms.
- Helps analysts make faster security decisions.
- Enables rapid detection of previously unknown threats that are invisible in fragmented data streams.
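The 100-point reliability scoring described above can be illustrated with a minimal sketch. This is not the actual BAD implementation: the detector verdicts, the weights, and the aggregation rule are all invented for the example.

```python
# Illustrative sketch (not the actual BAD algorithm): combine several
# hypothetical model verdicts (each in [0, 1]) into a single 0-100 score
# that an analyst can use to prioritize a SIEM trigger.

def risk_score(verdicts, weights=None):
    """Weighted average of model verdicts, scaled to a 100-point scale."""
    if weights is None:
        weights = [1.0] * len(verdicts)
    total = sum(w * v for w, v in zip(weights, verdicts))
    return round(100 * total / sum(weights))

# Three hypothetical detectors examined the same trigger; the first one
# is trusted twice as much as the others.
score = risk_score([0.9, 0.4, 0.7], weights=[2.0, 1.0, 1.0])
print(score)  # a high score means the trigger deserves prompt attention
```

Aggregating several independent "second opinions" into one bounded score is what lets analysts rank triggers instead of reviewing every raw verdict.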
PT NAD
In PT NAD, machine learning helps:
- Detect anomalous node activity using profiling rules.
- Identify applications hiding from network traffic analysis systems.
User profiling rules (UPR)
UPRs allow you to configure filters and monitor the behavior of network participants within the traffic of interest. Machine learning identifies anomalies in traffic and automates the decision-making process for identifying malicious activity. You can create your own filters or use baseline rules developed in collaboration with the PT Expert Security Center (PT ESC).
In each filter, you can specify a single feature (such as the number of bytes sent or unique connections), group data by object (client, server, or client-server pair), or select data for the entire network, as well as define a time interval. An anomaly is defined as exceeding a specific threshold in one or multiple time series. The ML model has three sensitivity levels (low, medium, and high) and can trigger on both minor and significant deviations.
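The thresholding idea behind these filters can be sketched as follows. The k-sigma rule and the mapping of sensitivity levels to multipliers are assumptions made for the example, not PT NAD's actual algorithm.

```python
# Hedged sketch of threshold-based anomaly detection over one traffic
# time series (e.g., bytes sent per interval), in the spirit of the UPR
# mechanism described above. The sensitivity-to-threshold mapping is an
# illustrative assumption.
import statistics

SENSITIVITY = {"high": 2.0, "medium": 3.0, "low": 4.0}  # stdevs above mean

def find_anomalies(series, sensitivity="medium"):
    mean = statistics.fmean(series)
    stdev = statistics.pstdev(series)
    threshold = mean + SENSITIVITY[sensitivity] * stdev
    return [i for i, value in enumerate(series) if value > threshold]

bytes_sent = [120, 130, 110, 125, 135, 128, 950, 122]  # one obvious spike
print(find_anomalies(bytes_sent, "high"))  # -> [6]
```

A "high" sensitivity corresponds to a lower threshold, so the model triggers on smaller deviations; "low" sensitivity requires a larger spike.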
PT Sandbox
The ML model in PT Sandbox performs part of the behavioral analysis of files. Dynamic analysis involves running files in a virtual environment, logging their behavior, and analyzing the resulting log. Each running process leaves behind a sequence of system calls (a trace) through which it interacts with the operating system. The ML team at Positive Technologies has analyzed numerous malicious and clean traces to identify sequences characteristic of malware, including network requests to the internet, file operations, and registry accesses. These calls are reduced to a final feature vector that is processed by the ML model, which then classifies the behavior as "bad" or "good."
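The reduction of a system-call trace to a feature vector can be sketched with call n-grams. The trace, the call names, and the vocabulary below are made up for illustration; the real PT Sandbox pipeline uses a far richer feature set.

```python
# Illustrative sketch: turn a system-call trace into a fixed-length
# feature vector by counting sliding n-grams of calls. Vocabulary and
# trace contents are hypothetical examples.
from collections import Counter

def trace_to_features(trace, n=2, vocabulary=None):
    """Count n-grams of consecutive calls; projecting onto a fixed
    vocabulary makes every trace map to a vector of the same length."""
    ngrams = Counter(tuple(trace[i:i + n]) for i in range(len(trace) - n + 1))
    if vocabulary is None:
        return ngrams
    return [ngrams.get(g, 0) for g in vocabulary]

trace = ["CreateFile", "WriteFile", "RegSetValue", "CreateFile", "WriteFile"]
vocab = [("CreateFile", "WriteFile"), ("WriteFile", "RegSetValue")]
print(trace_to_features(trace, vocabulary=vocab))  # -> [2, 1]
```

A vector like this is what a downstream classifier consumes to label the behavior "bad" or "good".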
These ML models are built on a specific technology stack: PT Sandbox itself is written in Python, the trained model is serialized in the ONNX format, and MLflow is used for experiment tracking and as an artifact repository. The model is trained on a daily stream of examples plus a reference dataset that excludes false positives, which delivers highly accurate detection results.
Tasks the ML model in PT Sandbox helps handle:
- Detecting anomalous subprocess chains. A large number of branching process sequences can be legitimate on its own. However, the number of nodes, the nesting depth, and the repetition or uniqueness of process names can be analyzed effectively only by the ML model.
- Detecting non-standard values of call parameters. In most cases, analysts focus on significant function parameters when searching for malware. The ML model effectively analyzes the remaining parameters.
- Investigating atypical sequences of function calls. Sometimes individual functions or combinations of functions may appear benign, but their sequence is not found in legitimate software. An analyst would need extensive experience to notice such a pattern manually. The ML model detects these patterns through classification using features that were not predefined as indicators of maliciousness.
The main task of ML in PT Sandbox is to continuously improve the accuracy of verdicts when determining if an object is malicious. By analyzing over 8,500 features of object behavior, the ML model ensures high detection quality that is unattainable for systems that use standard malware detection methods.
MaxPatrol VM
Assessing the trending potential of vulnerabilities (CVEs) based on the number of mentions in databases (a statistical approach) has a significant drawback: there is a risk that a vulnerability will be recognized as trending only when it is already being actively exploited.
The machine learning approach includes the following stages:
- The database of publications about CVEs is updated regularly.
- Once a day, the model computes predictions for vulnerabilities based on a dozen parameters, including publication time, number of comments, reposts, likes, post text, and reactions.
- The top 20 predicted CVEs are sent to experts for analysis.
The ML model is trained on both textual (post content) and quantitative features (such as number of subscribers, reactions, and vulnerability mentions) and predicts trending vulnerabilities before the number of their mentions exceeds a threshold value. Experts provide the final evaluation of the model’s performance using quality metrics.
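The ranking step can be sketched as scoring each CVE from its quantitative signals and sending the top entries to experts. The signal names, weights, and logistic scoring below are invented for the example; the real model also uses the post text itself.

```python
# Toy sketch of trending-CVE ranking: score each vulnerability from
# hypothetical quantitative signals and pick the top N for expert review.
import math

WEIGHTS = {"subscribers": 1e-5, "reactions": 0.01, "mentions": 0.5}

def trend_score(signals):
    """Logistic squashing of a weighted signal sum into (0, 1)."""
    z = sum(WEIGHTS[name] * value for name, value in signals.items())
    return 1 / (1 + math.exp(-z))

cves = {
    "CVE-2024-0001": {"subscribers": 50_000, "reactions": 120, "mentions": 9},
    "CVE-2024-0002": {"subscribers": 3_000, "reactions": 4, "mentions": 1},
}
top = sorted(cves, key=lambda c: trend_score(cves[c]), reverse=True)[:20]
print(top[0])  # the most likely trending CVE goes to experts first
```

The point of a learned score, as opposed to a raw mention count, is that it can rank a CVE highly before its mention count crosses any fixed threshold.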
Using the ML model in MaxPatrol VM allows experts to efficiently and promptly determine which CVEs require attention and to rapidly deliver information about trending vulnerabilities into the product.
PT Application Firewall
Products that analyze HTTP traffic receive a large volume of payloads, which may include command shells for remote web server management. In PT Application Firewall, ML models that detect web shells separate legitimate data from malicious data. One model prevents illegitimate scripts from loading, while another detects web shell activity. These models are trained using data on web shells from open sources and examples encountered during Standoff cyberbattles. This diversity increases detection coverage and makes it possible to detect new web shells that cannot be identified using a rule-based approach.
To evaluate detection accuracy, the system uses holdout sets prepared by experts. The initial quality assessment occurs during CI/CD. After model training, a continuous machine learning (CML) process is launched, allowing developers to see the difference in model performance on holdout data within a merge request.
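The holdout comparison surfaced in a merge request can be sketched as evaluating the previous and candidate models on the same expert-prepared set and reporting the metric delta. The models and samples below are toy stand-ins, not the actual PT Application Firewall pipeline.

```python
# Sketch of a CML-style holdout check: compare old and new models on one
# expert-prepared holdout set. Labels: 1 = web shell, 0 = benign.

def accuracy(model, samples):
    return sum(model(x) == y for x, y in samples) / len(samples)

holdout = [("benign-1", 0), ("shell-1", 1), ("shell-2", 1), ("benign-2", 0)]
old_model = lambda x: 1 if "shell" in x else 0                      # baseline
new_model = lambda x: 1 if "shell" in x or x == "benign-2" else 0   # candidate

delta = accuracy(new_model, holdout) - accuracy(old_model, holdout)
print(f"accuracy delta: {delta:+.2f}")  # negative -> candidate regressed
```

Surfacing the delta directly in the merge request lets developers catch a regression before the retrained model ships.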
What tasks does ML perform in PT Application Firewall?
- Detecting malicious web shells in requests and responses. The ML model determines the probability that an uploaded file is malicious by comparing it with a threshold value; a convolutional neural network (CNN) is used for classification.
- Detecting shellcode generated by the Metasploit Framework in various formats and encodings. These models are trained on payloads created with the Metasploit Framework and on data from the Microsoft Malware Prediction competition.
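The probability-versus-threshold decision described above can be sketched as follows. The stub classifier and the threshold value are assumptions for illustration only, not the CNN actually used in PT Application Firewall.

```python
# Minimal sketch of the blocking decision: compare the model's estimated
# probability that a payload is a web shell against a threshold.

THRESHOLD = 0.8  # illustrative cut-off, not the product's real setting

def predict_webshell_probability(payload: bytes) -> float:
    """Stand-in stub for the real classifier: flags eval() in payloads."""
    return 0.95 if b"eval(" in payload else 0.05

def should_block(payload: bytes) -> bool:
    return predict_webshell_probability(payload) >= THRESHOLD

print(should_block(b"<?php eval($_POST['cmd']); ?>"))  # -> True
print(should_block(b"<html>hello</html>"))             # -> False
```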
Thinking about the best way to protect your company?
Contact us.
During the consultation we'll propose a solution precisely tailored to your organization.


