Malware Analysis Fundamentals

Malware Categories

Type	Mechanism	Goal	Example
Virus	Attaches to and modifies legitimate host files; spreads when host file runs	Damage, disruption, propagation	CIH (Chernobyl), Melissa
Worm	Self-replicating; spreads over network without a host file	Network disruption, propagation, payload delivery	WannaCry, Conficker
Trojan	Disguised as legitimate software; does not self-replicate	Backdoor, data theft, dropper	Agent Tesla, Emotet
Ransomware	Encrypts victim files and demands payment for decryption key	Financial extortion	LockBit, REvil, BlackCat
Spyware	Silently monitors and steals data	Credential theft, surveillance	Pegasus, FinFisher
Rootkit	Hides presence at OS or hardware level	Persistence, stealth	TDSS/TDL, Necurs
RAT / Botnet	Remote access Trojan; controlled via C2; may join a botnet	Remote control, DDoS, spam	Cobalt Strike, AsyncRAT
Keylogger	Records keystrokes to capture credentials	Credential theft	Ardamax, HawkEye
Fileless Malware	Operates entirely in memory; no file written to disk	Evasion, stealth	Poweliks, Kovter
Adware	Displays unwanted advertisements; collects behavioral data	Revenue generation for operator	Various browser extensions

Malware Infection Vectors

Phishing email with malicious attachment — Office document with macros, PDF with embedded JavaScript, or ZIP archive containing executable
Phishing email with malicious link — drives to exploit kit or credential harvesting page
Drive-by download — visiting a compromised or malicious website triggers automatic malware download via browser exploit
USB and removable media — AutoRun or LNK file execution when USB is inserted
Supply chain compromise — malicious code inserted into legitimate software updates (SolarWinds, NotPetya via M.E.Doc update)
Watering hole attack — compromising a website frequently visited by the target community
Exploit of vulnerable public-facing services — RDP brute force, Log4Shell, ProxyLogon

Static Analysis

Static analysis examines a malware sample without executing it. This is the safer first step — no risk of infection, no need for a sandbox. The goal is to quickly characterize the file and extract useful IoCs and intelligence without running the code.

Hash Analysis

The first step is always to compute the file hash (MD5, SHA1, SHA256) and look it up in threat intelligence sources (VirusTotal, Malware Bazaar, your TIP). A known-malicious hash gives you an instant verdict and existing analysis reports.

String Extraction

The strings command (Linux) extracts printable ASCII and Unicode strings from a binary. Malware often contains useful strings: C2 URLs, registry key paths, API function names, encryption keys (sometimes hardcoded), error messages, and debug artifacts.

bash

# Extract printable strings (minimum 6 characters)
strings -n 6 malware.exe

# Extract Unicode strings
strings -e l malware.exe

# Extract strings and search for URLs
strings malware.exe | grep -E "https?://"

# Extract strings and search for registry paths
strings malware.exe | grep -i "HKCU\|HKLM"

PE Header Analysis

Windows executables follow the PE (Portable Executable) format. The PE header contains rich metadata:

Compilation timestamp — when the binary was compiled (often manipulated by malware authors, but still useful)
Import Address Table (IAT) — which Windows API functions the malware imports. Imports like VirtualAllocEx, WriteProcessMemory, CreateRemoteThread indicate process injection
Export table — functions exported by DLLs; useful for analyzing injected DLLs
Section names — standard sections (.text, .data, .rsrc). Non-standard or high-entropy sections suggest packing or encryption
Compiler and linker artifacts — often reveal the programming language or framework used

YARA Rules

YARA is a pattern-matching tool for malware identification. A YARA rule defines conditions based on strings, byte patterns, and PE header attributes. When a file matches a YARA rule, it is flagged as matching a known malware family or malicious pattern.

yara

rule AgentTesla_Keylogger {
    meta:
        description = "Detects Agent Tesla keylogger based on string patterns"
        author = "Tacuns Academy (example)"
        date = "2026-05-26"

    strings:
        $s1 = "smtp.gmail.com" ascii nocase
        $s2 = "KeyLogger" ascii
        $s3 = "GetKeyboardState" ascii
        $pdb = "AgentTesla" ascii nocase

    condition:
        uint16(0) == 0x5A4D and      // MZ header (Windows PE)
        filesize < 5MB and
        (
            ($s1 and $s2) or
            ($s3 and $pdb)
        )
}

Dynamic Analysis

Dynamic analysis executes the malware in a controlled environment and observes its behavior. This reveals what the malware actually does — network connections, file system changes, registry modifications, process creation — information that may be obfuscated in static analysis due to packing or encryption.

What to Monitor During Dynamic Analysis

Network traffic — capture all connections using Wireshark or the sandbox's network monitor. Look for C2 communications, DNS queries, HTTP POSTs exfiltrating data
Registry changes — which keys are created, modified, or deleted? Persistence mechanisms often use HKCU\Run or HKLM\Run
File system changes — which files are created, modified, or deleted? Look for dropped payloads, encrypted files
Process creation — which child processes are spawned? cmd.exe, powershell.exe, wscript.exe spawned by Office or browser is suspicious
API calls — using process monitoring tools, record which Windows API functions are called and with what parameters

Sandbox Tools

Tool	Type	Features
Any.run	Commercial/Free tier	Interactive sandbox; real-time analysis; MITRE ATT&CK mapping
VirusTotal	Freemium	Multi-AV scan; static analysis; behavioral report (Cuckoo-based)
Joe Sandbox	Commercial	Deep behavioral analysis; deobfuscation; YARA integration
Cuckoo Sandbox	Open source	Self-hosted; fully customizable analysis environment
CAPE Sandbox	Open source	Extends Cuckoo; focuses on config extraction

VirusTotal

VirusTotal (virustotal.com) scans uploaded files with 70+ antivirus engines and returns community comments, behavioral analysis reports, and IoC extraction. It is the fastest first-look tool for any suspicious file or URL. Be aware: files uploaded to VirusTotal are visible to all VirusTotal users — do not upload files containing sensitive organizational data.

Common Malware Techniques

Persistence Mechanisms

Registry Run Keys — HKCU/HKLM\Software\Microsoft\Windows\CurrentVersion\Run — executes on every user logon
Scheduled Tasks — schtasks.exe creates tasks that run on schedule, at logon, or on system event
Services — malware registers itself as a Windows Service for automatic startup with SYSTEM privileges
Startup Folder — drop executable in %AppData%\Roaming\Microsoft\Windows\Start Menu\Programs\Startup\
DLL Hijacking — place a malicious DLL in a location searched before the legitimate DLL path

Anti-Analysis Techniques

VM detection — check for VMware/VirtualBox artifacts (registry keys, MAC addresses, running processes) and exit cleanly to avoid sandbox analysis
Sandbox evasion — delay execution (sleep for 5+ minutes), check for user activity (mouse movement, foreground window), check for minimum RAM or processes
Code packing — compress or encrypt the malicious payload; decompress only at runtime to defeat static signature detection
Obfuscation — encode strings as XOR or Base64, split C2 URLs into fragments, use indirect API calls via GetProcAddress
Anti-debugging — detect attached debuggers via IsDebuggerPresent, NtQueryInformationProcess, timing checks

C2 Communication Patterns

→

HTTP/HTTPS beaconing — malware sends regular HTTP requests to its C2 server. Look for regular intervals (every 60 seconds, 5 minutes), HTTP POST with Base64-encoded body, or HTTP GET to unusual URLs.

→

DNS tunneling — encode commands and data in DNS subdomain labels and TXT records. Detectable by high query frequency and long domain names.

→

Domain Generation Algorithms (DGA) — malware generates hundreds of candidate C2 domains daily based on a seed (date, fixed value). Only the operator registers one; the rest are sinkholes or unregistered. Defenders can generate and pre-register/sinkhole DGA domains.

Previous Module Next Module