Skip to content

✍️ Text Generation

🤖 AAAI2026 · 2 paper notes

📌 Same area in other venues: 💬 ACL2026 (10) · 🔬 ICLR2026 (3) · 📹 ICCV2025 (1)

AutoMalDesc: Large-Scale Script Analysis for Cyber Threat Research

This paper proposes AutoMalDesc, an automated static analysis framework that employs an iterative self-paced learning pipeline — starting from 900 expert-annotated seed samples, fine-tuning Llama-3.3-70B via LoRA to generate pseudo-labels, applying multi-stage quality filtering to obtain 101K samples, and training a V2 model — to achieve automated malware classification and behavior description across five scripting languages, improving Batch script detection accuracy from 52.7% to 82.4%.

C3TG: Conflict-aware, Composite, and Collaborative Controlled Text Generation

This paper proposes the C3TG framework, which achieves fine-grained multi-attribute controllable text generation through a two-stage approach: in the generation stage, weighted KL divergence is used to fuse attribute distributions and adjust token probabilities; in the optimization stage, an energy function (combining classifier scores and conflict penalty terms) drives iterative rewriting via a Feedback Agent. C3TG achieves 90.4% attribute accuracy across 17 attribute subcategories while substantially reducing toxicity.