Learning Agentic Policy from Action Guidance · AI HOT