This document describes two implementations of an Advanced Encryption Standard (AES) cipher with online key expansion mapped to a fine-grained many-core system. The first implementation, called "One Task One Processor", maps each step of the AES algorithm to a separate processor. The second implementation unrolls the AES algorithm loop nine times to break data dependencies and process multiple data blocks in parallel using about 60 cores. Evaluation on an FPGA shows the unrolled implementation achieves a throughput of 85.15 Gbps compared to 1.98 Gbps for the single-task implementation. The document also proposes a masked S-box technique to protect AES implementations from differential power analysis attacks.