This document analyzes the distribution of source code file sizes across several programming languages. It finds that although file sizes appear lognormally distributed, the tails deviate significantly from the lognormal and are better modeled by a double Pareto distribution. This distribution, which has a lognormal body and power-law tails, fits the file sizes of C, C++, Java, Python, and Lisp. Shell and Perl fit neither model as well as the other languages do. The large files in the tails, though scarce, account for a disproportionately large share of the total source code size. This result challenges the lognormal assumption used in software estimation models.
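
To make the claim about the tails concrete, the following minimal sketch compares how much of the total size is held by the largest files under a pure lognormal model and under a distribution with a lognormal body and power-law tails. It assumes one common construction of such a distribution, the double Pareto-lognormal form, in which the logarithm of the size is a normal variate plus one scaled exponential variate minus another; this may not be the exact parameterization used in the analysis. All parameter values (mu, sigma, alpha, beta, the sample size, and the seed) and all function names are hypothetical and chosen only for illustration; they are not fitted to any data set.

```python
import math
import random


def sample_lognormal(n, mu, sigma, rng):
    """Draw n file sizes from a pure lognormal model."""
    return [rng.lognormvariate(mu, sigma) for _ in range(n)]


def sample_double_pareto_lognormal(n, mu, sigma, alpha, beta, rng):
    """Draw n sizes with a lognormal body and power-law tails.

    Assumed construction: log X = N(mu, sigma) + E1/alpha - E2/beta,
    where E1 and E2 are independent standard exponential variates.
    alpha controls the upper tail, beta the lower tail.
    """
    sizes = []
    for _ in range(n):
        log_x = (rng.gauss(mu, sigma)
                 + rng.expovariate(1.0) / alpha
                 - rng.expovariate(1.0) / beta)
        sizes.append(math.exp(log_x))
    return sizes


def top_share(sizes, fraction):
    """Return the share of the total size held by the largest `fraction` of files."""
    ordered = sorted(sizes, reverse=True)
    k = max(1, int(len(ordered) * fraction))  # always count at least one file
    return sum(ordered[:k]) / sum(ordered)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs give the same numbers
    n = 20000
    # Illustrative parameters only; not estimated from real source trees.
    lognormal_sizes = sample_lognormal(n, mu=8.0, sigma=1.0, rng=rng)
    dpln_sizes = sample_double_pareto_lognormal(
        n, mu=8.0, sigma=1.0, alpha=1.2, beta=2.0, rng=rng)
    print("top 1% share, lognormal:     ", round(top_share(lognormal_sizes, 0.01), 3))
    print("top 1% share, double Pareto: ", round(top_share(dpln_sizes, 0.01), 3))
```

With a small upper-tail exponent, the largest one percent of files would be expected to hold a noticeably larger share of the total under the power-law-tailed model than under the lognormal one, which is the effect the analysis describes. Applying `top_share` to measured file sizes instead of synthetic samples gives the corresponding empirical figure.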