Frequencies for first 10,000 rows of statename2 variable in zipcode dataset : clean statename for | geocoding | Freq. Percent Cum. -----------------------+----------------------------------- CONNECTICUT | 433 4.33 4.33 DELAWARE | 96 0.96 5.29 DISTRICTOFCOLUMBIA | 278 2.78 8.07 MAINE | 486 4.86 12.93 MARYLAND | 605 6.05 18.98 MASSACHUSETTS | 690 6.90 25.88 NEWHAMPSHIRE | 284 2.84 28.72 NEWJERSEY | 723 7.23 35.95 NEWYORK | 2,158 21.58 57.53 PENNSYLVANIA | 2,185 21.85 79.38 PUERTORICO | 176 1.76 81.14 RHODEISLAND | 90 0.90 82.04 VERMONT | 309 3.09 85.13 VIRGINIA | 1,216 12.16 97.29 VIRGINISLANDS | 16 0.16 97.45 WESTVIRGINIA | 255 2.55 100.00 -----------------------+----------------------------------- Total | 10,000 100.00 by Jean Roth , jroth@nber.org , 4 Jan 2016 tail -n+3 /homes/data/zip/sas/2013/desc/zipcode/byvar/tab/statename2.log | egr