file1,
file2, etc. all in the same folder or directory;
first, locate one of the files with the [Browse...] button. Then alter the
file path in the box above to include '*' character (eg. file*).
Wildcard '?' to blanket only one character works too.| Historiographs |
Grand Totals: LCS 18020, GCS 180410, CR 319862 Collection span: 2000 - 2010 | |
| Word(i) List (7751) Word count: 41228, All words count: 58019 | ||
| # | Word | Recs | TLCS | TGCS |
|---|---|---|---|---|
| 1 | GENOME | 628 | 11916 | 50546 |
| 2 | HUMAN | 846 | 11275 | 50278 |
| 3 | ANALYSIS | 435 | 4690 | 24998 |
| 4 | SEQUENCE | 243 | 6486 | 18915 |
| 5 | GENE | 589 | 992 | 17580 |
| 6 | PROTEIN | 455 | 624 | 16080 |
| 7 | SEQUENCING | 130 | 3957 | 12324 |
| 8 | MOUSE | 161 | 1061 | 11124 |
| 9 | INITIAL | 8 | 3678 | 11042 |
| 10 | GENES | 332 | 629 | 8587 |
| 11 | DNA | 324 | 557 | 7949 |
| 12 | GENETIC | 247 | 278 | 7612 |
| 13 | EXPRESSION | 284 | 243 | 7406 |
| 14 | FUNCTIONAL | 205 | 443 | 7246 |
| 15 | USING | 224 | 361 | 6620 |
| 16 | EVOLUTION | 141 | 524 | 6461 |
| 17 | GENOMIC | 235 | 484 | 6296 |
| 18 | STRUCTURE | 126 | 263 | 6181 |
| 19 | COMPARATIVE | 117 | 769 | 5971 |
| 20 | IDENTIFICATION | 210 | 350 | 5890 |
| 21 | SPLICING | 85 | 287 | 5200 |
| 22 | TRANSCRIPTION | 118 | 194 | 5159 |
| 23 | CANCER | 189 | 186 | 5020 |
| 24 | HAPLOTYPE | 25 | 280 | 4895 |
| 25 | NUCLEOTIDE | 96 | 271 | 4839 |
| 26 | BASED | 171 | 228 | 4805 |
| 27 | NEW | 180 | 240 | 4724 |
| 28 | GENOMICS | 239 | 342 | 4653 |
| 29 | REGULATION | 85 | 135 | 4401 |
| 30 | DISCOVERY | 164 | 236 | 4205 |
| # | Word | Recs | TLCS | TGCS |
| 31 | FAMILY | 88 | 118 | 4164 |
| 32 | NOVEL | 209 | 311 | 4161 |
| 33 | RECEPTOR | 138 | 171 | 4136 |
| 34 | DISEASE | 144 | 226 | 4058 |
| 35 | BINDING | 104 | 157 | 4039 |
| 36 | MOLECULAR | 181 | 162 | 3985 |
| 37 | CHROMOSOME | 136 | 462 | 3979 |
| 38 | FUNCTION | 106 | 148 | 3856 |
| 39 | KINASE | 45 | 73 | 3842 |
| 40 | MAMMALIAN | 66 | 253 | 3791 |
| 41 | CELL | 154 | 135 | 3766 |
| 42 | PROTEINS | 155 | 134 | 3753 |
| 43 | DRUG | 165 | 186 | 3636 |
| 44 | MAP | 64 | 281 | 3632 |
| 45 | FACTOR | 80 | 114 | 3545 |
| 46 | LARGE | 78 | 182 | 3536 |
| 47 | COMPLEX | 76 | 235 | 3518 |
| 48 | PROJECT | 56 | 246 | 3514 |
| 49 | PROTEOMICS | 137 | 200 | 3508 |
| 50 | SINGLE | 102 | 265 | 3503 |
| 51 | RECEPTORS | 80 | 133 | 3440 |
| 52 | DRAFT | 14 | 278 | 3383 |
| 53 | HIGH | 164 | 190 | 3341 |
| 54 | WIDE | 75 | 213 | 3263 |
| 55 | SIGNALING | 61 | 51 | 3240 |
| 56 | ASSOCIATION | 58 | 119 | 3171 |
| 57 | SEQUENCES | 91 | 271 | 3158 |
| 58 | DEVELOPMENT | 121 | 110 | 3155 |
| 59 | CHARACTERIZATION | 103 | 123 | 3108 |
| 60 | POLYMORPHISMS | 92 | 197 | 3034 |
| # | Word | Recs | TLCS | TGCS |
| 61 | ALTERNATIVE | 64 | 224 | 2977 |
| 62 | COUPLED | 89 | 181 | 2933 |
| 63 | SCALE | 64 | 159 | 2921 |
| 64 | GENOMES | 104 | 211 | 2919 |
| 65 | SPECIFIC | 91 | 166 | 2907 |
| 66 | VARIATION | 67 | 204 | 2855 |
| 67 | CELLS | 95 | 81 | 2795 |
| 68 | LINKAGE | 32 | 91 | 2694 |
| 69 | ASSEMBLY | 49 | 345 | 2632 |
| 70 | MRNA | 49 | 103 | 2615 |
| 71 | BIOLOGY | 105 | 212 | 2606 |
| 72 | RICE | 21 | 152 | 2600 |
| 73 | RNA | 69 | 92 | 2599 |
| 74 | LENGTH | 33 | 210 | 2503 |
| 75 | DOMAINS | 43 | 91 | 2486 |
| 76 | SITES | 50 | 150 | 2486 |
| 77 | METHYLATION | 36 | 79 | 2483 |
| 78 | FULL | 23 | 215 | 2453 |
| 79 | VERTEBRATE | 36 | 152 | 2446 |
| 80 | FACTORS | 53 | 62 | 2418 |
| 81 | ORYZA | 5 | 137 | 2411 |
| 82 | SATIVA | 5 | 137 | 2411 |
| 83 | SSP | 3 | 136 | 2384 |
| 84 | ANNOTATION | 51 | 211 | 2358 |
| 85 | ROLE | 84 | 86 | 2331 |
| 86 | TRANSCRIPTOME | 39 | 242 | 2301 |
| 87 | ELEMENTS | 44 | 170 | 2288 |
| 88 | NON | 45 | 147 | 2223 |
| 89 | GENETICS | 127 | 58 | 2194 |
| 90 | TRANSCRIPTIONAL | 53 | 154 | 2166 |
| # | Word | Recs | TLCS | TGCS |
| 91 | PROTEOME | 51 | 128 | 2144 |
| 92 | PROFILING | 74 | 78 | 2141 |
| 93 | TRAITS | 12 | 74 | 2139 |
| 94 | DATA | 88 | 104 | 2130 |
| 95 | BLOCKS | 7 | 102 | 2122 |
| 96 | MUTATIONS | 39 | 56 | 2106 |
| 97 | RECENT | 34 | 177 | 2086 |
| 98 | INTEGRATION | 55 | 125 | 2073 |
| 99 | MASS | 64 | 72 | 2070 |
| 100 | APPROACHES | 71 | 150 | 2043 |
| 101 | INSIGHTS | 35 | 215 | 2007 |
| 102 | WHOLE | 52 | 307 | 2002 |
| 103 | MAPPING | 68 | 113 | 2001 |
| 104 | MICROARRAYS | 67 | 116 | 2000 |
| 105 | SYSTEM | 80 | 110 | 1940 |
| 106 | CDNA | 43 | 107 | 1929 |
| 107 | STRUCTURAL | 71 | 104 | 1921 |
| 108 | CODING | 39 | 147 | 1917 |
| 109 | SYSTEMS | 63 | 92 | 1871 |
| 110 | COMPARISON | 57 | 296 | 1860 |
| 111 | MICROARRAY | 67 | 81 | 1852 |
| 112 | DUPLICATIONS | 20 | 213 | 1839 |
| 113 | TARGET | 70 | 99 | 1814 |
| 114 | BETA | 54 | 25 | 1813 |
| 115 | CHROMOSOMES | 27 | 169 | 1779 |
| 116 | COMPLEMENT | 8 | 62 | 1738 |
| 117 | ASSOCIATED | 62 | 48 | 1714 |
| 118 | ANALYSES | 32 | 73 | 1705 |
| 119 | GLOBAL | 28 | 73 | 1699 |
| 120 | IMPLICATIONS | 68 | 120 | 1691 |
| # | Word | Recs | TLCS | TGCS |
| 121 | ACID | 42 | 48 | 1683 |
| 122 | GENOTYPING | 51 | 117 | 1678 |
| 123 | REGULATORY | 60 | 80 | 1669 |
| 124 | LIKE | 42 | 35 | 1650 |
| 125 | RNAS | 17 | 149 | 1649 |
| 126 | FUTURE | 76 | 124 | 1640 |
| 127 | DIVERSITY | 32 | 49 | 1629 |
| 128 | MUTATION | 43 | 57 | 1626 |
| 129 | PATHWAYS | 31 | 21 | 1609 |
| 130 | RELATED | 65 | 84 | 1600 |
| 131 | PATTERNS | 22 | 42 | 1597 |
| 132 | TRANSCRIPTOMES | 6 | 56 | 1596 |
| 133 | DOMAIN | 49 | 81 | 1584 |
| 134 | MICE | 32 | 38 | 1567 |
| 135 | ELEGANS | 15 | 53 | 1557 |
| 136 | MODEL | 77 | 40 | 1529 |
| 137 | ACTIVITY | 42 | 69 | 1525 |
| 138 | COMPUTATIONAL | 61 | 186 | 1525 |
| 139 | HAPMAP | 7 | 91 | 1509 |
| 140 | EPIGENETIC | 16 | 30 | 1504 |
| 141 | INTERNATIONAL | 2 | 90 | 1501 |
| 142 | DATABASE | 41 | 91 | 1496 |
| 143 | INTERACTIONS | 54 | 33 | 1495 |
| 144 | RESEARCH | 116 | 81 | 1494 |
| 145 | THROUGHPUT | 84 | 97 | 1485 |
| 146 | DISEASES | 61 | 64 | 1475 |
| 147 | APPROACH | 76 | 109 | 1458 |
| 148 | CONSERVED | 30 | 54 | 1454 |
| 149 | EUKARYOTIC | 35 | 102 | 1453 |
| 150 | NONSENSE | 9 | 44 | 1451 |
| # | Word | Recs | TLCS | TGCS |
| 151 | MEDIATED | 33 | 43 | 1439 |
| 152 | SYSTEMATIC | 22 | 61 | 1439 |
| 153 | SPECTROMETRY | 52 | 48 | 1429 |
| 154 | USE | 61 | 73 | 1426 |
| 155 | GENERATION | 38 | 115 | 1416 |
| 156 | CDNAS | 22 | 153 | 1412 |
| 157 | DETECTION | 63 | 76 | 1401 |
| 158 | BIOLOGICAL | 48 | 47 | 1399 |
| 159 | NUCLEAR | 38 | 30 | 1389 |
| 160 | UNDERSTANDING | 35 | 77 | 1383 |
| 161 | COMMON | 37 | 70 | 1356 |
| 162 | ENCODING | 26 | 31 | 1352 |
| 163 | TWO | 59 | 78 | 1324 |
| 164 | ARRAYS | 27 | 67 | 1310 |
| 165 | PROMOTERS | 16 | 67 | 1310 |
| 166 | CELLULAR | 39 | 30 | 1307 |
| 167 | DUPLICATION | 26 | 128 | 1299 |
| 168 | HAPLOTYPES | 13 | 23 | 1299 |
| 169 | PSEUDOGENES | 21 | 137 | 1295 |
| 170 | EXPRESSED | 51 | 73 | 1278 |
| 171 | CAENORHABDITIS | 10 | 30 | 1267 |
| 172 | MEMORY | 6 | 28 | 1258 |
| 173 | PHASE | 22 | 22 | 1250 |
| 174 | SEGMENTAL | 15 | 152 | 1235 |
| 175 | SHOTGUN | 24 | 254 | 1235 |
| 176 | REGION | 48 | 63 | 1227 |
| 177 | TARGETS | 55 | 50 | 1221 |
| 178 | BASIS | 18 | 26 | 1220 |
| 179 | JAPONICA | 2 | 69 | 1214 |
| 180 | TRANSPORTER | 20 | 18 | 1213 |
| # | Word | Recs | TLCS | TGCS |
| 181 | ENCODE | 6 | 111 | 1207 |
| 182 | EXPANSION | 15 | 47 | 1202 |
| 183 | INTERACTION | 29 | 42 | 1197 |
| 184 | INDICA | 2 | 69 | 1189 |
| 185 | ARCHITECTURE | 15 | 56 | 1186 |
| 186 | ACTIVE | 25 | 55 | 1168 |
| 187 | SNP | 50 | 70 | 1162 |
| 188 | MULTIPLE | 51 | 40 | 1161 |
| 189 | SMALL | 35 | 51 | 1154 |
| 190 | PRE | 21 | 49 | 1153 |
| 191 | DROSOPHILA | 48 | 55 | 1141 |
| 192 | MALARIA | 19 | 92 | 1140 |
| 193 | TECHNOLOGIES | 42 | 72 | 1136 |
| 194 | ZINC | 65 | 113 | 1128 |
| 195 | SELECTION | 40 | 59 | 1126 |
| 196 | CHORDATE | 5 | 101 | 1121 |
| 197 | RNAI | 3 | 27 | 1118 |
| 198 | DISTINCT | 18 | 15 | 1116 |
| 199 | PREDICTION | 51 | 47 | 1114 |
| 200 | MAPS | 29 | 92 | 1113 |