Grand Totals: LCS 38623, GCS 253809, CR 422506. Collection span: 2000 - 2010.

Word List (9543). Word count: 60326; all words count: 85614.

| # | Word | Recs | TLCS | TGCS |
|---|---|---|---|---|
| 1 | HUMAN | 1466 | 20633 | 71078 |
| 2 | GENOME | 985 | 19210 | 60454 |
| 3 | ANALYSIS | 625 | 11266 | 33517 |
| 4 | SEQUENCING | 167 | 9031 | 14163 |
| 5 | INITIAL | 15 | 8709 | 11680 |
| 6 | SEQUENCE | 362 | 5814 | 20037 |
| 7 | MOUSE | 269 | 2704 | 14766 |
| 8 | EVOLUTION | 378 | 2614 | 14632 |
| 9 | GENE | 900 | 2405 | 26252 |
| 10 | COMPARATIVE | 214 | 2183 | 10004 |
| 11 | ELEMENTS | 239 | 2102 | 8711 |
| 12 | GENOMIC | 382 | 1955 | 11236 |
| 13 | GENES | 508 | 1800 | 16039 |
| 14 | ALU | 129 | 1735 | 3651 |
| 15 | SPLICING | 238 | 1699 | 11392 |
| 16 | MAMMALIAN | 178 | 1567 | 7660 |
| 17 | DNA | 523 | 1559 | 14439 |
| 18 | ALTERNATIVE | 186 | 1406 | 7800 |
| 19 | SEQUENCES | 199 | 1282 | 6718 |
| 20 | PROTEIN | 522 | 1218 | 21616 |
| 21 | CHROMOSOME | 267 | 1203 | 6875 |
| 22 | GENOMES | 226 | 1161 | 6403 |
| 23 | RETROTRANSPOSITION | 67 | 1131 | 2072 |
| 24 | FUNCTIONAL | 262 | 1047 | 10496 |
| 25 | LINE | 107 | 985 | 2121 |
| 26 | ENDOGENOUS | 156 | 874 | 2870 |
| 27 | SPECIFIC | 185 | 785 | 5482 |
| 28 | TRANSPOSABLE | 96 | 748 | 2495 |
| 29 | IDENTIFICATION | 292 | 743 | 8456 |
| 30 | EXPRESSION | 391 | 732 | 9711 |
| 31 | GENETIC | 280 | 710 | 9683 |
| 32 | VARIATION | 109 | 698 | 5106 |
| 33 | DUPLICATIONS | 43 | 689 | 2714 |
| 34 | TRANSCRIPTION | 153 | 684 | 6139 |
| 35 | GENOMICS | 311 | 679 | 7664 |
| 36 | DATABASE | 76 | 664 | 6636 |
| 37 | RETROTRANSPOSONS | 77 | 660 | 1683 |
| 38 | SEGMENTAL | 42 | 635 | 2191 |
| 39 | NEW | 242 | 623 | 6938 |
| 40 | MEDIATED | 71 | 620 | 2890 |
| 41 | DIVERSITY | 68 | 614 | 3630 |
| 42 | COMPARISON | 60 | 613 | 3745 |
| 43 | MAP | 86 | 605 | 4053 |
| 44 | MRNA | 99 | 586 | 5257 |
| 45 | TRANSCRIPTOME | 78 | 584 | 3429 |
| 46 | STRUCTURE | 194 | 582 | 7346 |
| 47 | INSIGHTS | 60 | 570 | 3434 |
| 48 | EVOLUTIONARY | 133 | 566 | 3217 |
| 49 | WIDE | 116 | 566 | 4680 |
| 50 | ASSEMBLY | 49 | 561 | 3015 |
| 51 | REGIONS | 96 | 561 | 2845 |
| 52 | PROJECT | 64 | 560 | 4138 |
| 53 | RNA | 151 | 556 | 4381 |
| 54 | FAMILY | 173 | 549 | 5783 |
| 55 | TRANSCRIPTIONAL | 93 | 540 | 4488 |
| 56 | REGULATORY | 99 | 528 | 4965 |
| 57 | VERTEBRATE | 70 | 528 | 4244 |
| 58 | RECOMBINATION | 56 | 526 | 2919 |
| 59 | USING | 259 | 525 | 7423 |
| 60 | NOVEL | 277 | 512 | 5135 |
| 61 | WHOLE | 61 | 503 | 2330 |
| 62 | ELEMENT | 104 | 501 | 1888 |
| 63 | DISEASE | 196 | 500 | 5497 |
| 64 | LENGTH | 60 | 498 | 3197 |
| 65 | HIGH | 178 | 493 | 5881 |
| 66 | CHIMPANZEE | 38 | 488 | 1684 |
| 67 | REPEATS | 97 | 477 | 1764 |
| 68 | RAT | 42 | 474 | 1701 |
| 69 | HUMANS | 65 | 466 | 2629 |
| 70 | RECENT | 50 | 462 | 2208 |
| 71 | LARGE | 131 | 459 | 4692 |
| 72 | SCALE | 103 | 459 | 4593 |
| 73 | CHROMOSOMES | 57 | 457 | 2651 |
| 74 | FULL | 38 | 450 | 2750 |
| 75 | DUPLICATION | 58 | 446 | 2645 |
| 76 | BASED | 206 | 445 | 4950 |
| 77 | REVEALS | 66 | 443 | 2808 |
| 78 | ASSOCIATED | 114 | 438 | 3407 |
| 79 | ANNOTATION | 70 | 437 | 3217 |
| 80 | EXON | 56 | 434 | 2285 |
| 81 | BIOLOGY | 112 | 431 | 3606 |
| 82 | PRIMATE | 49 | 420 | 1538 |
| 83 | MOBILE | 29 | 417 | 1326 |
| 84 | RETROTRANSPOSON | 67 | 417 | 1140 |
| 85 | RETROVIRUS | 82 | 413 | 1341 |
| 86 | PSEUDOGENES | 47 | 412 | 1689 |
| 87 | CELLS | 163 | 410 | 5593 |
| 88 | NON | 109 | 410 | 3105 |
| 89 | EVIDENCE | 86 | 405 | 2887 |
| 90 | POLYMORPHISMS | 96 | 405 | 3332 |
| 91 | SELECTION | 80 | 405 | 2399 |
| 92 | EXONS | 36 | 398 | 1420 |
| 93 | IMPLICATIONS | 98 | 397 | 2649 |
| 94 | ROLE | 139 | 392 | 4106 |
| 95 | CHARACTERIZATION | 154 | 391 | 4019 |
| 96 | SITES | 94 | 389 | 3116 |
| 97 | INSERTION | 51 | 382 | 1225 |
| 98 | DRAFT | 20 | 379 | 1525 |
| 99 | REPEAT | 77 | 370 | 2184 |
| 100 | CODING | 80 | 367 | 2447 |
| 101 | COMPLEX | 125 | 365 | 4069 |
| 102 | ANALYSES | 51 | 364 | 2931 |
| 103 | SHOTGUN | 21 | 362 | 1390 |
| 104 | EUKARYOTIC | 65 | 360 | 2047 |
| 105 | ACTIVITY | 70 | 354 | 1974 |
| 106 | INTEGRATION | 69 | 349 | 2372 |
| 107 | NUCLEOTIDE | 101 | 347 | 4096 |
| 108 | ORIGIN | 39 | 346 | 1396 |
| 109 | PRE | 49 | 343 | 2763 |
| 110 | ISOCHORES | 27 | 342 | 715 |
| 111 | PROTEINS | 207 | 338 | 5784 |
| 112 | MOLECULAR | 224 | 336 | 4900 |
| 113 | PROTEOME | 62 | 335 | 4212 |
| 114 | RETROVIRUSES | 55 | 334 | 888 |
| 115 | CANCER | 234 | 328 | 6781 |
| 116 | SPECIES | 55 | 328 | 2335 |
| 117 | DISTRIBUTION | 76 | 325 | 1512 |
| 118 | RESOLUTION | 45 | 323 | 3150 |
| 119 | ORGANIZATION | 63 | 314 | 3731 |
| 120 | REGULATION | 118 | 309 | 4640 |
| 121 | PHYSICAL | 40 | 304 | 1400 |
| 122 | BINDING | 133 | 302 | 4962 |
| 123 | CONSERVED | 66 | 300 | 3637 |
| 124 | ACTIVE | 42 | 299 | 1572 |
| 125 | EXPANSION | 29 | 297 | 1429 |
| 126 | SINES | 28 | 291 | 694 |
| 127 | FAMILIES | 51 | 290 | 3523 |
| 128 | LIKE | 64 | 288 | 3203 |
| 129 | BROWN | 2 | 287 | 798 |
| 130 | NORWAY | 2 | 287 | 798 |
| 131 | COMPUTATIONAL | 81 | 283 | 2063 |
| 132 | WIDESPREAD | 22 | 279 | 2101 |
| 133 | ENSEMBL | 10 | 278 | 1286 |
| 134 | LINES | 37 | 278 | 826 |
| 135 | DROSOPHILA | 83 | 275 | 2370 |
| 136 | RNAS | 38 | 273 | 2386 |
| 137 | MAPS | 45 | 272 | 1389 |
| 138 | METHYLATION | 72 | 272 | 3648 |
| 139 | ALTERNATIVELY | 22 | 270 | 772 |
| 140 | MICROARRAYS | 78 | 270 | 3122 |
| 141 | SPLICED | 24 | 270 | 812 |
| 142 | HERV | 51 | 269 | 583 |
| 143 | YIELDS | 2 | 269 | 758 |
| 144 | SYSTEMATIC | 34 | 268 | 4067 |
| 145 | TOOL | 49 | 267 | 1903 |
| 146 | REARRANGEMENTS | 25 | 260 | 1164 |
| 147 | MUTATION | 84 | 259 | 1940 |
| 148 | YEAST | 57 | 258 | 3902 |
| 149 | PROMOTER | 66 | 257 | 1776 |
| 150 | REGION | 113 | 257 | 1965 |
| 151 | MAMMALS | 37 | 256 | 1849 |
| 152 | STRUCTURAL | 99 | 251 | 2611 |
| 153 | CDNAS | 29 | 250 | 1529 |
| 154 | SPLICE | 54 | 250 | 1139 |
| 155 | CHICKEN | 36 | 248 | 1363 |
| 156 | EXPRESSED | 89 | 246 | 2314 |
| 157 | RELATED | 84 | 246 | 2309 |
| 158 | EUCHROMATIC | 2 | 245 | 928 |
| 159 | IMPACT | 68 | 244 | 1584 |
| 160 | DISCOVERY | 154 | 243 | 4633 |
| 161 | MODEL | 95 | 243 | 2049 |
| 162 | CELL | 184 | 241 | 5496 |
| 163 | LONG | 74 | 241 | 1076 |
| 164 | REPETITIVE | 49 | 237 | 1014 |
| 165 | FINISHING | 4 | 234 | 912 |
| 166 | PATTERNS | 63 | 233 | 2780 |
| 167 | CONTENT | 40 | 231 | 1010 |
| 168 | FUNCTION | 122 | 231 | 3582 |
| 169 | MULTIPLE | 84 | 231 | 2004 |
| 170 | RETROVIRAL | 50 | 231 | 1319 |
| 171 | SINGLE | 108 | 230 | 3388 |
| 172 | LTR | 55 | 229 | 625 |
| 173 | TRANSCRIPTS | 52 | 228 | 1239 |
| 174 | TRANSPOSITION | 21 | 228 | 980 |
| 175 | SITE | 68 | 226 | 1880 |
| 176 | ALIGNMENT | 18 | 220 | 1534 |
| 177 | SURVEY | 24 | 220 | 1608 |
| 178 | DETECTION | 96 | 218 | 2374 |
| 179 | TARGET | 81 | 218 | 1954 |
| 180 | BLAST | 8 | 213 | 1651 |
| 181 | DOMAINS | 60 | 209 | 3044 |
| 182 | GENERATION | 50 | 207 | 1970 |
| 183 | VIVO | 42 | 207 | 991 |
| 184 | BLAT | 1 | 206 | 1355 |
| 185 | DELETION | 38 | 206 | 865 |
| 186 | PERSPECTIVES | 28 | 205 | 1186 |
| 187 | PROMOTERS | 38 | 204 | 2439 |
| 188 | COMPLETE | 35 | 201 | 1737 |
| 189 | TRANSCRIPTOMES | 7 | 201 | 1829 |
| 190 | DELETIONS | 30 | 200 | 650 |
| 191 | HISTORY | 44 | 199 | 1214 |
| 192 | INSERTIONS | 37 | 199 | 597 |
| 193 | HAPLOTYPE | 26 | 197 | 1761 |
| 194 | DOMAIN | 84 | 195 | 2504 |
| 195 | FUGU | 8 | 194 | 880 |
| 196 | FUTURE | 70 | 194 | 1633 |
| 197 | VIEW | 25 | 194 | 1196 |
| 198 | SHORT | 43 | 193 | 1038 |
| 199 | EUKARYOTES | 33 | 192 | 2089 |
| 200 | NONCODING | 28 | 192 | 1396 |