file1,
file2, etc. all in the same folder or directory;
first, locate one of the files with the [Browse...] button. Then alter the
file path in the box above to include '*' character (eg. file*).
Wildcard '?' to blanket only one character works too.| Historiographs |
Grand Totals: LCS 18020, GCS 180410, CR 319862 Collection span: 2000 - 2010 | |
| Word(i) List (7751) Word count: 41228, All words count: 58019 | ||
| # | Word | Recs | TLCS | TGCS |
|---|---|---|---|---|
| 1 | GENOME | 628 | 11916 | 50546 |
| 2 | HUMAN | 846 | 11275 | 50278 |
| 3 | SEQUENCE | 243 | 6486 | 18915 |
| 4 | ANALYSIS | 435 | 4690 | 24998 |
| 5 | SEQUENCING | 130 | 3957 | 12324 |
| 6 | INITIAL | 8 | 3678 | 11042 |
| 7 | MOUSE | 161 | 1061 | 11124 |
| 8 | GENE | 589 | 992 | 17580 |
| 9 | COMPARATIVE | 117 | 769 | 5971 |
| 10 | GENES | 332 | 629 | 8587 |
| 11 | PROTEIN | 455 | 624 | 16080 |
| 12 | DNA | 324 | 557 | 7949 |
| 13 | EVOLUTION | 141 | 524 | 6461 |
| 14 | GENOMIC | 235 | 484 | 6296 |
| 15 | CHROMOSOME | 136 | 462 | 3979 |
| 16 | FUNCTIONAL | 205 | 443 | 7246 |
| 17 | USING | 224 | 361 | 6620 |
| 18 | IDENTIFICATION | 210 | 350 | 5890 |
| 19 | ASSEMBLY | 49 | 345 | 2632 |
| 20 | GENOMICS | 239 | 342 | 4653 |
| 21 | NOVEL | 209 | 311 | 4161 |
| 22 | WHOLE | 52 | 307 | 2002 |
| 23 | COMPARISON | 57 | 296 | 1860 |
| 24 | SPLICING | 85 | 287 | 5200 |
| 25 | MAP | 64 | 281 | 3632 |
| 26 | HAPLOTYPE | 25 | 280 | 4895 |
| 27 | DRAFT | 14 | 278 | 3383 |
| 28 | GENETIC | 247 | 278 | 7612 |
| 29 | NUCLEOTIDE | 96 | 271 | 4839 |
| 30 | SEQUENCES | 91 | 271 | 3158 |
| # | Word | Recs | TLCS | TGCS |
| 31 | SINGLE | 102 | 265 | 3503 |
| 32 | STRUCTURE | 126 | 263 | 6181 |
| 33 | SHOTGUN | 24 | 254 | 1235 |
| 34 | MAMMALIAN | 66 | 253 | 3791 |
| 35 | PROJECT | 56 | 246 | 3514 |
| 36 | EXPRESSION | 284 | 243 | 7406 |
| 37 | TRANSCRIPTOME | 39 | 242 | 2301 |
| 38 | NEW | 180 | 240 | 4724 |
| 39 | DISCOVERY | 164 | 236 | 4205 |
| 40 | COMPLEX | 76 | 235 | 3518 |
| 41 | BASED | 171 | 228 | 4805 |
| 42 | DISEASE | 144 | 226 | 4058 |
| 43 | ALTERNATIVE | 64 | 224 | 2977 |
| 44 | FULL | 23 | 215 | 2453 |
| 45 | INSIGHTS | 35 | 215 | 2007 |
| 46 | DUPLICATIONS | 20 | 213 | 1839 |
| 47 | WIDE | 75 | 213 | 3263 |
| 48 | BIOLOGY | 105 | 212 | 2606 |
| 49 | ANNOTATION | 51 | 211 | 2358 |
| 50 | GENOMES | 104 | 211 | 2919 |
| 51 | LENGTH | 33 | 210 | 2503 |
| 52 | VARIATION | 67 | 204 | 2855 |
| 53 | PROTEOMICS | 137 | 200 | 3508 |
| 54 | POLYMORPHISMS | 92 | 197 | 3034 |
| 55 | TRANSCRIPTION | 118 | 194 | 5159 |
| 56 | HIGH | 164 | 190 | 3341 |
| 57 | CANCER | 189 | 186 | 5020 |
| 58 | COMPUTATIONAL | 61 | 186 | 1525 |
| 59 | DRUG | 165 | 186 | 3636 |
| 60 | LARGE | 78 | 182 | 3536 |
| # | Word | Recs | TLCS | TGCS |
| 61 | COUPLED | 89 | 181 | 2933 |
| 62 | RECENT | 34 | 177 | 2086 |
| 63 | RECEPTOR | 138 | 171 | 4136 |
| 64 | ELEMENTS | 44 | 170 | 2288 |
| 65 | CHROMOSOMES | 27 | 169 | 1779 |
| 66 | FINISHING | 3 | 168 | 912 |
| 67 | EUCHROMATIC | 1 | 167 | 901 |
| 68 | SPECIFIC | 91 | 166 | 2907 |
| 69 | MOLECULAR | 181 | 162 | 3985 |
| 70 | SCALE | 64 | 159 | 2921 |
| 71 | BINDING | 104 | 157 | 4039 |
| 72 | TRANSCRIPTIONAL | 53 | 154 | 2166 |
| 73 | CDNAS | 22 | 153 | 1412 |
| 74 | RICE | 21 | 152 | 2600 |
| 75 | SEGMENTAL | 15 | 152 | 1235 |
| 76 | VERTEBRATE | 36 | 152 | 2446 |
| 77 | REVEALS | 35 | 151 | 995 |
| 78 | APPROACHES | 71 | 150 | 2043 |
| 79 | SITES | 50 | 150 | 2486 |
| 80 | RNAS | 17 | 149 | 1649 |
| 81 | FUNCTION | 106 | 148 | 3856 |
| 82 | CODING | 39 | 147 | 1917 |
| 83 | NON | 45 | 147 | 2223 |
| 84 | ORYZA | 5 | 137 | 2411 |
| 85 | PSEUDOGENES | 21 | 137 | 1295 |
| 86 | SATIVA | 5 | 137 | 2411 |
| 87 | RAT | 26 | 136 | 1101 |
| 88 | SSP | 3 | 136 | 2384 |
| 89 | CELL | 154 | 135 | 3766 |
| 90 | REGULATION | 85 | 135 | 4401 |
| # | Word | Recs | TLCS | TGCS |
| 91 | PROTEINS | 155 | 134 | 3753 |
| 92 | RECEPTORS | 80 | 133 | 3440 |
| 93 | DUPLICATION | 26 | 128 | 1299 |
| 94 | PROTEOME | 51 | 128 | 2144 |
| 95 | INTEGRATION | 55 | 125 | 2073 |
| 96 | FUTURE | 76 | 124 | 1640 |
| 97 | CHARACTERIZATION | 103 | 123 | 3108 |
| 98 | IMPLICATIONS | 68 | 120 | 1691 |
| 99 | ASSOCIATION | 58 | 119 | 3171 |
| 100 | BROWN | 1 | 118 | 752 |
| 101 | FAMILY | 88 | 118 | 4164 |
| 102 | NORWAY | 1 | 118 | 752 |
| 103 | YIELDS | 2 | 118 | 758 |
| 104 | GENOTYPING | 51 | 117 | 1678 |
| 105 | MICROARRAYS | 67 | 116 | 2000 |
| 106 | GENERATION | 38 | 115 | 1416 |
| 107 | FACTOR | 80 | 114 | 3545 |
| 108 | FUGU | 6 | 114 | 858 |
| 109 | MAPPING | 68 | 113 | 2001 |
| 110 | ZINC | 65 | 113 | 1128 |
| 111 | ENCODE | 6 | 111 | 1207 |
| 112 | FINGER | 59 | 111 | 991 |
| 113 | MEDICINE | 58 | 111 | 783 |
| 114 | CELERA | 5 | 110 | 190 |
| 115 | DEVELOPMENT | 121 | 110 | 3155 |
| 116 | RUBRIPES | 3 | 110 | 811 |
| 117 | SYSTEM | 80 | 110 | 1940 |
| 118 | APPROACH | 76 | 109 | 1458 |
| 119 | CDNA | 43 | 107 | 1929 |
| 120 | ENSEMBL | 4 | 107 | 351 |
| # | Word | Recs | TLCS | TGCS |
| 121 | DATA | 88 | 104 | 2130 |
| 122 | STRUCTURAL | 71 | 104 | 1921 |
| 123 | MRNA | 49 | 103 | 2615 |
| 124 | REGIONS | 39 | 103 | 1023 |
| 125 | BLOCKS | 7 | 102 | 2122 |
| 126 | EUKARYOTIC | 35 | 102 | 1453 |
| 127 | CHORDATE | 5 | 101 | 1121 |
| 128 | TARGET | 70 | 99 | 1814 |
| 129 | RESOLUTION | 26 | 97 | 1055 |
| 130 | SETS | 5 | 97 | 210 |
| 131 | THROUGHPUT | 84 | 97 | 1485 |
| 132 | PREDICTED | 6 | 96 | 274 |
| 133 | OVERLAP | 3 | 95 | 145 |
| 134 | COMPLEXITY | 24 | 93 | 814 |
| 135 | NUMBER | 24 | 93 | 506 |
| 136 | MALARIA | 19 | 92 | 1140 |
| 137 | MAPS | 29 | 92 | 1113 |
| 138 | RNA | 69 | 92 | 2599 |
| 139 | SYSTEMS | 63 | 92 | 1871 |
| 140 | DATABASE | 41 | 91 | 1496 |
| 141 | DOMAINS | 43 | 91 | 2486 |
| 142 | HAPMAP | 7 | 91 | 1509 |
| 143 | LINKAGE | 32 | 91 | 2694 |
| 144 | INTERNATIONAL | 2 | 90 | 1501 |
| 145 | ANOPHELES | 5 | 89 | 943 |
| 146 | DOG | 8 | 86 | 991 |
| 147 | ROLE | 84 | 86 | 2331 |
| 148 | RELATED | 65 | 84 | 1600 |
| 149 | VIEW | 16 | 83 | 988 |
| 150 | CELLS | 95 | 81 | 2795 |
| # | Word | Recs | TLCS | TGCS |
| 151 | DOMAIN | 49 | 81 | 1584 |
| 152 | MICROARRAY | 67 | 81 | 1852 |
| 153 | RESEARCH | 116 | 81 | 1494 |
| 154 | REGULATORY | 60 | 80 | 1669 |
| 155 | GAMBIAE | 4 | 79 | 862 |
| 156 | METHYLATION | 36 | 79 | 2483 |
| 157 | PROFILING | 74 | 78 | 2141 |
| 158 | TWO | 59 | 78 | 1324 |
| 159 | MOSQUITO | 3 | 77 | 852 |
| 160 | UNDERSTANDING | 35 | 77 | 1383 |
| 161 | DETECTION | 63 | 76 | 1401 |
| 162 | CIONA | 11 | 75 | 911 |
| 163 | ASSEMBLIES | 17 | 74 | 197 |
| 164 | ASSESSMENT | 31 | 74 | 772 |
| 165 | TRAITS | 12 | 74 | 2139 |
| 166 | ANALYSES | 32 | 73 | 1705 |
| 167 | EXPRESSED | 51 | 73 | 1278 |
| 168 | GLOBAL | 28 | 73 | 1699 |
| 169 | KINASE | 45 | 73 | 3842 |
| 170 | ORIGINS | 8 | 73 | 923 |
| 171 | USE | 61 | 73 | 1426 |
| 172 | MASS | 64 | 72 | 2070 |
| 173 | TECHNOLOGIES | 42 | 72 | 1136 |
| 174 | COMMON | 37 | 70 | 1356 |
| 175 | INTESTINALIS | 7 | 70 | 818 |
| 176 | SNP | 50 | 70 | 1162 |
| 177 | ACTIVITY | 42 | 69 | 1525 |
| 178 | INDICA | 2 | 69 | 1189 |
| 179 | JAPONICA | 2 | 69 | 1214 |
| 180 | TRANSCRIPTS | 30 | 69 | 724 |
| # | Word | Recs | TLCS | TGCS |
| 181 | DERIVED | 15 | 68 | 584 |
| 182 | ARRAYS | 27 | 67 | 1310 |
| 183 | ELECTROPHORESIS | 58 | 67 | 917 |
| 184 | EVIDENCE | 39 | 67 | 926 |
| 185 | PROMOTERS | 16 | 67 | 1310 |
| 186 | PHENOTYPES | 7 | 66 | 677 |
| 187 | SCREENING | 65 | 65 | 1094 |
| 188 | DISEASES | 61 | 64 | 1475 |
| 189 | ERA | 53 | 63 | 1061 |
| 190 | FIRST | 19 | 63 | 895 |
| 191 | NONCODING | 15 | 63 | 934 |
| 192 | REGION | 48 | 63 | 1227 |
| 193 | TAGS | 11 | 63 | 542 |
| 194 | UNDERLYING | 9 | 63 | 715 |
| 195 | COMPLEMENT | 8 | 62 | 1738 |
| 196 | DEFINITION | 11 | 62 | 584 |
| 197 | DISORDERS | 31 | 62 | 896 |
| 198 | FACTORS | 53 | 62 | 2418 |
| 199 | PAST | 13 | 62 | 693 |
| 200 | MENDELIAN | 3 | 61 | 555 |