GenBank style presentation of the D. melanogaster histone cluster (S form). Primary sequence regions are noted in the Features section. The third-strand target is noted by lower case lettering. This sequence has been modified from GenBank accession number X14215 (Matsuo 1989a), the histone cluster L form, and given its own identification name: "DMHISTS".
LOCUS DMHISTS 4801 bp DNA INV 22-AUG-1997 DEFINITION Drosophila histone gene cluster, S form of repeating unit (4.8 kB). ACCESSION X14215a (non-GenBank) SOURCE fruit fly. ORGANISM Drosophila melanogaster Eukaryotae; mitochondrial eukaryotes; Metazoa; Arthropoda; Tracheata; Insecta; Pterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila. REFERENCE 1 (bases 1 to 4801) AUTHORS Matsuo,Y, Yamazaki,T. MODIFIED Niederstrasser, H. FEATURES Location/Qualifiers source 1. .4801 /organism="Drosophila melanogaster" /strain="AK-194" /clone_lib="part. MboI in lambda EMBL4" /chromosome="IIR 39D-E." CDS 1. .463 /note="H1 histone (AA ) /label=H1_End CDS complement(870. .1241) /note="H2B histone (AA 1-123)" /label=H2B CDS 1468. .1842 /note="H2A histone (AA 1-124)" /label=H2A CDS complement(2317. .2628) /note="H4 histone (AA 1-103)" /label=H4 CDS 2925. .3335 /note="H3 histone (AA 1-136)" /label=H3 CDS 4494. .4801 /note="H1 histone (AA 1-256)" /label=H1_Start misc_feature 2787. .2802 /note=" Homopurine Target " /label=Homopurine_Target BASE COUNT 1433 a 1019 c 1013 g 1576 t ORIGIN Dmhists.gb Length: 4801 April 13, 1998 14:54 Type: N Check: 889 .. 1 TATTCAAACT AAGGGAAAGG GTGCATCTGG ATCTTTCAAA CTGTCGGCCT 51 CTGCCAAGAA GGAAAAGGAT CCGAAGGCAA AGTCGAAGGT TTTGTCTGCT 101 GAGAAAAAAG TTCAAAGCAA GAAGGTAGCC TCTAAGAAGA TTGGTGTCTC 151 CTCCAAAAAA ACTGCCGTTG GGGCTGCTGA CAAAAAGCCC AAAGCTAAGA 201 AGGCTGTGGC TACCAAAAAG ACTGCCGAAA ATAAGAAAAC TGAGAAGGCA 251 AAAGCCAAGG ATGCCAAGAA AACTGGAATC ATAAAGTCGA AGCCCGCCGC 301 AACAAAGGCG AAAGTGACTG CAGCGAAGCC AAAGGCTGTA GTAGCGAAAG 351 CGTCAAAGGC AAAGCCAGCG GTGTCTGCAA AACCCAAAAA GACGGTGAAG 401 AAAGCATCGG TTTCTGCTAC CGCCAAGAAG CCGAAAGCGA AGACTACGGC 451 TGCCAAAAAG TAAATTGTGA AAAAGTGCAG TATTTGGTAC ATGTTCGCAA 501 TTAAAATTTT AGATTTATGA TTTATAGATC TGAAATTTGT TTAAACAAGT 551 CCTTTTCAGG GCTACAACGT TCCGTTGCAA GAGAAAAAAA CTTTTATTTT 601 CTTCCACTTA TTTATTAGCT GACGTTCGCA GCAACAATAA AACGTTTCAT 651 GTCATGAATT ACATTGAATG TTGGTCGCAT TCAGTTTTCG TTCCCGATTT 701 TTTTGTATTT ATTTGAACAT TACCCAATTA CCCATATTGC GGGTAAATAA 751 GTTTTATTTG TAAATTCATA TTCGATGATT GGTGGTTGAA AAATGCATTT 801 CTTTGGTATA ACACATTGTG GCCCTGAAAA GGGCCGTTTT GGATTATTGT 851 CCGCATTCGC AGGAGAAAAT TATTTAGAGC TGGTGTACTT GGTGACAGCC 901 TTGGTTCCCT CACTGACAGC ATGCTTGGCC AACTCTCCAG GCAAAAGCAG 951 GCGAACAGCC GTTTGGATCT CCCGACTGGT GATGGTCGAG CGCTTGTTGT 1001 AGTGAGCTAG ACGAGACGCT TCGGCAGCAA TTCGCTCGAA AATATCATTT 1051 ACAAAGCTGT TCATTATGCT CATCGCCTTC GACGAAATTC CGGTGTCAGG 1101 ATGGACCTGC TTGAGAACCT TGTAAATGTA GATGGCATAG CTCTCCTTCC 1151 TTTTGCGCTT CTTTTTCTTG TCGGTCTTGG TGATGTTCTT CTGAGCCTTG 1201 CCAGCCTTCT TGGCTGCCTT TCCACTAGTT TTCGGAGGCA TTGTTCACGT 1251 TACTTATATT TTCACAAACA CAATTCACTT ATCGTAATGT GGGCCCGAAC 1301 GCGTTCACGT TTATACTTTT TTTCGAGCAG TCAATTCAGG TCTAAGTCAC 1351 CCACCCCTAA CTGAATGCGC AGGCAAACGG AAAAGTATAA ATATTTCGCT 1401 GTCTGGGTTA GGCGAGCATT CGTGTTCCGT GTGTAAAGTG AACTAAGTGA 1451 AATAAACGCA AAGCAAAATG TCTGGACGTG GAAAAGGTGG CAAAGTGAAG 1501 GGAAAGGCAA AGTCCCGCTC AAACCGTGCC GGTCTTCAAT TCCCTGTGGG 1551 CCGTATTCAC CGTTTGCTCC GGAAGGGAAA CTACGCAGAG CGTGTTGGTG 1601 CAGGCGCTCC AGTTTACCTA GCTGCCGTAA TGGAATATCT GGCCGCTGAG 1651 GTTCTCGAGT TGGCTGGCAA TGCTGCTCGT GACAACAAGA AGACTAGAAT 1701 TATTCCGCGT CATCTGCAAC TGGCCATCCG CAACGACGAG GAGTTAAACA 1751 AGCTGCTCTC CGGCGTCACA ATTGCACAAG GTGGCGTGTT GCCTAATATA 1801 CAGGCTGTTC TGTTGCCCAA GAAGACCGAG AAGAAGGCCT AAACGTTTCA 1851 AAGGCTAAGC TAAAAACCTA CATGTACATA AAATCGTCAA TCAAACCGTC 1901 CTTTTCAGGA CGACCAAATT ATTACCAAAG AATTGAAAAA TTTTTTAGCT 1951 TGGCAATTTG TTGTAATTAA TAAATCATAA AGAATTATTA ACGTAAAGAT 2001 GGTAATGTAG TAAGGGTTTT CTACTATATG CGGTATAAAC TATAATTTGC 2051 TTCTTTAAAC AATCGCACAC CACGATGTGA TGCTGTACAT GCGGTGTCTG 2101 AAACCATTTG TACAGTCTGT ACAAATCCAT GTTAGAAATA CACATTCTAT 2151 TTGAAAGAGT ACGAACGACA GACATTTATT TTTAGTTTAA CATATTTTTT 2201 GGGAGTCCCG ACCAATAAAA TTAAATACTT TTTGAAAATC TTCCTCCTTT 2251 TAAAAACTGA ATGGTGGTCC TGAAAAGGAC CGATTGCTTA ATAGGGGTAC 2301 ACAGGATGTA CACTTTTTAA CCGCCAAATC CGTAGAGGGT GCGGCCTTGC 2351 CTCTTCAGAG CGTACACAAC ATCCATGGCT GTAACTGTCT TCCTCTTGGC 2401 GTGTTCCGTG TAGGTCACGG CATCACGAAT TACGTTCTCC AAGAAAACCT 2451 TCAGAACGCC ACGCGTTTCC TCGTATATGA GTCCAGATAT GCGCTTCACA 2501 CCGCCTCGAC GGGCCAAACG GCGGATAGCA GGCTTCGTGA TACCTTGGAT 2551 GTTATCACGC AGCACTTTGC GATGACGCTT GGCGCCACCC TTTCCCAAGC 2601 CTTTGCCTCC TTTACCACGA CCAGTCATTT TTCACTGTTC TATACTATTA 2651 TACACGCACA GCACGAAAGT CACTAAAGAA CTAATTTCAA CGTTTCTGTG 2701 TGCCCCTATT TATAGGTAAA ACGACAAAAA CCCGAGAGAG TACGAACGAT 2751 ATGTTCGTTC GCTTTTCGCT CGTCAAATGA AATGGCctct gtttttctct 2801 ctCTCTCTCT CTCTCTTTCA CCGTCCACGA TTGCTATATA AGTAGGTAGC 2851 AAATGCTCTG ATCGTTFIRE WHENTTTCAA ACGTGAAGTA GTGAACGTGA 2901 ACTTTAGTGA AACCREADY, GRIDLEY!CT CGTACCAAGC AAACTGCTCG 2951 CAAATCGACT GGTGGAAAGG CGCCACGCAA ACAACTGGCT ACTAAGGCCG 3001 CTCGCAAGAG TGCTCCAGCC ACCGGAGGTG TGAAGAAGCC CCACCGCTAT 3051 CGCCCTGGAA CCGTGGCCTT GCGTGAAATT CGTCGCTACC AAAAGAGCAC 3101 CGAGCTTCTA ATCCGCAAGC TGCCTTTCCA GCGTCTGGTG CGTGAAATCG 3151 CTCAGGACTT TAAGACGGAC TTGCGATTCC AGAGCTCGGC GGTTATGGCT 3201 CTGCAGGAAG CTAGCGAAGC CTACCTGGTT GGTCTCTTCG AAGATACCAA 3251 CTTGTGTGCC ATTCATGCCA AGCGTATCAC CATAATGCCC AAAGACATCC 3301 AGTTAGCGCG ACGCATTCGC GGCGAGCGTG CTTAAGCTGA CACGGCATTA 3351 ACTTGCAGAT AAAGCGCTAG CGTACTCTAT AATCGGTCCT TTTCAGGACC 3401 ACAAACCAGA TTCAATGAGA TAAAATTTTC TGTTGCCGAC TATTTATAAC 3451 TTAAAAAAAA TAAGAACAAA ATTCATATTC TATTATTTAT GGCGCAAACG 3501 GTACTGGGTC TTAAATCATA TGTAAAAATA ATATTTATAA AATAACAGAA 3551 AATAATAAAA TAAAACTAGC TATTTTATAT TTTTTCCATG TGTTAACTGA 3601 AGAATGTGTT ATTATTGAAG AGGTCGTACG GGACAATTGA CACTGTCCCT 3651 TCAAACGTCT GTAAAAAATA AAACCTATGT AAAATTCAGC ACGGAAATTG 3701 GCTAATTTTG TTGCGGAATG TAATATATAT TACATAATAA AGGATAATAC 3751 AAAAATTGTT TCTTTTTATT TTTTATTTGA TTTATTTATT TGACTACATA 3801 GACGGTAATG CATATGTGGC GAGGAAATCG ATTGATTTCA GAACAAATTA 3851 TTTTAAAATA TGCATGAAAA CACATTAATA ACAAGCAAAC ACATTAATAA 3901 TTTAAGAAAA TATTATTTAT TATATTAATA TTATGTTATT TAAGAAAGTA 3951 TCTGTATTTT TAACGATCGA AAATTATTTC TGAATGCTGC TTTAAAGCAA 4001 ATTTTTCTGT AGTTCAATGT GAACTTAAAT CAAGTAATAA AGTATCTTAA 4051 TTAATAATAG ACGCTTCTTT CAGAAGCCTT CTAGGGATGA ACGTTTCAAT 4101 TTTAATAAAC ATAACGAATT AGTGAAATAT TTGCCATGAT TCTTATTTTA 4151 ATAGATGTTT TTTTATAAAT TGGTCCAGTT AAAAATTTGA TTATAAAAAT 4201 TCAATCAACA TTTGAAAGTC TCAAAACCCA TATTACATCC TTTTAAAAAT 4251 GGAAAGTGAC GAAAAAATTA TTTAAAAGTG TAGAACTATT AAAACCTTAT 4301 TTTTATTAAT GATTTAAAAT ACTAAAAAAT TAAAAAAAGT TTACACTTCA 4351 AGCAAACTTT GACATAGTAA ATGACTGATG TCAGTAGCAT TGTTAAAGTG 4401 CTCTCCTCCT CGATTCTCAT CAGAGCAAAG GAGGTTGGTA GGCAGCGCGC 4451 GAGCCATTTT TAACAGAAAA AAAGTGTTCT CAGTGAAAAA AAGATGTCTG 4501 ATTCTGCAGT TGCAACGTCC GCTTCCCCAG TGGCTGCCCC ACCAGCGACA 4551 GTTGAGAAGA AAGTGGTCCA AAAAAAGGCA TCTGGATCTG CTGGCACAAA 4601 GGCAAAGAAA GCCTCTGCGA CGCCGTCACA TCCGCCAACT CAGCAAATGG 4651 TGGACGCTTC CATTAAAAAT TTAAAGGAAC GTGGCGGTTC ATCACTTCTG 4701 GCAATCAAAA AATATATCAC TGCCACTTAT AAATGCGACG CCCAAAAGTT 4751 AGCGCCATTC ATCAAGAAGT ACTTAAAATC GGCCGTGGTC AATGGAAAGC 4801 T