GENSCAN 1.0 Date run: 4-Nov-116 Time: 05:52:36 Sequence gi568815597f:28712384_28963280 : 250897 bp : 46.69% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1690 2390 701 2 2 97 36 637 0.862 53.10 1.02 PlyA + 5805 5810 6 1.05 2.00 Prom + 9765 9804 40 -6.46 2.01 Init + 24738 24764 27 2 0 79 92 28 0.670 2.06 2.02 Intr + 25275 25299 25 2 1 105 115 16 0.785 3.60 2.03 Intr + 25876 25955 80 2 2 80 86 23 0.784 0.57 2.04 Intr + 30020 31603 1584 1 0 130 86 830 0.743 76.04 2.05 Term + 37224 37241 18 2 0 96 42 26 0.302 -2.88 2.06 PlyA + 38551 38556 6 1.05 3.00 Prom + 46482 46521 40 -5.06 3.01 Init + 63104 63161 58 1 1 83 99 74 0.646 8.05 3.02 Term + 76527 76612 86 1 2 75 53 56 0.118 -1.38 3.03 PlyA + 76791 76796 6 1.05 4.00 Prom + 91582 91621 40 -6.26 4.01 Init + 100001 100227 227 1 2 97 117 411 0.995 42.84 4.02 Intr + 100344 100414 71 0 2 101 22 88 0.515 2.33 4.03 Term + 102516 102559 44 1 2 64 55 64 0.283 -1.98 4.04 PlyA + 103855 103860 6 1.05 5.04 PlyA - 108712 108707 6 1.05 5.03 Term - 113632 113435 198 1 0 79 37 105 0.896 1.90 5.02 Intr - 114373 114309 65 2 2 54 94 94 0.962 5.04 5.01 Init - 115301 115124 178 2 1 74 -29 174 0.534 3.72 5.00 Prom - 116044 116005 40 -4.26 6.00 Prom + 118826 118865 40 -3.16 6.01 Init + 146581 146920 340 0 1 68 84 689 0.527 63.92 6.02 Term + 150359 150900 542 0 2 65 47 1226 0.984 110.62 6.03 PlyA + 151723 151728 6 -0.45 7.06 PlyA - 152732 152727 6 1.05 7.05 Term - 155493 155448 46 2 1 93 42 31 0.043 -4.32 7.04 Intr - 164887 164807 81 0 0 59 85 52 0.189 0.75 7.03 Intr - 165087 164991 97 1 1 122 94 63 0.407 9.37 7.02 Intr - 178620 178520 101 2 2 89 110 22 0.286 4.25 7.01 Init - 184079 184027 53 2 2 80 73 8 0.084 -0.87 7.00 Prom - 197803 197764 40 -4.56 8.03 PlyA - 199791 199786 6 1.05 8.02 Term - 202606 202253 354 1 0 102 42 252 0.963 16.49 8.01 Init - 225845 225603 243 2 0 80 12 148 0.046 4.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 225845 225555 291 2 0 80 38 139 0.902 3.75 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:28712384_28963280|GENSCAN_predicted_peptide_1|233_aa XLERQLEEQKKQGQDHRLKSQTVQNVVLMPVSTPKPPKRPRLQRPASTTVLSPSPPVQQP QFTVISPITITPVGQSFSMGNIPVATLSQGSSPVTVHTLPSGPQLFRYATVVSSAKSSSP DTVTIHPSSSLALLSSTAMQDGSTLGNMTTMVSPVELVAMESGLTSAIQAVESTSEDGQT IIEIDPAPDPEAEDTEGKAVILETELRTEEKVVAEMEEHQHQVHNVEIVVLED >gi568815597f:28712384_28963280|GENSCAN_predicted_CDS_1|702_bp nacttggaacgccagttggaggagcagaagaagcaaggccaggatcacaggctgaaatct cagacagttcaaaatgtggtactgatgcctgtgagcactcctaagcctccaaaaaggccc cggctccagcggccagcctccaccactgtcttgagcccttctcctcctgtccagcagcct cagttcacagtcatctcacccatcaccatcaccccagtgggtcagtcattttccatgggc aatattccagtggccaccctcagccagggctccagtcctgtgactgtccacacactgcct tctggccctcagctcttccgctatgccacagtggtctcctctgccaagagcagctcacca gacacagtgaccatccacccttcatctagcttggcgctgctgagctctactgccatgcag gatgggagtacactgggcaacatgaccaccatggttagccctgtggaattggtggccatg gagtccggcctaacctcggcaattcaggctgttgaaagcacctcagaggatgggcagacc atcattgagattgatccagccccggacccagaagctgaagatactgagggcaaagcagtc atcttggagacagagctgaggactgaggagaaagttgtggctgagatggaagaacaccag catcaagttcacaatgtggagattgtggtcttagaggattaa >gi568815597f:28712384_28963280|GENSCAN_predicted_peptide_2|577_aa MSASSLLEQRPKGQGNKVQNGSVHQKDGLNDDDFEPYLSPQARPNNAYTAMSDSYLPSYY SPSIGFSYSLGEAAWSTGGDTAMPYLTSYGQLSNGEPHFLPDAMFGQPGALGSTPFLGQH GFNFFPSGIDFSAWGNNSSQGQSTQSSGYSSNYAYAPSSLGGAMIDGQSAFANETLNKAP GMNTIDQGMAALKLGSTEVASNVPKVVGSAVGSGSITSNIVASNSLPPATIAPPKPASWA DIASKPAKQQPKLKTKNGIAGSSLPPPPIKHNMDIGTWDNKGPVAKAPSQALVQNIGQPT QGSPQPVGQQANNSPPVAQASVGQQTQPLPPPPPQPAQLSVQQQAAQPTRWVAPRNRGSG FGHNGVDGNGVGQSQAGSGSTPSEPHPVLEKLRSINNYNPKDFDWNLKHGRVFIIKSYSE DDIHRSIKYNIWCSTEHGNKRLDAAYRSMNGKGPVYLLFSVNGSGHFCGVAEMKSAVDYN TCAGVWSQDKWKGRFDVRWIFVKDVPNSQLRHIRLENNENKPVTNSRDTQEVPLEKAKQV LKIIASYKHTTSIFDDFSHYEKRQEEEESVKKPFNYK >gi568815597f:28712384_28963280|GENSCAN_predicted_CDS_2|1734_bp atgtcggccagcagcctcttggagcagagaccaaaaggtcaaggaaacaaagtacaaaat ggatctgtacatcaaaaggatggattaaacgatgatgattttgaaccttacttgagtcca caggcaaggcccaataatgcatatactgccatgtcagattcctacttacccagttactac agtccctccattggcttctcctattctttgggtgaagctgcttggtctacggggggtgac acagccatgccctacttaacttcttatggacagctgagcaacggagagccccacttccta ccagatgcaatgtttgggcaaccaggagccctaggtagcactccatttcttggtcagcat ggttttaatttctttcccagtgggattgacttctcagcatggggaaataacagttctcag ggacagtctactcagagctctggatatagtagcaattatgcttatgcacctagctcctta ggtggagccatgattgatggacagtcagcttttgccaatgagaccctcaataaggctcct ggcatgaatactatagaccaagggatggcagcactgaagttgggtagcacagaagttgca agcaatgttccaaaagttgtaggttctgctgttggtagcgggtccattactagtaacatc gtggcttccaatagtttgcctccagccaccattgctcctccaaaaccagcatcttgggct gatattgctagcaagcctgcaaaacagcaacctaaactgaagaccaagaatggcattgca gggtcaagtcttccgccacccccgataaagcataacatggatattggaacttgggataac aagggtcccgttgcaaaagccccctcacaggctttggttcagaatataggtcagccaacc caggggtctcctcagcctgtaggtcagcaggctaacaatagcccaccagtggctcaggca tcagtagggcaacagacacagccattgcctccacctccaccacagcctgcccagctttca gtccagcaacaggcagctcagccaacccgctgggtagcacctcggaaccgtggcagtggg ttcggtcataatggggtggatggtaatggagtaggacagtctcaggctggttctggatct actccttcagaaccccacccagtgttggagaagcttcggtccattaataactataacccc aaagattttgactggaatctgaaacatggccgggttttcatcattaagagctactctgag gacgatattcaccgttccattaagtataatatttggtgcagcacagagcatggtaacaag agactggatgctgcttatcgttccatgaacgggaaaggccccgtttacttacttttcagt gtcaacggcagtggacacttctgtggcgtggcagaaatgaaatctgctgtggactacaac acatgtgcaggtgtgtggtcccaggacaaatggaagggtcgttttgatgtcaggtggatt tttgtgaaggacgttcccaatagccaactgcgacacattcgcctagagaacaacgagaat aaaccagtgaccaactctagggacactcaggaagtgcctctggaaaaggctaagcaggtg ttgaaaattatagccagctacaagcacaccacttccatttttgatgacttctcacactat gagaaacgccaagaggaagaagaaagtgttaaaaagccctttaactacaagtaa >gi568815597f:28712384_28963280|GENSCAN_predicted_peptide_3|47_aa MVSALASAPVRKRPALTCAGSLSWWMSASSNSLLKPEACASHSRLES >gi568815597f:28712384_28963280|GENSCAN_predicted_CDS_3|144_bp atggtgtccgcccttgcttctgcgcctgtgcggaagcgcccggccctcacctgcgcaggg tccctgtcatggtggatgtctgcatcatccaactcgttgcttaagccagaggcctgtgcg agtcactctcgactcgagtcctga >gi568815597f:28712384_28963280|GENSCAN_predicted_peptide_4|113_aa MEPAPSAGAELQPPLFANASDAYPSACPSAGANASGPPGARSASSLALAIAITALYSAVC AVGLLGNVLVMFGIVRGTWGPARGEATYIEGNTGMCVIVCIRIIISTRILQIK >gi568815597f:28712384_28963280|GENSCAN_predicted_CDS_4|342_bp atggaaccggccccctccgccggcgccgagctgcagcccccgctcttcgccaacgcctcg gacgcctaccctagcgcctgccccagcgctggcgccaatgcgtcggggccgccaggcgcg cggagcgcctcgtccctcgccctggcaatcgccatcaccgcgctctactcggccgtgtgc gccgtggggctgctgggcaacgtgcttgtcatgttcggcatcgtccggggcacctggggc ccagcgagaggcgaggccacttacatcgaggggaacacaggaatgtgtgtcatcgtgtgc attagaatcatcatcagtacccgcatcctgcagataaaatga >gi568815597f:28712384_28963280|GENSCAN_predicted_peptide_5|146_aa MDDFEGFEASVEEVTADVVEIARELELEVGPEDVTELLLLQSHEKTLMDEKLLLIDEQEK IATATPTFGNHYPQPAAINIETVSSMRARTVSSSLLHPKYLELCLVLQGGQPNDVEFFYH PHFTDEKTDSTLSSTVDSQQEAGFEP >gi568815597f:28712384_28963280|GENSCAN_predicted_CDS_5|441_bp atggatgactttgaggggttcgaggcttcagtggaggaagtaactgcagatgtagtagaa atagcaagagaactagaattagaagtggggcctgaagatgtgactgaattgctgctgctg caatctcatgaaaaaactttaatggatgagaagttgcttcttatagatgagcaagaaaaa attgccacagccaccccaaccttcggcaaccactacccccaaccagcagccatcaatatt gagactgtcagctccatgagggcaaggactgtgtcttcatcactgttgcatcccaagtac ctagaactgtgcctggtacttcaggggggacaacccaatgatgtagagtttttctatcat ccccatttcacagatgagaaaacagacagcacgctgagttcaacagtggacagccagcaa gaagctggatttgaaccctag >gi568815597f:28712384_28963280|GENSCAN_predicted_peptide_6|293_aa MKTATNIYIFNLALADALATSTLPFQSAKYLMETWPFGELLCKAVLSIDYYNMFTSIFTL TMMSVDRYIAVCHPVKALDFRTPAKAKLINICIWVLASGVGVPIMVMAVTRPRDGAVVCM LQFPSPSWYWDTVTKICVFLFAFVVPILIITVCYGLMLLRLRSVRLLSGSKEKDRSLRRI TRMVLVVVGAFVVCWAPIHIFVIVWTLVDIDRRDPLVVAALHLCIALGYANSSLNPVLYA FLDENFKRCFRQLCRKPCGRPDPSSFSRAREATARERVTACTPSDGPGGGAAA >gi568815597f:28712384_28963280|GENSCAN_predicted_CDS_6|882_bp atgaagacggccaccaacatctacatcttcaacctggccttagccgatgcgctggccacc agcacgctgcctttccagagtgccaagtacctgatggagacgtggcccttcggcgagctg ctctgcaaggctgtgctctccatcgactactacaatatgttcaccagcatcttcacgctc accatgatgagtgttgaccgctacatcgctgtctgccaccctgtcaaggccctggacttc cgcacgcctgccaaggccaagctgatcaacatctgtatctgggtcctggcctcaggcgtt ggcgtgcccatcatggtcatggctgtgacccgtccccgggacggggcagtggtgtgcatg ctccagttccccagccccagctggtactgggacacggtgaccaagatctgcgtgttcctc ttcgccttcgtggtgcccatcctcatcatcaccgtgtgctatggcctcatgctgctgcgc ctgcgcagtgtgcgcctgctgtcgggctccaaggagaaggaccgcagcctgcggcgcatc acgcgcatggtgctggtggttgtgggcgccttcgtggtgtgttgggcgcccatccacatc ttcgtcatcgtctggacgctggtggacatcgaccggcgcgacccgctggtggtggctgcg ctgcacctgtgcatcgcgctgggctacgccaatagcagcctcaaccccgtgctctacgct ttcctcgacgagaacttcaagcgctgcttccgccagctctgccgcaagccctgcggccgc ccagaccccagcagcttcagccgcgcccgcgaagccacggcccgcgagcgtgtcaccgcc tgcaccccgtccgatggtcccggcggtggcgctgccgcctga >gi568815597f:28712384_28963280|GENSCAN_predicted_peptide_7|125_aa MPSTGPDTEEKQMTRKGRGHPTEAAQLVPQFPSLKIPELRALLQQGSNSSTGTNSNAQLP GMSNGGQNGIEQELSKDANARYSRCTDVHVYGSGVWVTGVQMEAYLYVAVRSLALTLNPG STCKQ >gi568815597f:28712384_28963280|GENSCAN_predicted_CDS_7|378_bp atgcccagcacagggcctgacacagaggaaaagcagatgactagaaaagggaggggccac ccaacagaggcagcacagcttgttccccagttccccagcctcaagatcccagagctcaga gctttgctccagcaggggtcaaacagttccacaggaaccaattccaatgcacagcttcca gggatgagcaatggaggacagaatggaatagagcaggaactaagcaaggatgctaatgca agatatagccggtgtacggatgtgcatgtgtatgggtcaggtgtatgggtcacaggtgta cagatggaggcctatttgtatgtggctgtgaggtctctagctctgaccctgaaccctggc tccacttgtaagcagtga >gi568815597f:28712384_28963280|GENSCAN_predicted_peptide_8|198_aa MDKTDAKIHMEIQGIARTAKTMLKNKTTVRGLTLTNFKAYHKPTVIKTMWYQHKDKHVDQ RNGTESPEINSYIYIQLVATRRLWPAPRSSGRVPAGDSACTRGSSAGEPAPRPGACRGLQ TKRLQRRRGRQDRPHGDSSTPTAPAPAPPPPPRLTGGSGAGLLRPPLLQPDEQWQEEAPA SEARALCGSRARRLLLAD >gi568815597f:28712384_28963280|GENSCAN_predicted_CDS_8|597_bp atggacaaaactgatgctaaaattcatatggaaatacaagggatagccagaacagccaaa acaatgttgaaaaataagaccacagttagaggactcacacttaccaatttcaaagcttac cacaaacctacagtaatcaagacaatgtggtaccagcataaggataaacacgtagatcaa cggaatggaactgagagtccagaaataaactcatacatctatatccagttggttgcaaca aggcggctctggcccgcccctcgctcctcgggccgcgtcccggctggcgactcggcgtgc acacgaggctcctccgcgggagagcccgcgccccggcccggggcctgtcgggggttgcag acaaagaggctgcagcgccgccgcggccgccaggaccgtccccacggggacagctccacg cccaccgccccggctcccgcgccgccgccgccgcctcgcctcaccggtggctccggggcc gggctcctgcgcccgccactgctgcagcccgacgaacaatggcaggaggaggcgccggcg tccgaggctcgggccctctgcggctcgcgggcccgccggctgctgctggctgactga