GENSCAN 1.0 Date run: 7-Nov-116 Time: 15:22:39 Sequence gi568815592r:134070271_134417460 : 347190 bp : 41.72% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 13479 13586 108 2 0 63 100 76 0.686 6.57 1.02 Term + 15525 15635 111 2 0 89 49 145 0.992 8.28 1.03 PlyA + 15759 15764 6 1.05 2.06 PlyA - 16085 16080 6 1.05 2.05 Term - 32843 32778 66 2 0 91 42 70 0.313 -0.34 2.04 Intr - 32980 32892 89 2 2 25 45 150 0.174 3.17 2.03 Intr - 45231 44767 465 1 0 80 54 263 0.320 14.27 2.02 Intr - 47352 47274 79 0 1 90 86 15 0.645 -0.29 2.01 Init - 53818 53747 72 1 0 57 65 70 0.245 2.72 2.00 Prom - 66589 66550 40 -3.65 3.00 Prom + 69942 69981 40 -5.45 3.01 Init + 76724 76859 136 1 1 98 76 92 0.431 9.35 3.02 Intr + 90869 91046 178 0 1 63 85 115 0.119 6.76 3.03 Term + 98531 98636 106 2 1 58 36 82 0.003 -3.10 3.04 PlyA + 98688 98693 6 1.05 4.12 PlyA - 99004 98999 6 1.05 4.11 Term - 100165 99998 168 1 0 79 46 290 0.996 20.80 4.10 Intr - 100645 100556 90 1 0 76 89 30 0.692 1.17 4.09 Intr - 100908 100753 156 0 0 71 69 158 0.996 11.49 4.08 Intr - 101462 101367 96 2 0 91 105 86 0.991 9.89 4.07 Intr - 102046 101923 124 0 1 78 80 49 0.995 2.77 4.06 Intr - 102504 102392 113 0 2 83 87 126 0.833 10.26 4.05 Intr - 102884 102753 132 2 0 87 83 104 0.996 9.72 4.04 Intr - 103296 103192 105 0 0 78 86 108 0.987 9.09 4.03 Intr - 103810 103735 76 0 1 99 100 55 0.991 6.40 4.02 Intr - 104316 104241 76 1 1 111 98 112 0.995 12.15 4.01 Init - 105391 105274 118 1 1 63 94 151 0.684 11.61 4.00 Prom - 109340 109301 40 -4.65 5.00 Prom + 118243 118282 40 -5.65 5.01 Sngl + 123621 123854 234 2 0 25 49 201 0.782 4.75 5.02 PlyA + 124657 124662 6 1.05 6.00 Prom + 127253 127292 40 -5.15 6.01 Sngl + 147013 147417 405 0 0 45 38 260 0.795 12.73 6.02 PlyA + 148046 148051 6 1.05 7.00 Prom + 154581 154620 40 -3.55 7.01 Init + 182017 182102 86 0 2 91 72 64 0.073 5.44 7.02 Intr + 197628 197821 194 0 2 50 44 96 0.012 -0.39 7.03 Intr + 207162 207254 93 1 0 58 64 106 0.303 4.22 7.04 Term + 223749 223867 119 1 2 4 42 132 0.017 -2.08 7.05 PlyA + 225082 225087 6 1.05 8.05 PlyA - 226499 226494 6 1.05 8.04 Term - 227739 227530 210 0 0 -51 47 402 0.927 18.51 8.03 Intr - 228108 227915 194 1 2 65 81 204 0.816 15.69 8.02 Intr - 231630 231582 49 0 1 80 88 5 0.137 -2.97 8.01 Init - 239147 239073 75 2 0 64 70 155 0.944 12.44 8.00 Prom - 239761 239722 40 -8.55 9.07 PlyA - 241001 240996 6 1.05 9.06 Term - 246016 245880 137 2 2 65 44 106 0.431 1.10 9.05 Intr - 247674 247545 130 0 1 101 70 78 0.166 6.65 9.04 Intr - 262062 261944 119 1 2 105 68 38 0.083 2.76 9.03 Intr - 276557 276354 204 0 0 50 62 120 0.471 3.85 9.02 Intr - 278103 277926 178 0 1 22 81 126 0.808 3.87 9.01 Init - 281416 281303 114 1 0 87 -7 126 0.685 3.26 9.00 Prom - 292133 292094 40 -4.95 10.05 PlyA - 293653 293648 6 1.05 10.04 Term - 298457 298163 295 1 1 91 48 159 0.835 5.99 10.03 Intr - 298637 298565 73 0 1 52 87 55 0.815 -0.65 10.02 Intr - 299126 298973 154 2 1 32 52 124 0.010 1.92 10.01 Init - 306693 306454 240 0 0 62 52 178 0.553 9.52 10.00 Prom - 317018 316979 40 -3.05 11.05 PlyA - 317789 317784 6 1.05 11.04 Term - 323178 322931 248 1 2 80 42 281 0.858 17.67 11.03 Intr - 324354 324291 64 0 1 64 77 8 0.137 -5.33 11.02 Intr - 326193 325740 454 2 1 47 59 190 0.118 4.33 11.01 Intr - 337645 337516 130 2 1 48 49 137 0.009 4.63 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 227226 227062 165 0 0 80 43 272 0.945 16.40 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:134070271_134417460|GENSCAN_predicted_peptide_1|72_aa MRIVTGSIWKESYIGEQLSEGIPSVSGQGNKGQVLKCWMLHTLDHWTPGSSAFGLLDLHQ WFARGSRAFATD >gi568815592r:134070271_134417460|GENSCAN_predicted_CDS_1|219_bp atgaggattgtaactggatccatctggaaggaatcctacataggagagcagctgagtgag gggatcccatcagtatcagggcaagggaacaagggtcaggtgctgaagtgctggatgctt cataccctcgatcactggactccaggttcttcagcttttggactcttggacctacaccag tggtttgccaggggctctcgggcctttgccacagactga >gi568815592r:134070271_134417460|GENSCAN_predicted_peptide_2|256_aa MHMKRLALHPARCQPLLLQPPSSSRFASALTPQPLPLLLWLVTASASSTTQGKKSQSSWR SSQPLASKKEKDGAEKRGGGSSTSEGAQHVLTPKTLGNRKEANPRMLPRPGKPPQLQEGK QRAGPKHWEGGRGSISRESSREEPLPSGTGQFCSPPLPLSFPQATVTCTAHLRPHHRGRA PVGPVTAVFLWLQFPATHTLALLDEAQEASLLPLVTVQEAKQELLPQLAQNVQDLDQPEK ANYMPPANHTGCPTSQ >gi568815592r:134070271_134417460|GENSCAN_predicted_CDS_2|771_bp atgcacatgaagcgcttagccctgcatccagcacgttgtcagccattgctattacaacca ccttcatcatctcgttttgcttcagccctcactcctcagccacttccactgctcctgtgg ttagtgaccgcctctgcctcctccactacccaagggaagaagagccagtcgagctggagg tccagccagcccttggcctccaagaaggaaaaggacggcgctgagaagcgaggtgggggc agctccacgtccgaaggagcccagcacgtgctaacacctaagaccctgggcaaccgaaag gaagcaaaccccaggatgctgccacgacccggaaaaccaccacagctccaggagggaaag caaagggccggcccaaaacactgggaaggaggaagagggagcatctctcgggagtcctcc cgggaggagccacttccttctgggaccggacagttttgctcacccccattgcccctgtcc ttcccccaggccaccgtcacctgcaccgcccacctgcgtcctcaccaccgcggcagggcc cctgttggcccagtgacagcagtgttcctctggcttcagttcccagccacgcacacactt gccctcctggacgaggcccaggaagccagtttgctacctctggtaaccgtccaggaagct aaacaagaacttctgccacaactggcacaaaatgtccaggacttggaccaacctgagaaa gccaattatatgcccccagccaatcacacgggatgccccacttctcaatag >gi568815592r:134070271_134417460|GENSCAN_predicted_peptide_3|139_aa MAEGEANMLFFTWQQEEVRSKSGEKFLIKPSDLGRTFSLSGEQLGGLAPEAIKFMTPGLR DLSYSGVFMTLWNWKDRIVHSESDSQEWLEGKVPPNQEGFGKTFRLWKGEVVCFVHYDIL STRNRAQSRLSNISGKNYL >gi568815592r:134070271_134417460|GENSCAN_predicted_CDS_3|420_bp atggcggaaggggaagcaaacatgctgttcttcacatggcagcaggaagaagtgcggagt aaaagcggggaaaagttccttataaaaccttcggatcttgggagaaccttctcactatca ggagaacagcttggaggactggctcctgaagccattaagtttatgacccctggacttaga gatctttcctattctggcgtttttatgactctatggaattggaaggacaggatagttcat agtgagagtgactcacaggaatggctggaagggaaagtaccccctaatcaggaaggtttt gggaaaacattcaggctctggaagggcgaagttgtctgctttgtccactacgatatcctg agcactcgcaatagagcccagagtaggctttcaaatatttctggaaagaattacctgtaa >gi568815592r:134070271_134417460|GENSCAN_predicted_peptide_4|417_aa MGEMQGALARARLESLLRPRHKKRAEAQKRSESFLLSGLAFMKQRRMGLNDFIQKIANNS YACKHPEVQSILKISQPQEPELMNANPSPPPSPSQQINLGPSSNPHAKPSDFHFLKVIGK GSFGKEKHIMSERNVLLKNVKHPFLVGLHFSFQTADKLYFVLDYINGGELFYHLQRERCF LEPRARFYAAEIASALGYLHSLNIVYRDLKPENILLDSQGHIVLTDFGLCKENIEHNSTT STFCGTPEYLAPEVLHKQPYDRTVDWWCLGAVLYEMLYGLPPFYSRNTAEMYDNILNKPL QLKPNITNSARHLLEGLLQKDRTKRLGAKDDFMEIKSHVFFSLINWDDLINKKITPPFNP NVSGPNDLRHFDPEFTEEPVPNSIGKSPDSVLVTASVKEAAEAFLGFSYAPPTDSFL >gi568815592r:134070271_134417460|GENSCAN_predicted_CDS_4|1254_bp atgggggagatgcagggcgcgctggccagagcccggctcgagtccctgctgcggccccgc cacaaaaagagggccgaggcgcagaaaaggagcgagtccttcctgctgagcggactggct ttcatgaagcagaggaggatgggtctgaacgactttattcagaagattgccaataactcc tatgcatgcaaacaccctgaagttcagtccatcttgaagatctcccaacctcaggagcct gagcttatgaatgccaacccttctcctccaccaagtccttctcagcaaatcaaccttggc ccgtcgtccaatcctcatgctaaaccatctgactttcacttcttgaaagtgatcggaaag ggcagttttggaaaggagaagcatattatgtcggagcggaatgttctgttgaagaatgtg aagcaccctttcctggtgggccttcacttctctttccagactgctgacaaattgtacttt gtcctagactacattaatggtggagagttgttctaccatctccagagggaacgctgcttc ctggaaccacgggctcgtttctatgctgctgaaatagccagtgccttgggctacctgcat tcactgaacatcgtttatagagacttaaaaccagagaatattttgctagattcacaggga cacattgtccttactgacttcggactctgcaaggagaacattgaacacaacagcacaaca tccaccttctgtggcacgccggagtatctcgcacctgaggtgcttcataagcagccttat gacaggactgtggactggtggtgcctgggagctgtcttgtatgagatgctgtatggcctg ccgcctttttatagccgaaacacagctgaaatgtacgacaacattctgaacaagcctctc cagctgaaaccaaatattacaaattccgcaagacacctcctggagggcctcctgcagaag gacaggacaaagcggctcggggccaaggatgacttcatggagattaagagtcatgtcttc ttctccttaattaactgggatgatctcattaataagaagattactcccccttttaaccca aatgtgagtgggcccaacgacctacggcactttgaccccgagtttaccgaagagcctgtc cccaactccattggcaagtcccctgacagcgtcctcgtcacagccagcgtcaaggaagct gccgaggctttcctaggcttttcctatgcgcctcccacggactctttcctctga >gi568815592r:134070271_134417460|GENSCAN_predicted_peptide_5|77_aa MIPLEVVDNPSLSWQLREQGDTNLGQESVPSSRHTHTHTHTHAHSDWDNVNTPMNLNMHI FGMWEESGVTWRKPTKT >gi568815592r:134070271_134417460|GENSCAN_predicted_CDS_5|234_bp atgattccacttgaggtggtagacaacccaagcttatcctggcagctcagggaacaaggt gacaccaaccttggacaggagtccgtcccatcctcaaggcacactcacactcacacccac actcatgctcactcagactgggacaacgtgaacacgccaatgaacctcaacatgcatatc tttgggatgtgggaggaaagtggagttacatggagaaaacctacaaagacatga >gi568815592r:134070271_134417460|GENSCAN_predicted_peptide_6|134_aa MVIEGQSCPRAPATGGSTAFCAQTSVAGLQNACGPLFLCLAEGFLRAVAPSLTHRTTTAF GSWVVGFTHTVEWVSHAFVPGLQSSTSHHTGQDGMWPHYLTTANMPIVCKFGTSTATLIK MMVWTRIAGVWSRD >gi568815592r:134070271_134417460|GENSCAN_predicted_CDS_6|405_bp atggtcatagagggtcaaagctgtccacgagctccagcaacaggaggaagcacagcattc tgtgctcagactagtgtggcaggcctgcagaatgcgtgtgggcccttgttcctgtgcttg gctgagggcttcctgagagcagttgctccatctctgacccaccggaccactactgccttt ggatcctgggtggtgggcttcactcacacagtcgaatgggtttctcatgcctttgtccct gggctacagtcctccacttcacaccacaccggtcaagatggaatgtggccacattatctc actacagcaaacatgcccatagtttgcaaatttggtacatcaaccgccacgctcataaag atgatggtttggaccaggatagcaggagtctggtcaagagattaa >gi568815592r:134070271_134417460|GENSCAN_predicted_peptide_7|163_aa MEVLLEEKDLASWLSGSLAAHTMCALHRQFCLHWLSPYTGFNVSLLYDSLTLFTSFEASQ VIFTGPIKDRVYDVCLLLREWLDRPSECTGVRQALCDLDEGQGRLNGARNDQGPGGVRGG QTQADEDSGHVASFSEHVAVQIAQVLSKNNSKEPFGSEAYLGL >gi568815592r:134070271_134417460|GENSCAN_predicted_CDS_7|492_bp atggaagtactcctggaggagaaggatttagcttcctggttatcaggctctctggctgct cacacaatgtgcgcgttgcacagacagttttgtctccattggcttagtccttatactggc ttcaacgtgtcccttctttatgactccttgacacttttcaccagctttgaagcctcacaa gtgatttttacaggcccaattaaggacagagtttacgatgtttgcttgctcttgagagag tggctggaccgacccagtgaatgcacaggagttaggcaagctctatgtgacctagatgaa gggcagggaaggctaaatggagcacgaaatgaccagggtcctggaggtgtgagaggtggg cagacccaggcagatgaagattcaggtcatgttgcaagtttttctgaacatgtggctgtg cagatagctcaggtcctgagcaaaaacaacagcaaagaaccctttggcagtgaagcctat cttggcctctga >gi568815592r:134070271_134417460|GENSCAN_predicted_peptide_8|175_aa MWESGGEGGGGGEGGAGGGGQEKQTFTRAVVAHANNPSALGGKVRFLEQQNKMLETKWSL LQQQKTAWSNMDNMFESYFNNLRRQLETRGQEKLKLEAELGNMQGLISDTSVMLSMDNSR SPDMDGIITEVKAQYEEITNRSRSEAESMYQIKYEELQMLAGKHGDDLQHTKTEI >gi568815592r:134070271_134417460|GENSCAN_predicted_CDS_8|528_bp atgtgggaaagtggaggagaaggaggtggaggaggtgaaggaggtgcaggaggaggagga caggaaaagcagacatttaccagagcagtggtggctcatgccaataatcccagtgctttg ggaggcaaggtacggttcctggagcagcagaacaagatgctagagaccaagtggagcctc ctgcagcagcagaagacagcttggagcaacatggacaacatgttcgagagctacttcaac aaccttaggcggcagctggagactcggggccaggagaagctgaagctggaggcagagctt ggcaacatgcaggggctgatctcggatacgtctgtgatgctgtccatggacaacagccgc tccccggacatggacggcatcatcactgaggtcaaggcgcagtacgaggagatcaccaac cgcagccggagtgaggctgagagcatgtaccagatcaaatatgaggagctgcagatgctg gctgggaagcacggggatgacctgcagcatacaaagactgagatctag >gi568815592r:134070271_134417460|GENSCAN_predicted_peptide_9|293_aa MSSSKSLQQSIQIREHPLTISKRLLDLQIIVRFFKKAELWKVQWQVWSEANQRTQEIGEE LSFRSGSRQSTLPSFFCKITTGYFSERCYGDASALNTMHADHHSKCPTLHICIPSPESCL WTQCPHPGSRLEVPDNEYPHPTPTVAFKPELNRNWVSCALSLLVFRLPDTLCLLPPQHSS HNPEIASQTCSLTLKQGECLIHQHVLLNYSSHWNLHNQKQLSSLKPPPGAHHLEKSDSLP APAAEEQPEHSEFQGGLDYLPAPNGQAGKARLEMQATCQAQLEGQRRLPSTLD >gi568815592r:134070271_134417460|GENSCAN_predicted_CDS_9|882_bp atgagcagctctaagtcccttcagcagagtatacaaatccgggagcatccactgactatc agcaaacgactcttagacttacagataattgttaggttcttcaagaaagcagagctatgg aaagttcaatggcaggtgtggtccgaagccaaccagaggacgcaggagattggagaagag ctctccttccgttcaggctcacggcaaagcacgttgccttctttcttttgcaaaataacc acaggatacttctcagagagatgctacggagatgcttcagccttgaacaccatgcatgca gaccaccattccaaatgcccaactctccacatctgcattccttcacctgagagctgcctc tggacccagtgtccccacccaggcagcagactggaagtgccagacaatgaatatccccac cccacccctaccgtggccttcaaaccagaactaaacaggaactgggtttcctgtgccctt tccttgctggtgtttcggctcccagacactctgtgcttactgccaccacaacactcatca cataaccctgaaattgccagccagacttgcagtctaactcttaagcaaggagaatgtctt attcaccagcatgtgctgcttaactacagctcccactggaacttgcacaatcaaaaacaa ctctcctctctcaagccgcctccaggagcgcatcacctggagaagagcgactcgctcccc gcgccggccgcggaagagcagccagaacacagtgaatttcagggtgggctggactatctc cctgcaccaaatgggcaagctgggaaggcaaggctagaaatgcaggcaacatgtcaggcc cagctagagggccagcgaaggctcccaagcacccttgactga >gi568815592r:134070271_134417460|GENSCAN_predicted_peptide_10|253_aa MLGLVTADVTTLRCKGARGESVERVTLRGVMVFARNHAASPSQLCRQEAGECGTRPHCDL PSDLLQGLSIGQSQPETRGQVMKAPVATQKAEPTLLLGREPGQRESFKSPLRSSCSTTQP TNERDCLQRELAFMAKVQLLSDCSCQMQVPRFHRTRKVVFLGLSKKFTTFSHKLFTFILH QHQNLAIELTHSLHEIEERRQSLDTKVSLQLSSQLKVFNEPALHQMEPGGIIIRGSHCPQ KPNQSHPCLLSLS >gi568815592r:134070271_134417460|GENSCAN_predicted_CDS_10|762_bp atgctggggctagtcacagcagatgttacaaccctgaggtgtaaaggagcaagaggagag agtgtggaaagggtcaccttgagaggagttatggtctttgctcgcaatcatgcagccagc ccaagccaactctgtaggcaggaagcaggggaatgtgggaccagaccgcactgtgatctt ccttctgacctcctgcaggggctctccattggacaaagccaaccagaaaccagagggcaa gtaatgaaggctcctgtagctactcagaaagcagagcctaccctgctgctagggagagaa ccagggcagcgtgagagctttaaaagccccctcaggtcctcctgttctacgactcaacca accaatgaaagagattgccttcagagggagctggcttttatggcaaaggtgcagttatta agtgattgcagctgccaaatgcaagtgccccggttccacaggacaagaaaagttgtattc cttgggctctccaagaagttcacaactttctctcataaattatttacatttatcctccac caacatcagaatctagcaatcgaactcacacactcattacatgaaattgaagagagacgg cagtcattagacacaaaggtctcactacagctctcctcgcagctgaaggtgttcaatgaa cctgccctccatcaaatggaacctggtggaatcattatccgaggcagtcactgtccgcaa aagccaaaccagtctcatccttgcctgctctcgctctcatga >gi568815592r:134070271_134417460|GENSCAN_predicted_peptide_11|298_aa XPGIGPGSSVEPTASTPSSVHRKSFQKYYWHLAASKCEDIPHVRETLERRKERKKERKKE RKKERKKERKKERKKERKKERKKERKKGKKGKKERKKKRKKERRKERKKERKKERKERKE RKKERKKERKKEGKKERKKEERKKIKERKEGGKEGRKESGGEGGRKEERKEKERKEKVTR SASAYKSDPTKCRGEFRKQSPLAESKQKPGCKGSYSSGPPAAGPRQPGLMARIATTGAGV GYTVGHTITGGFNEGSNAEPASPDITYQEPQGTQMSQRQQPCFYEIKPFLECARNRVT >gi568815592r:134070271_134417460|GENSCAN_predicted_CDS_11|897_bp ngtcctggcataggtcctgggtcctctgtagaacccacagcttctacaccttcatcggtc cacaggaaaagctttcagaaatattactggcatttggctgcatccaagtgtgaagatatc cctcatgtcagagagaccctggaaagaaggaaagaaagaaagaaagaaagaaagaaagaa agaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaa agaaagaaagaaagaaagaaaggaaagaaaggaaagaaagaaagaaagaaaaaaagaaag aaagaaagaaggaaagaaagaaagaaagaaagaaagaaagaaagaaaggaaagaaaggaa agaaagaaagaaagaaaaaaagaaagaaagaaagaaggaaagaaagaaagaaagaaagaa gaaagaaagaaaataaaagaaagaaaggaaggagggaaggaaggaaggaaagaaagcggg ggagagggagggaggaaggaagaaaggaaagaaaaagaaagaaaagaaaaggtgacaagg tctgcttctgcttataaatctgaccctaccaagtgcagaggagagttccgcaagcaatcc ccattagctgaatccaaacagaagccagggtgtaagggatcttattcatctggccctcct gctgctgggcccaggcagccaggtctgatggcccggatagcaaccactggagctggcgtg gggtacacggtgggtcataccatcactgggggcttcaatgaaggaagtaatgctgagcct gcaagtcctgacatcacctaccaggagccccagggaacccagatgtcacagcggcagcag ccttgcttctatgagataaaaccgtttttggagtgtgcccggaaccgggtgacataa