GENSCAN 1.0    Date run: 6-Nov-116    Time: 00:57:03

Sequence gi568815589r:125338890_125772574 : 433685 bp : 43.44% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7930   8052  123  2  0  135   59    67 0.590   9.06
 1.02 Intr +  11465  11515   51  0  0   84   94    24 0.686   1.48
 1.03 Intr +  11824  11983  160  2  1   34   90    47 0.428  -1.45
 1.04 Intr +  15765  15952  188  0  2   77   88    78 0.882   6.03
 1.05 Intr +  16755  16968  214  1  1   -2  100   274 0.702  17.37
 1.06 Intr +  20531  20603   73  2  1   37   75    19 0.496  -5.09
 1.07 Intr +  21639  21836  198  2  0   52   86    83 0.525   4.05
 1.08 Intr +  31848  31952  105  2  0  102   40    53 0.042   2.31
 1.09 Term +  33696  33806  111  2  0   75   42    90 0.036   1.66
 1.10 PlyA +  39462  39467    6                               1.05

 2.00 Prom +  56068  56107   40                              -4.76
 2.01 Init +  63532  63610   79  0  1   91   68    97 0.191   7.29
 2.02 Intr +  69126  69255  130  1  1   49   86    57 0.056   1.35
 2.03 Intr +  69513  69606   94  0  1  120   41    42 0.838   2.67
 2.04 Intr +  72482  72627  146  1  2   56  100    67 0.650   3.78
 2.05 Intr +  74660  74758   99  2  0   83   76    45 0.385   2.03
 2.06 Intr +  77633  77676   44  2  2  105   98    12 0.037   1.78
 2.07 Intr +  90695  90764   70  0  1  118   64    48 0.092   3.64
 2.08 Intr +  91166  91347  182  2  2  101   74    81 0.728   7.51
 2.09 Term +  91619  91755  137  0  2   62   43    60 0.217  -2.92
 2.10 PlyA +  92440  92445    6                               1.05

 3.05 PlyA -  98527  98522    6                               1.05
 3.04 Term - 100964 100779  186  2  0  102   54   157 0.767  11.19
 3.03 Intr - 105709 105612   98  2  2  121  113    83 0.942  13.83
 3.02 Intr - 107706 107616   91  0  1   93   28    97 0.595   3.77
 3.01 Init - 112705 112616   90  1  0   75   75    16 0.158  -0.47
 3.00 Prom - 119251 119212   40                              -2.16

 4.06 PlyA - 121093 121088    6                               1.05
 4.05 Term - 123461 123289  173  0  2   98   34    60 0.332  -0.41
 4.04 Intr - 129220 129083  138  2  0   92   81    61 0.575   6.24
 4.03 Intr - 145694 145554  141  0  0  116   97    56 0.638   9.62
 4.02 Intr - 158350 158320   31  1  1   81   94    12 0.156  -1.20
 4.01 Init - 163369 163364    6  1  0   82  103     4 0.382   2.04
 4.00 Prom - 164138 164099   40                              -1.66

 5.08 PlyA - 165430 165425    6                               1.05
 5.07 Term - 167528 167368  161  0  2  113   43   186 0.747  14.70
 5.06 Intr - 204279 204170  110  2  2  111   75    77 0.802   8.63
 5.05 Intr - 220920 220744  177  2  0  102   82   209 0.827  20.73
 5.04 Intr - 230679 230629   51  2  0   70   91    52 0.425   1.72
 5.03 Intr - 246838 246666  173  1  2  123   28   205 0.079  16.74
 5.02 Intr - 250533 250517   17  1  2   55   92     8 0.129  -6.84
 5.01 Init - 255060 254988   73  0  1   77   71    63 0.460   4.73
 5.00 Prom - 255303 255264   40                              -7.86

 6.00 Prom + 256347 256386   40                              -8.56
 6.01 Sngl + 256995 257561  567  2  0   14   37   412 0.475  24.85
 6.02 PlyA + 258061 258066    6                               1.05

 7.00 Prom + 263079 263118   40                              -1.86
 7.01 Init + 264513 264568   56  2  2   87   41    74 0.131   3.36
 7.02 Term + 274294 274465  172  1  1   65   37   123 0.253   2.20
 7.03 PlyA + 275323 275328    6                               1.05

 8.00 Prom + 276503 276542   40                              -2.26
 8.01 Init + 283307 283336   30  1  0   62  110     8 0.202   0.14
 8.02 Term + 293779 293895  117  0  0   83   44    83 0.424   1.94
 8.03 PlyA + 294455 294460    6                               1.05

 9.05 PlyA - 295419 295414    6                               1.05
 9.04 Term - 297178 297128   51  1  0   86   43    47 0.017  -2.57
 9.03 Intr - 318910 318762  149  2  2   76  115    93 0.891  10.75
 9.02 Intr - 326075 326022   54  0  0  101   86    13 0.636   1.35
 9.01 Init - 333685 333427  259  1  1   79   97   190 0.812  16.30
 9.00 Prom - 356533 356494   40                              -3.36

10.00 Prom + 359069 359108   40                              -4.76
10.01 Init + 367926 368022   97  2  1   73   80    82 0.454   4.37
10.02 Term + 368229 368404  176  1  2   42   48   214 0.368  10.72
10.03 PlyA + 369177 369182    6                               1.05

11.00 Prom + 370144 370183   40                              -6.86
11.01 Init + 371699 371777   79  1  1   93   61    52 0.408   4.22
11.02 Intr + 392534 392652  119  0  2  145   50   -12 0.001   0.88
11.03 Intr + 406513 406587   75  0  0  113   84    80 0.915   9.81
11.04 Intr + 407377 407528  152  0  2   43   33    84 0.779  -2.64
11.05 Intr + 408045 408149  105  0  0  108    9    87 0.684   2.13
11.06 Intr + 408522 408764  243  0  0   68  -20   326 0.708  16.41
11.07 Intr + 409661 409734   74  2  2  105  110    56 0.472   8.55
11.08 Term + 410056 410135   80  2  2   74   42    51 0.301  -2.97
11.09 PlyA + 412058 412063    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  90167  90264   98  1  2   79   69   130 0.876   8.01
S.002 Term - 246838 246662  177  1  0  123   36   220 0.898  17.99
S.003 Init - 396157 396088   70  1  1   97   48    87 0.867   6.81

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:125338890_125772574|GENSCAN_predicted_peptide_1|407_aa
XDPSPRLSAQAQVAEDILDKYRNAIKRTSPSDGAMANYESTVLTHSTRNGLPDHTDPEDN
EIVCFLKVQIAEAINLQDKNLMAQLQETMRCVCRFDNRTCRKLLASIAEDYRKRAPYIAY
LTRCRQGLQTTQAHLERLLQRVLRDKEVANRYFTTVCVRLLLESKEKKIREFIQDFQKLT
AADDKTAQVEDFLQFLYGAMAQDVIWQNASEEQLQDAQLAIERSVMNRIFKLAFYPNQDG
DILRDQVLHEHIQRLSKVVTANHRALQIPEVYLREAPWPSAQSEIRTISAYKTPRDKVQC
ILRMCSTIMNLLSLANEDSVPGADDFVPVLVFVLIKVPLHFDHWRPYFPFRVAVAVTAEA
ASAAAALGQHARFPGLRRTATCKGKSSSTQPCSCHSSCQSLLVGICV

>gi568815589r:125338890_125772574|GENSCAN_predicted_CDS_1|1224_bp
natgatcccagccctagactcagtgcacaagctcaggtggctgaggatattctggacaaa
tacaggaatgccattaaacggaccagccccagtgatggagcaatggcaaactatgaaagt
acagtgctgacccattcaacaaggaatggtttaccagaccacacagacccagaagacaat
gaaattgtatgcttcttaaaagttcaaatagctgaagcaattaatttacaagataagaat
ctaatggctcaacttcaagaaacaatgcgctgtgtgtgccgttttgataataggacttgt
aggaaactgctggcttcgattgctgaggactacagaaaaagagccccatatattgcttat
ctcactcgttgtcgacaaggactacagaccacacaggctcacctggaaaggctattgcaa
agagttttgcgggacaaagaagtggccaatcgatactttaccactgtctgtgtgagatta
ctgcttgagagcaaagaaaagaagatcagggaattcattcaagactttcagaaactcacc
gcagctgacgataaaactgctcaggtagaagattttctgcagtttctttatggtgcaatg
gcccaggatgtcatatggcaaaacgcgagtgaagaacagcttcaagatgcacagctggcc
attgagcgaagcgtgatgaaccggattttcaagctcgccttctaccctaatcaagatggg
gacatacttcgcgaccaggttcttcatgaacatatccagagattgtctaaagtagtgact
gcaaatcacagagctcttcagataccagaggtttatcttcgagaagcaccatggccatct
gcacaatcagaaatcaggacaataagtgcttataaaaccccccgggacaaagtgcagtgc
atcctgagaatgtgctctacgattatgaacctcctgagcctggccaatgaggactctgtc
cctggagcggatgactttgttcctgtgttggtgtttgtgttgataaaggtccctttacat
tttgatcactggcgaccctattttccttttcgtgtggctgtggcggtgacagcagaggct
gcatctgctgctgctgctttggggcagcatgctaggtttccaggactgcgacgcacagcg
acctgcaaaggcaagagcagcagcacccagccctgcagctgtcacagttcttgccagtcc
ctgctggttggtatctgcgtctga

>gi568815589r:125338890_125772574|GENSCAN_predicted_peptide_2|326_aa
MAMMRMVTVPWAQLCAKSFLIPLMGAGRRRDLGWEPPEEEETNREPEGAGYTAGLTGRPG
AGNLATESGRCHRWASAAKAPETGPPYPRAFPAARACEGAEAGPSPEFLNHLQGCLHLHL
PLLLCPDPMCWVNWVLPSPPALDANPRAASAWIKGMRHRAQLGCGFSRREQEGKPRIQGS
KSRVRPLLQAALPGTQKGWLLRSLTMEGGHMGGELASMSGRCPPLSPSNGMHQGRGSFSP
GMFSGPGYRQDTGRTCPPPLPSCLLQTSGHPGTRGWVKALDSSCTDAAHSTSAFPCRDAD
ALYKSQLTRLLQSQTLVGAQTVAHLC

>gi568815589r:125338890_125772574|GENSCAN_predicted_CDS_2|981_bp
atggcgatgatgaggatggtgaccgtaccgtgggctcagctgtgcgccaagtcctttctg
atccccttgatgggtgctgggcggcgtcgagacttggggtgggagcccccggaggaggaa
gagacgaaccgggagccagagggagccgggtacaccgcgggactcacggggcgccccggg
gccgggaatctagctacggaatccggtcgctgccaccgctgggcctcggctgcgaaggct
ccagaaacgggtcccccgtaccctcgggccttcccagccgcgcgggcgtgcgagggagcc
gaggctggaccaagtcctgagttcctaaaccaccttcagggctgcctgcacctgcacctc
ccgctcctgctgtgccctgatcccatgtgctgggtgaactgggttcttccgagccctcca
gccctggatgctaaccccagagctgcaagtgcttggattaaaggcatgcgccaccgcgcc
cagctaggatgtggcttttcgagaagagaacaggagggcaagccacggatccagggctcc
aaaagcagggttcggcccttgcttcaggcggcactgccaggtacccagaaagggtggctg
ctgaggtccctgaccatggaaggtgggcacatgggtggcgagctggcctccatgtctggg
aggtgtcctccactgtcaccctccaatgggatgcatcagggccggggcagcttttcccca
ggcatgttctcaggcccgggctacagacaggacacggggagaacgtgtccacctccactc
cccagctgcctgctgcagacatctggtcacccgggcactagaggatgggtcaaagccctg
gactcatcctgcacggatgcagctcactccaccagtgccttcccctgccgggatgcagac
gctctttataaatctcaattaacaaggctgctgcagtcacagacactggttggagcccaa
actgtggctcatctctgttga

>gi568815589r:125338890_125772574|GENSCAN_predicted_peptide_3|154_aa
MLMSVLDMLHWGPCGGGSHTHCSALKLWRKMAVLVHSMANKGARIVPPHRDVTDKILMLW
GHAIFKLTYLSNHDYKHLYFESDAATVNEIVLKCWFCCWTPTHQAASSPGPGPSLSAVAL
GKRLAVQLLHRVEDAHATPGRLPGLRGSSSVQKA

>gi568815589r:125338890_125772574|GENSCAN_predicted_CDS_3|465_bp
atgctgatgtctgttctggacatgctgcactgggggccatgtggaggtggcagccatacc
cactgctcagctctgaaactctggagaaagatggctgtgcttgttcacagcatggccaac
aaaggtgcccgcattgttcctccacacagggatgtcaccgacaagattctgatgttgtgg
ggtcacgcaatatttaaactcacgtatctaagcaatcacgactataaacacctctacttt
gaatcggacgctgctaccgtcaatgaaattgtgctcaagtgctggttctgctgctggaca
cccactcaccaggcagcaagctccccagggccaggaccaagtctcagcgctgtagcgctg
gggaagcgcctggcggtgcagctgctacacagagtggaagatgcacatgcgactccgggc
agactcccagggcttcgtggaagcagcagcgtgcagaaggcctga

>gi568815589r:125338890_125772574|GENSCAN_predicted_peptide_4|162_aa
MVVILRKGLNSPGSRADGVFEEDSQIDIATVQDMLSSHHYKSFKVSMIHRLRFTTDVQLG
ISGDKVEIDPVTNQKASTKFWIKQKPISIDSDLLCACDLAEEKSPKLQASHVYVWLNVPV
CAFLTPSPPPTKESRETVAEGFMLVTEITLGTPQLIRRILAT

>gi568815589r:125338890_125772574|GENSCAN_predicted_CDS_4|489_bp
atggtggttatactgaggaagggcctcaactcaccaggttcaagggcagacggggttttt
gaggaggattcgcaaattgacatagccacagtacaggatatgcttagcagccaccattac
aagtcattcaaagtcagcatgatccacagactgcgattcacaaccgacgtacagctaggt
atctctggagacaaagtagagatagaccctgttacgaatcagaaagccagcactaagttt
tggattaagcagaaacccatctcaatcgattccgacctgctctgtgcctgtgaccttgct
gaagagaaaagccccaagctgcaggcctcacatgtatatgtgtggctgaatgtgccagta
tgtgcattcctcacccccagcccacccccaacaaaggagagcagggagactgtggctgaa
ggattcatgcttgttactgaaatcaccctgggcacaccccagcttatccgtcggatcctc
gccacctga

>gi568815589r:125338890_125772574|GENSCAN_predicted_peptide_5|253_aa
MPVKKKIGVGYVKFQILTAFAKPSGSSVCQGHVGTTATKKIDVYLPLHSSQDRLLPMTVV
TMASARVQDLIGLICWQYTSEGREPKLKISYAKQLIFLAFEFYHSDNVSAYCLHIAEDDG
EVDTDFPPLDSNEPIHKFGFSTLALVEKYSSPGLTSKESLFVRINAAHGFSLIQVDNTKV
TMKEILLKAVKRRKGSQKVSGPQYRLEKQSEPNVAVDLDSTLESQSAWEFCLVRENSTLA
ASLHARFVRCKLA

>gi568815589r:125338890_125772574|GENSCAN_predicted_CDS_5|762_bp
atgccagtgaagaagaaaattggtgttggatatgttaaatttcagattctgactgcattt
gcaaagccatcagggtcttctgtgtgccagggtcatgtaggtacaacagcaaccaagaag
atcgatgtctacctccctctgcactcgagccaggacagactgctgccaatgaccgtggtg
acaatggccagcgccagggtgcaggacctgatcgggctcatctgctggcagtatacaagc
gaaggacgggagccgaagctcaaaatttcttatgccaagcagctgatatttctggctttt
gaattttaccacagtgacaatgtcagtgcctactgcctgcatattgctgaggatgatggg
gaggtggacaccgatttccccccgctggattccaatgagcccattcataagtttggcttc
agtactttggccctggttgaaaagtactcatctcctggtctgacatccaaagagtcactc
tttgttcgaataaatgctgctcatggattctcccttattcaggtggacaacacaaaggtt
accatgaaggaaatcttactgaaggcagtgaagcgaagaaaaggatcccagaaagtttca
ggccctcagtaccgcctggagaagcagagcgagcccaatgtcgccgttgacctggacagc
actttggagagccagagcgcatgggagttctgcctggtccgcgagaacagtacgttggcc
gcctcgctccatgcccgctttgtccgttgcaaacttgcttag

>gi568815589r:125338890_125772574|GENSCAN_predicted_peptide_6|188_aa
MQGHTRWIGVVEPKRAVSREDSQRPGAHLTVKKIFVGGIKEDTEEHHLRDYFEQRGKTEV
IEITTDRGSGKKRGLAFVTFDGHDSVDKTVIQKHHTVNGHNCEVRKALSKQEMANTSSSE
RGRSGSGNFGGGREGDFGRNDNFGHGGNFSGCGAFGGNHGGGGYGGSRDGYNGFGTDGSN
FGGWWKLQ

>gi568815589r:125338890_125772574|GENSCAN_predicted_CDS_6|567_bp
atgcaaggccacacaaggtggatcggagttgtggaaccaaagagagctgtctcaagagaa
gattctcaaagaccaggtgcccacttaactgtgaaaaagatatttgttggtggcattaaa
gaagacactgaagaacatcacctgcgagattattttgaacagcgtggaaaaactgaagtg
attgaaatcacgactgaccgaggcagtggcaagaaaaggggacttgcctttgtaaccttt
gatggccacgactccgtggataagactgtcattcaaaaacaccatactgtgaatggccac
aattgtgaagttaggaaagctctgtcaaagcaagagatggctaatacttcatccagcgaa
agaggtcgaagtggttctggaaactttggtggtggtcgtgaaggtgacttcggtaggaat
gacaactttggtcatggaggaaacttcagtggttgtggtgcctttggtggcaaccatggt
ggtggtggatatggtggcagtagggatggctataatggatttggtactgatggaagcaat
tttggagggtggtggaagctacaatga

>gi568815589r:125338890_125772574|GENSCAN_predicted_peptide_7|75_aa
MGEQAHDINMASSSNTSKGNEVQRRPEGVLPELNMKPVNPSSTKGGLYHPDSLFSGIKRI
FPNCWKECWLMAISC

>gi568815589r:125338890_125772574|GENSCAN_predicted_CDS_7|228_bp
atgggcgagcaagcacatgacatcaacatggcatcgagctccaacacaagcaaaggaaac
gaagtccaacgcagacctgaaggtgtgctcccagaacttaacatgaagccagtaaaccca
agcagcaccaagggtggactgtaccacccagattccctcttttcaggaataaagcgcata
tttcccaactgctggaaggaatgctggctgatggctatcagctgttag

>gi568815589r:125338890_125772574|GENSCAN_predicted_peptide_8|48_aa
MSSGVLQHCRPSAFHKCHQPIINVLIQVPDSNEHTRDLQLATKDSSPV

>gi568815589r:125338890_125772574|GENSCAN_predicted_CDS_8|147_bp
atgagctctggtgttctgcagcactgtaggccttctgccttccacaagtgtcatcagcct
atcatcaatgttctcattcaggtccctgacagcaatgaacacacaagagacctacagctt
gccacgaaagatagctctccagtttga

>gi568815589r:125338890_125772574|GENSCAN_predicted_peptide_9|170_aa
MAFLDNPTIILAHIRQSHVTSDDTGMCEMVLIDHDVDLEKIHPPSMPGDSGSEIQGSNGE
TQGYVYAQSVDITSSWDFGIRRRSNTGEKTGAQRPNNFPKLLVSTQELKSLFEKKSLKEK
PPISGKQSILSVRLEQCPLQLNNPFNEYSKFDGKEKALQSSSCSHYTSRS

>gi568815589r:125338890_125772574|GENSCAN_predicted_CDS_9|513_bp
atggccttcttggacaatccaactatcattctagctcatattcgacagtcacatgtgacc
agtgatgacacgggaatgtgtgagatggttctcattgatcatgatgttgacctagagaag
attcatcctccttcaatgcctggagacagtgggtcagaaattcagggaagcaatggtgag
actcagggctatgtatatgcccagtcagtcgatattacctcaagttgggactttggtatt
agaagacgctcaaacacaggtgagaaaactggtgctcagaggccaaataactttcccaag
ttacttgtaagtacccaggagttaaagtcactgtttgaaaaaaaatctctcaaagagaag
cctccaatttctgggaagcagtcgatattatctgtacgcctagaacagtgccctctgcag
ctgaataacccttttaacgagtattccaaatttgatggcaaggagaaggcacttcaaagc
agcagctgttcccattacaccagccgcagctga

>gi568815589r:125338890_125772574|GENSCAN_predicted_peptide_10|90_aa
MARKRTRPGHLNKGPAGGPPGSAETGGVRGPPAAASRVSPHAPAARRRRPAEQQPYYPEP
HTTRNHTPAFRKSPTGKEPLAGDVVRKPQS

>gi568815589r:125338890_125772574|GENSCAN_predicted_CDS_10|273_bp
atggccaggaaacgaacgcgcccgggccacctgaacaagggcccggcaggcggcccgccg
ggcagcgctgaaactggaggggtgagagggccgccagctgccgcttcccgggttagccct
catgcccctgctgctcgccgccgccggccggccgagcagcagccctattaccccgagccg
cacacgacccggaaccacaccccggcgttccggaagtccccgaccgggaaggagcctctg
gctggggacgtagtgcgcaagccccagagctga

>gi568815589r:125338890_125772574|GENSCAN_predicted_peptide_11|308_aa
MVDRHGVKRKPDVLPMASATEEQHWGGIAAKGSITQAIDPIPSTSPGIHSWGTFIPLHLS
CYRLLPVRQLTEFGEGTKRLFRTTEPKDSSREERESAPGAVRGVKLTGAAGEHTVGFAAS
CPLSHTQPPAPMTAKSMEQESGPPPPPPPAAAAAASPGAAGARIPPGRGGGGGRGGHLRL
SRRPLPPARGGMDDQSRMLQTLAGVNLAGHSVQGGMALPPPPHGHEGADGDGRKQDIGDI
LHQIMTITDQSLDEAQAKKHALNCHRMKPALFSVLCEIKEKTVFKRSNGRLDARTESPGE
ASPELCFT

>gi568815589r:125338890_125772574|GENSCAN_predicted_CDS_11|927_bp
atggtggacagacatggggtgaaaaggaaaccagacgtgctccccatggcatctgcaact
gaggaacagcactggggaggtattgctgccaagggcagcatcacccaggccatagacccc
atcccctctacttccccagggattcactcatgggggacttttatccccctccatctctcc
tgctacagactccttcctgtaagacagcttactgagtttggagaaggaacaaagagactt
ttccgaaccaccgaaccaaaggactcatctagggaagagcgggaaagtgcaccaggagct
gtccgaggcgtgaagctcacgggcgcggcgggggagcacacggttggctttgcagcgagt
tgcccactgtcccatacacaaccccccgctcccatgactgccaaaagcatggagcaggag
tcggggcctcctcctccgccgccgcccgccgccgccgccgctgcctcccccggggccgcc
ggggctcggatcccgcctggccgaggcggcggcggcggccgaggagggcaccttcgcctc
agccgccgcccgctcccgcccgcgcgcggcgggatggacgatcaatccaggatgctgcag
actctggccggggtgaacctggctggccactcggtgcaggggggcatggccctgccgcct
cccccgcacggccacgaaggggcggacggcgacggcaggaagcaggacatcggcgacatc
ctccaccagatcatgaccatcaccgaccagagcttggacgaggcgcaagcaaagaaacat
gccctgaactgtcacagaatgaaaccagcgctcttcagcgtcctgtgtgagatcaaagag
aaaacagtttttaaaaggagcaacgggagactggatgccaggaccgagtccccgggagaa
gcatcgcccgaactttgttttacttag