GENSCAN 1.0    Date run:  3-Nov-116    Time: 03:42:26

Sequence gi568815588f:62276194_62499786 : 223593 bp : 38.88% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3334   3473  140  1  2  -15   91   159 0.596   5.09
 1.02 Intr +   4404   4655  252  1  0   55   44   175 0.043   6.28
 1.03 Intr +   4767   4928  162  1  0   15   76   128 0.065   3.33
 1.04 Intr +   7956   8062  107  1  2   43   87    29 0.006  -2.69
 1.05 Intr +  30919  31047  129  0  0   72   81    49 0.031   2.57
 1.06 Intr +  33912  33957   46  2  1  131   86    39 0.043   5.06
 1.07 Term +  40895  41214  320  0  2   40   33   179 0.096   1.66
 1.08 PlyA +  41465  41470    6                               1.05

 2.00 Prom +  48390  48429   40                              -0.95
 2.01 Init +  62119  62305  187  0  1   65   90   126 0.373   9.77
 2.02 Term +  70521  70606   86  1  2   90   48    84 0.715   1.44
 2.03 PlyA +  71327  71332    6                               1.05

 3.02 PlyA -  72701  72696    6                               1.05
 3.01 Sngl -  75829  75260  570  1  0   53   53   218 0.916  10.71
 3.00 Prom -  76481  76442   40                              -3.45

 4.00 Prom +  76912  76951   40                              -9.65
 4.01 Init +  78226  78350  125  0  2   79  131    63 0.883   9.40
 4.02 Intr +  84788  84929  142  2  1   37  101   110 0.785   6.63
 4.03 Intr +  91099  91265  167  0  2   93   58   113 0.213   6.64
 4.04 Intr +  97807  97912  106  1  1   70   21    90 0.089  -0.20
 4.05 Intr +  98219  98265   47  1  2   98   94   115 0.490   9.39
 4.06 Intr +  99988 100743  756  1  0  122   89   752 0.130  68.35
 4.07 Intr + 112203 112566  364  0  1   93   94   280 0.051  23.36
 4.08 Term + 123399 123596  198  2  0   56   42   218 0.739  10.42
 4.09 PlyA + 123821 123826    6                               1.05

 5.03 PlyA - 124256 124251    6                               1.05
 5.02 Term - 127108 127019   90  1  0  109   34    79 0.155   1.34
 5.01 Init - 131991 131923   69  0  0   66   95    45 0.107   4.10
 5.00 Prom - 135215 135176   40                              -3.65

 6.00 Prom + 142562 142601   40                              -2.15
 6.01 Init + 160464 160584  121  2  1   59   52    97 0.038   3.60
 6.02 Intr + 172438 172524   87  2  0  105   99   -10 0.076   0.82
 6.03 Intr + 175141 175260  120  2  0   52   72   120 0.483   6.35
 6.04 Intr + 175467 175681  215  1  2    6   30   115 0.026  -4.99
 6.05 Intr + 183548 183604   57  1  0  109   88    87 0.653   8.96
 6.06 Intr + 184965 185100  136  2  1   26   63   123 0.386   2.82
 6.07 Intr + 190023 190175  153  1  0   51   58   130 0.092   5.42
 6.08 Intr + 190264 190390  127  2  1   87   27    79 0.025   0.52
 6.09 Intr + 216995 217032   38  2  2  122  110    60 0.108   8.59
 6.10 Term + 221378 221460   83  0  2   82   48    48 0.028  -2.92
 6.11 PlyA + 222666 222671    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  40882  41214  333  0  0   39   33   186 0.803   3.98
S.002 Term -  52884  52777  108  0  0   56   48   155 0.812   5.83
S.003 Intr - 117065 116885  181  1  1   78   12   188 0.829   9.15
S.004 Init - 118037 117967   71  2  2  112   74    68 0.947   8.37

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:62276194_62499786|GENSCAN_predicted_peptide_1|385_aa
XVCEPVSSSASVDEEYTLITEAVSEKELCALRNVVFGLPFAVMNVQAALAAFPRGRARDL
QPAMPEPPTHSMGSCAARASPTSTTPCSTAPSPIDHPRAGECERTARDWQAAPPAALVWD
PLGEASWAPESAALVRMWRTFMSSSGLVNTPVDTLYLAALVRTWRTFMSSSGIVNTPIGT
LYLAQGTLCPQGPSPYIPLSTAATFERLLVQFVFPAVSLKATAYVLAAQCRVSVTPFACT
LLAPKFLSSIQEELAHISELKMVKLPLVLFTDHIQLELKERPYAWVPTAGAVPLRKLWGS
SSREGLCLASSVELGEAASFTDARLAVWRGSNGVEMVKGVSEQRSLRHPHKVPKTQTAGG
AVGCFLERPSDRQRPLIESWTQEAL

>gi568815588f:62276194_62499786|GENSCAN_predicted_CDS_1|1158_bp
nnagtatgtgagcctgttagtagttctgcatcggttgatgaagaatatacactgatcact
gaagctgtcagtgagaaggaactctgcgccttacggaatgttgtttttggcttgccattt
gctgttatgaatgtccaggcagccttagctgccttccctcggggcagggctcgggacctg
cagcccgccatgcctgagcctcccacccactccatgggctcctgtgcagcccgagcctcc
ccaacgagcaccaccccctgctccaccgcacccagtcccatcgaccacccaagggctggg
gaatgtgagcgcacggcgcgggactggcaggcagctccacctgcagccctggtgtgggat
ccactaggtgaagccagctgggctcctgagtctgctgctctggtgaggatgtggagaacc
tttatgtctagctcagggcttgtaaatacaccagtggacactctgtatctagctgctctg
gtgaggacgtggagaacctttatgtctagctcagggattgtaaatacaccaattggcact
ctgtatctagctcaaggtacattatgtcctcaaggcccatccccttatattcctctctca
acagcagcaacatttgaaagactcttggtgcagtttgtctttccagctgtgagcctaaaa
gctacggcttatgtgttagcagctcagtgtagggtcagtgtgacaccttttgcatgcacc
ctcttggcacccaagttcttgtctagcattcaggaggaattagctcacataagtgagttg
aagatggtaaagctcccactggttctcttcactgaccacatccaactggagctaaaggag
agaccctatgcctgggtccccactgctggggctgttcctttgagaaagctttggggatcc
agcagcagggaaggtctgtgccttgcttccagtgtggagctaggagaggcagcaagcttc
acagatgcccgtttggcagtatggcggggaagcaatggcgtagaaatggtgaaaggggtg
tctgaacagaggagcctgagacaccctcacaaggtccccaaaacccagacggctggggga
gcagtaggatgctttctagagagacccagcgataggcaaaggccattgatagaaagctgg
acccaagaggctttatag

>gi568815588f:62276194_62499786|GENSCAN_predicted_peptide_2|90_aa
MPAIMYPYKPQTPGSMRRPTEEQKSSRATWQRGREGKEHLNIKRSLAGDDQRGVSCRTAE
LQVFLSWSWRLTALQLHDQPLEGQYEFGAP

>gi568815588f:62276194_62499786|GENSCAN_predicted_CDS_2|273_bp
atgcctgctatcatgtacccatataaaccccaaaccccaggatctatgaggagaccaaca
gaagagcagaagagcagcagagcaacatggcagagagggagagaaggaaaggagcatctg
aatatcaagaggagtttggctggggacgatcagagaggagtcagctgcaggacagctgaa
ctccaggttttcttgtcatggtcttggaggctaacagcattacaactacatgatcagcca
ctggaggggcagtacgagtttggagctccttga

>gi568815588f:62276194_62499786|GENSCAN_predicted_peptide_3|189_aa
MWFCGPGPGSPCCVQPKDLVPCISATPAVAERGQHRAWVVASESASPKSWQLPYGVEPAS
TQKSRTEVWEPLPRFQRLYGNAWIPRQKFAAGEGPSWRMSARAVQKGKVGLEPQHRVPTG
APPSRAVRRGPWPSRPQNGRSTNSLHCVPGKATDTQHQPVKAARRKAVPCKATGAELPKT
TSCISVTLM

>gi568815588f:62276194_62499786|GENSCAN_predicted_CDS_3|570_bp
atgtggttttgtgggccaggcccagggtccccatgctgtgtgcagcctaaggacttggtg
ccttgcatctcagccaccccagccgtggctgaaaggggacaacatagagcttgggttgtg
gcttcagagagtgcaagccccaagtcttggcagcttccatatggtgttgagcctgcgagc
acacagaagtcaagaactgaggtttgggaacctctgcctagatttcagaggctgtatggg
aatgcctggatccccaggcagaaatttgctgcaggggaggggccctcatggagaatgtct
gctagggcagtgcagaagggaaaggtggggttggagccccaacacagagtccctactggg
gcaccacctagtagagctgtgagaagaggaccatggccctccagaccccagaatggtaga
tccaccaacagcttgcactgtgtgcctggaaaagccacagacactcaacaccagccagtg
aaagcagccaggaggaaagctgtaccctgcaaagccacaggagccgagctgcccaagacc
acctcttgcatcagtgtgaccctaatgtga

>gi568815588f:62276194_62499786|GENSCAN_predicted_peptide_4|634_aa
MVGTLWLRVLPEIAARCRLELQSSEGLTGAVDSVIKALLQSRDRQYETLAMLGSSSELQL
PVSYMVTRVDNHTLQCTVLPDDFAQLQANKMSEFLLYESGTFSAPPTRPSCDLCTRWFLF
TSAMSGSSPSPSTEADVLLVQPAEPAKKLARACRKRRDCEADTRLSFPPPGNDPSISRSD
HQHRRRRLQQQQQQVGTFRVMQQKAFEESRYPWQESFENVAVCLPLRCPRCGDHTRFRSL
SSLRAHLEFSHSYEERTLLTKCSLFPSLKDTDLVTSSELLKPGKLQSSGNVVKQKPSYVN
LYSISHEHSKDRKPFEVVAERPVSYVQTYTAMDLHADSLDGTRSGPGLPTSDTKASFEAH
VREKFNRMVEAVDRTIEKRIDKLTKELAQKTAELLEVRAAFVQLTQKKQEVQRRERALNR
QVDVAVEMIAVLRQRLTESEEELLRKEEEVVTFNHFLEAAAEKEVQGKARLQDFIENLLQ
RVELAEKQLEYYQSQQASGFVRDLSGHVVSHPGPATECKIPVVGENGLRTEKRELGDLQG
LSRHHLRALPGPCLYCGNVTGESRTPGSKGRNHLKKAKDDRASMQPAKAIHEQAESSRDL
CRPPKKGELLGFGRKGNIRPKMAKKKPTAIVNII

>gi568815588f:62276194_62499786|GENSCAN_predicted_CDS_4|1905_bp
atggttggtacactctggctcagggtcttacctgagattgcagccagatgtaggctagag
ctgcagtcatctgaaggcttgaccggggcagtggattcagttataaaggctctcttacaa
agccgtgataggcagtatgaaacccttgcgatgctgggcagcagcagtgagctgcagctc
ccagtcagctatatggtcacgagagtagataaccataccctacagtgtactgtgctgcca
gatgattttgcccaactgcaggctaataaaatgagtgagttcttactctatgagtctggc
accttctctgcccctccaacccgcccatcatgtgatctctgcacacgttggttcctgttc
acttctgccatgagtggaagtagcccaagtccctccacagaagcagatgtgctccttgta
cagcctgcagaacctgcaaaaaagcttgccagggcctgcagaaagcgccgcgactgcgag
gctgacacgcgtctttccttccctcccccggggaatgatcccagcatctctcgcagtgat
catcagcaccggaggcggcggctgcagcagcagcagcaacaagtcgggacttttagagta
atgcaacagaaggcttttgaggaaagcagatatccctggcaggagtcctttgagaatgtt
gctgtgtgcctgccattacgctgcccgaggtgtggagaccataccagatttagaagcttg
tcatccttgagggcccatctggagttcagtcacagctacgaagaaagaaccctcttgaca
aaatgcagtctctttccatccctcaaagacacagacctagtcacttcctcagaactcctg
aaaccgggaaaattgcagagcagtggcaacgtggtaaagcagaaaccgagctatgttaac
ttgtacagcatttcacatgaacattccaaggacaggaagccatttgaggtggtggcagag
aggcctgtgtcctatgtgcagacctacactgccatggacctccatgcagactcgctggat
gggacacggtcgggtcctggactgcccacctcagacaccaaagcttctttcgaggcacat
gtcagagaaaaattcaatcgaatggttgaggctgtggataggaccattgagaagagaatt
gataaactcaccaaagagttggcccagaaaactgcggaactgttggaagttcgggcagct
tttgtgcagctgactcagaaaaagcaggaagttcagagacgagagcgggccttaaacaga
caggtggacgtggccgtggaaatgatagctgtactgaggcaacgcctgacggaatctgag
gaggagcttcttaggaaagaagaagaagttgtcacattcaaccatttcctggaagcggca
gctgagaaggaggttcaagggaaagcccggctccaggactttattgagaatctgttacag
cgggtagaactggcggagaagcagcttgagtactatcagagccagcaggcctctggcttt
gtccgtgatctcagcgggcacgtggtgagtcaccccgggccagccaccgagtgtaagatc
cctgtggttggtgagaatgggctgaggacagagaaaagggagctgggtgacttgcaggga
ctctctagacatcatttgagagcactgcctgggccatgcctgtattgtggaaatgttact
ggggagtccaggacgcctggaagtaagggaaggaaccacctgaaaaaggccaaggatgac
agagccagcatgcagcctgccaaggccattcacgaacaggctgagtcctcaagagacctc
tgcagacctccaaagaaaggggagctcctggggtttggccgcaaaggcaacatcaggccc
aaaatggctaaaaaaaagccaacagccattgtgaacatcatctaa

>gi568815588f:62276194_62499786|GENSCAN_predicted_peptide_5|52_aa
MAKARHHSIYFPDEETGPTVKADPTQREDNDDEVLYDDPLPLNEYNCIFSSL

>gi568815588f:62276194_62499786|GENSCAN_predicted_CDS_5|159_bp
atggccaaggcaagacatcattctatctattttccagatgaggaaactggacccacagtc
aaagcagatcccacccaacgtgaagacaatgacgatgaagtcctctatgatgatccactt
ccacttaatgaatataactgtatcttctcttccttatga

>gi568815588f:62276194_62499786|GENSCAN_predicted_peptide_6|378_aa
MTNIIALTKFLLLVEQSQLTEKANDGRTTNWDLVRSSPVADMVSIIGIYQVQRAYESLRF
LSNLGDYGSVAQFELLVHGLSDVASRVFMAWQRQKAWLSLFPGVTCSVEGAWLQFTSGYR
HSLSGPTQLNNAHHCFQKGFPCVWCRPVEAAPQQHPADATASSELEHASQPPTFQTGGLS
WSWKGAGEARLVCQNDLELEKYERAGLKDKYTSPLWEMGFFKGQFNQLSQGVLSHVGQEP
SAGGAVPGISKLSGATAFPLSVQTLVPAVAVTWGMQPHMKPVPVLAPGAAHSAAAAASMG
SGPVARAEHSLLGCVGETSPAGVSKTQAEALLAIEVSGWSVSGVAFRKILPDFSILIGNL
KYNNLAFTMKVQLQVLKK

>gi568815588f:62276194_62499786|GENSCAN_predicted_CDS_6|1137_bp
atgaccaacatcattgcacttactaagttcctgcttcttgttgagcagagccaattaact
gaaaaagctaatgatggaaggacaaccaattgggatctagtaagatctagccctgtagct
gatatggtgtctattataggtatctaccaagtgcagagggcttacgaaagtctaagattc
ctttcaaatctaggagactatggatctgttgctcaatttgagctgctcgttcatggcttg
tctgatgtggcctccagggtcttcatggcttggcagcgacaaaaagcttggctctcactc
ttccctggtgtgacttgttctgtggagggggcttggctacagttcactagtggttaccga
cattccctctctggccccacccagctcaataatgcccatcattgtttccaaaagggcttc
ccctgtgtctggtgcagacctgtagaggcagctccacagcagcatccagctgacgccaca
gcctcctcagagctggagcatgcttcccaaccaccaacattccaaactggaggattgtca
tggagctggaaaggtgctggcgaagctcgcctggtgtgccaaaatgacctggaattggag
aaatatgagcgagcaggccttaaagacaagtacacatctccactgtgggagatgggattc
tttaaagggcagtttaatcagctttcgcagggcgttctatcacacgtgggccaagagcct
agcgctggaggagctgttcctggcatctccaagctttcaggtgccactgcattccccttg
tctgtccagaccctggtgcctgcagtggcagtcacttggggcatgcagcctcacatgaag
ccagtgcctgtgctggcacctggagctgcccactctgctgcagcagctgcaagtatggga
tctgggccagtagcacgggccgagcacagcctgctgggctgtgtgggtgagacaagccca
gcaggagtaagcaaaacccaagcagaggcactgctggccatagaggtttctggctggtct
gtttcaggtgtggccttccgaaagattttgccagatttctccattctgattggaaacctc
aaatacaacaatctggctttcacaatgaaggtccaactgcaggtgctcaaaaaatga