GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:57:27 Sequence gi568815588f:92211131_92451204 : 240074 bp : 42.93% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 5202 5663 462 2 0 82 55 473 0.932 39.31 1.02 PlyA + 6159 6164 6 1.05 2.06 PlyA - 8673 8668 6 1.05 2.05 Term - 11404 11358 47 2 2 123 47 18 0.643 -2.41 2.04 Intr - 15500 15414 87 0 0 97 54 48 0.532 1.32 2.03 Intr - 15728 15650 79 2 1 144 76 14 0.797 4.01 2.02 Intr - 29231 28216 1016 0 2 119 90 1169 0.179 109.66 2.01 Init - 63153 63039 115 0 1 69 77 54 0.155 2.82 2.00 Prom - 65852 65813 40 -2.95 3.00 Prom + 66309 66348 40 -1.65 3.01 Init + 77929 77931 3 0 0 103 81 0 0.351 0.85 3.02 Intr + 79367 80477 1111 1 1 -16 85 575 0.314 35.21 3.03 Term + 81020 81231 212 0 2 -18 43 208 0.736 2.17 3.04 PlyA + 85503 85508 6 1.05 4.00 Prom + 99964 100003 40 -6.05 4.01 Init + 115791 115800 10 2 1 96 108 1 0.217 3.65 4.02 Intr + 129543 129673 131 1 2 121 25 81 0.505 4.59 4.03 Intr + 138219 138402 184 2 1 83 110 38 0.775 3.94 4.04 Intr + 138541 138707 167 2 2 115 91 68 0.988 8.56 4.05 Term + 139961 140077 117 1 0 86 41 116 0.996 4.26 4.06 PlyA + 140204 140209 6 1.05 5.07 PlyA - 140414 140409 6 1.05 5.06 Term - 154276 153931 346 0 1 10 39 184 0.003 -1.32 5.05 Intr - 176514 176347 168 2 0 81 66 110 0.023 6.24 5.04 Intr - 184367 184248 120 1 0 81 15 110 0.031 1.69 5.03 Intr - 185450 185348 103 0 1 37 131 66 0.425 4.11 5.02 Intr - 194512 194451 62 0 2 57 121 45 0.754 2.16 5.01 Init - 194709 194561 149 0 2 56 13 202 0.947 7.11 5.00 Prom - 205695 205656 40 -8.85 6.00 Prom + 206467 206506 40 -4.45 6.01 Init + 208255 209099 845 0 2 57 50 429 0.068 30.19 6.02 Intr + 209158 209234 77 1 2 39 18 69 0.073 -6.76 6.03 Intr + 209445 209603 159 1 0 71 -10 167 0.006 4.24 6.04 Term + 228925 229196 272 2 2 96 49 160 0.479 7.56 6.05 PlyA + 229227 229232 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 166123 166003 121 1 1 52 74 69 0.824 2.30 S.002 Sngl - 200056 199838 219 1 0 71 47 196 0.935 8.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:92211131_92451204|GENSCAN_predicted_peptide_1|153_aa MTKIKADPEGPEAQAEVCSGERTYQELLVNQNPIAQLLASRRLKRKLYKCIKKAVKQKQI RRGVKEVQKFVNKGEKGIMVLAGDTLPIEVYCHLPVRCEDRNLPYVYIPSKTDLGAATGS KRPTCVIMVKPHEEYQEAYDECLEEVQSLPLPL >gi568815588f:92211131_92451204|GENSCAN_predicted_CDS_1|462_bp atgaccaaaataaaggcagatcccgaagggcccgaggctcaggcggaggtgtgttccggg gagcgcacctaccaggagctgctggtcaaccagaaccccatcgcgcagctcctggcttct cgccgcctcaagcggaagctctacaaatgcattaagaaagcggtgaagcagaagcagatt cggcgcggggtgaaagaggttcagaaatttgtcaacaaaggagaaaaagggatcatggtt ttggcaggagacacactgcccattgaggtatactgccatctcccagttaggtgtgaggac cgaaatctgccctatgtctatatcccctctaagacggacctgggtgcagccacaggctcc aagcgccccacctgtgtgataatggtcaagccccacgaggagtaccaggaggcttacgac gagtgcctggaggaggtgcagtccctgcccctacccctatga >gi568815588f:92211131_92451204|GENSCAN_predicted_peptide_2|447_aa MTAALREHLFGIYLLNICNGHVLGPSHALSQIVTTTLSAAQTMQDDLLMDKSKTQPQPQQ QQRQQQQPQPESSVSEAPSTPLSSETPKPEENSAVPALSPAAAPPAPNGPDKMQMESPLL PGLSFHQPPQQPPPPQEPAAPGASLSPSFGSTWSTGTTNAVEDSFFQGITPVNGTMLFQN FPHHVNPVFGGTFSPQIGLAQTQHHQQPPPPAPAPQPAQPAQPPQAQPPQQRRSPASPSQ APYAQRSAAAAYGHQPIMTSKPSSSSAVAAAAAAAAASSASSSWNTHQSVNAAWSAPSNP WGGLQAGRDPRRAVGVGVGVGVGVPSPLNPISPLKKPFSSNVIAPPKFPRAAPLTSKSWM EDNAFRTDNGNNLLPFQAPTPTGPFSQLLAPPAGLLPEPPLCPVECRLHKGKDLFTEVSP GAVIVPGIFWAHELREFLLNFDNVLNK >gi568815588f:92211131_92451204|GENSCAN_predicted_CDS_2|1344_bp atgacagctgcactcagggagcacttgtttggcatctacttactgaacatctgcaatggg catgtgttgggcccttcacatgcattatctcagatcgtgacgacaaccctttcagctgcg caaaccatgcaggatgatttactgatggacaaaagcaaaacccagccccagccccagcag cagcagcggcagcagcagcagccccaacctgagtccagcgtatccgaagccccgtccacg cccctctcctcagagacccccaagccggaggaaaacagcgcagtgccggccctcagccca gccgctgcccccccggcccccaacggcccggacaagatgcagatggaatcaccgctcctg ccaggcttgagtttccatcagcctcctcagcagccgccgccgcctcaggagcccgcggca ccgggcgcgtcgctgtcgccgtccttcggcagcacctggtccacgggcaccaccaacgcg gtagaggacagcttcttccaggggatcaccccagtcaacgggaccatgctcttccagaac ttcccgcaccatgtcaacccagtcttcggaggcactttctccccgcagatcggcctggcg cagacccagcaccaccagcagccgccgccgcctgcgcccgcgccgcagccggcacagcca gcgcagccaccacaggcgcagcccccgcagcagcgccgctcacccgccagccccagccag gcgccctacgcgcagaggagcgccgccgcggcgtacggccaccagcccatcatgaccagc aagccgtcctcgtcttcggcggttgcagccgctgctgccgcagccgccgcctcgtcggcc tcgtccagctggaacacgcaccaaagcgtgaatgcagcctggagcgcaccgtccaacccc tggggcggcctgcaggcgggccgggaccctcgccgggcggtcggtgtgggcgtgggtgtg ggtgtcggggtgccttccccgctcaaccccatctcgccgctcaaaaagcccttctccagc aacgtgatcgcgccgcccaagttccctcgcgcggcccctctcacttccaagtcctggatg gaggataacgctttccggaccgataatggtaacaatctgttgccatttcaggcccccacc cccacaggccccttctcccagctcctggcccctcctgcaggtctcctccctgagccccca ctctgcccagtagaatgtaggctccacaagggcaaggaccttttcactgaagtgtctcca ggggctgtgatagtaccagggatattttgggcacatgagctgagagaattcctgctgaac tttgataacgttttgaataaatga >gi568815588f:92211131_92451204|GENSCAN_predicted_peptide_3|441_aa MTVGEPWKAQAHLRRFLERERIIFPSPNSPAPNTQRDSGGSCLVQAVPICPPPQEEPHAP NQPTPRGGGVIPPKWALPQERLAPAPGASEGVSATPTAATDRPDATAGAGDPDKKWPFSP LPSPGSVPPEGKSGGKNKQAGRADLVATDSGAAHPAASWKAARKTFFPPGGRKREGAGQT AAGAEALVRAGCAASSGDGPKTTSPRLLWAALRRNPGAHGGGDSYLTKVAPPPAATRRPR SMDRNLGPTDGNPGRDRRLPASGSSSSLSAASAGLPQALHRRRQASGAAPGSWVTGSRLP PDCGLLRRLSVLLSLGLALSGSTGEGRVRRLRGRCRTKPYSRCWTGTGSCGGDREPPTLA LALPAPALEALVYLTIVEFIEDVSARTGVQLPILGGPQSVKLFVKSGAPPSAVEILVRGS YHEFLETLDPHFPRAKSYFLA >gi568815588f:92211131_92451204|GENSCAN_predicted_CDS_3|1326_bp atgactgtcggcgagccctggaaagcccaggcacatttgagaaggttcctggagagggag agaatcatctttccttcccccaactccccagccccaaacacacagagagactcgggtgga agctgtctggtccaggcagtccctatttgcccgccccctcaagaagaacctcacgcacca aaccaaccgacccctcgcggaggtggtgtaattcccccaaaatgggctctgccgcaggag aggctggctccagcgccgggggcttcggaaggagtttctgccacccccactgccgccact gaccgccccgacgccacggccggggccggggaccctgataagaaatggcccttcagcccc ctcccctcacctggctcggtcccacctgagggcaagagcggaggcaaaaacaaacaggca gggagggctgaccttgttgctaccgacagtggagcggcgcatcctgctgcttcctggaaa gcggcccgaaagacattttttccccctggaggaaggaaacgggagggcgccggccagacg gcggcaggcgcggaggcgttggtccgggcgggctgtgcagcctctagtggagacggtccg aagactacatctcccaggctgctctgggccgccctgcgtcgtaaccctggcgcgcacgga ggcggcgactcttacctcacaaaggtagctcctccgccggcagcaactcggcgcccgcgg tccatggaccggaacctcgggccgacggacgggaacccgggccgcgatcgccgcctcccc gcctcaggctcctcctcctcgctctccgccgcctccgccggactcccgcaggccctgcac cgccgccgccaggctagcggagctgccccgggaagctgggtgacgggttcgcggctgccg ccggactgcggcctactccgccgcctctcagtgctattgtccctgggcctggccttgagc gggtccactggggaaggccgtgtgcgccggctccgcggaagatgccggaccaagccctac agcagatgctggacaggtacgggcagctgtgggggggaccgggagccgccgaccctcgct ctggccctgcccgcgcccgccctcgaggctcttgtgtacttgacgatcgttgagttcatt gaagacgtttcagctcgtacaggagtccagttgcccattcttggggggccccagtcggta aaactctttgtgaagtcgggtgcaccgccctctgctgttgaaattctggtccgtgggtcc taccatgagtttttggagactctagatccacatttcccccgagccaagtcttacttcctg gcatag >gi568815588f:92211131_92451204|GENSCAN_predicted_peptide_4|202_aa MVPSPVVYVLDLADRLISKACPFAAAGIMVGSIYWTAVTYGAVTVMQVVGHKEGLDVMER ADPLFLLIGLPTIPVMLILGKMIRWEDYVLRLWRKYSNKLQILNSIFPGIGCPVPRIPAE ANPLADHVSATRILCGALVFPTIATIVGKLMFSSVNSNLQRTILGGIAFVAIKGAFKVYF KQQQYLRQAHRKILNYPEQEEA >gi568815588f:92211131_92451204|GENSCAN_predicted_CDS_4|609_bp atggtgccaagtccagtggtttacgtcttggatcttgcagatagactgatctcaaaagcc tgtccatttgctgcagcaggaataatggtcggctctatctattggacagctgtgacttat ggagcagtgacagtgatgcaggttgtaggtcataaagaaggtctggatgttatggagaga gctgatcctttattccttttaattggacttcctactattcctgtcatgctgatattaggc aagatgattcgctgggaggactatgtgcttagactgtggcgcaaatactcgaataaacta caaattttaaatagtatatttccagggataggttgtcctgttcctcgaattccagctgag gccaatcctttagcagatcatgtctctgctactcgaatcttgtgtggagcccttgtcttt cctactattgctacaatagttggtaaattgatgttcagtagtgttaactctaatttacaa aggacaatcttgggtggaattgcgtttgttgccataaaaggagcatttaaagtttacttc aaacagcagcaatatttacgacaggcacaccgcaaaattctgaattatccagaacaagaa gaagcataa >gi568815588f:92211131_92451204|GENSCAN_predicted_peptide_5|315_aa MKVLGCLCSVAQAVFFPMLLLLSSHLHLVQEEPVLTMQPYAKGFDSAAALETRVAPGEDT GLPGADAEEKGSCTLTIPQLITYNLGFPSWVLVPVAISAHGLLLRQAAFLVLACHGAVTA AVQLGRELMEAGLLQKTADSYSYLRNQPSQSTWDVGLCSQCFYRAFPLIPIGSHVVNYCP NSNDVSGPRIAVYMSHSSDRWANLRVISLKKEVEKKIGVESLFKGITENVPNLEKGINIQ VQEGYRTPSRFNPKKTTSRHLIIKLPKVKDKERTLKAAREKKQNTQWSSNTSGSRLFSGN LTNQEIVAEEKNLLS >gi568815588f:92211131_92451204|GENSCAN_predicted_CDS_5|948_bp atgaaggttctgggctgtctgtgctctgtggctcaagctgtgttcttcccgatgctgctg ttgctctcctctcacctgcacttggtgcaggaggagccagtccttaccatgcaaccttat gccaagggctttgattcagctgctgccttggagacaagggtggcccctggggaggatact ggactcccaggagcagatgctgaggagaaaggctcctgcacactgaccatcccgcagctc attacttacaatttaggtttcccttcctgggtgctggttcctgtggccatttcagctcat gggttgctgctccgccaagcagcttttctagtcctggcctgccatggtgcagtgactgca gctgtccagcttggcagggagctgatggaagcaggtcttctgcaaaagactgcggactcc tattcctacctaagaaaccaaccaagtcagagtacctgggatgtgggactttgcagtcag tgcttttatcgtgctttcccattgattccgattggcagccatgttgtaaactactgccct aactctaatgacgtttcaggaccaaggatagcggtctacatgtcccactcctctgacaga tgggcaaatctaagagttattagccttaaaaaggaggtagagaaaaagataggagtagaa agcttattcaaaggaataacagagaacgtcccaaaccttgagaaaggtatcaatatccaa gtacaagaaggttatagaacaccaagcagatttaacccaaagaagactacctcaaggcat ttaataatcaaactcccaaaggtcaaggataaagaaaggaccctaaaagcagcaagagaa aagaaacaaaacacacaatggagctccaatacgtctggcagcagacttttcagtggaaac cttacaaatcaggagatagtggctgaagaaaaaaaccttttatcctag >gi568815588f:92211131_92451204|GENSCAN_predicted_peptide_6|450_aa MSPESENLLKKFLIVNCRKRGPSEHTVRGPWMNASHEGEELKRCVQPLSHWEDPLQTKFM VSVGYRQEEIQDLLRSMRKQWSPACSWVLLSSQEVQLRVSASPEQQWFQQPAFPIFHGLR RLKGATQSLNRIRRQGRRAATQQCTSQLPTSLEGTNPTPAVNSDLSTNTKMSRNSSLGAK AGLARPPPRTTTAAPQRWGLRPPELLLLTSTPSPRPPARESQSVSLQPNQASVPHPTVSS WGVPARHSPPRVSLRPPFYPRHQRSGQSPRWDQVRAAGQPNRRRPAEGRRELLQQVQVQV HKLDSAAVFKRLSVEVHVVLLFRQPLGNGSKPLTAAAGARARGAEGLRARASTAHLLGSG EMVIKKFPDLSGLTVGHKTPIPGSCSIPERKKCYNRDTKKNLNRQVLLGFLLSVYYIRSH PLPQLYMTVHSSLHRSIQMDSFSYISGSSF >gi568815588f:92211131_92451204|GENSCAN_predicted_CDS_6|1353_bp atgtccccggagagtgaaaacttgctgaagaaatttctcattgtcaattgcaggaagaga ggcccttcagagcacactgtgaggggtccatggatgaatgcgagtcatgaaggtgaggaa cttaagcgctgtgtgcagccactctcccactgggaggaccccctgcaaactaagttcatg gtatccgtgggttacagacaggaagagatccaggacttgctgaggagtatgaggaagcag tggtcacctgcctgctcctgggttctcctttcatcccaggaggtacagctcagggtctct gccagcccggagcagcagtggtttcaacaacctgcctttcccatctttcacggcctgaga agactcaaaggagccacccagagtctgaacaggatcaggagacagggcagaagggcagca acacagcaatgcaccagccagctccctactagcctggaggggacgaaccccacccccgct gtgaacagcgacctctccactaacaccaaaatgagcaggaattcctcactcggggcaaaa gccggtttggccagacctccacccagaacgacaacggcagccccacagcgctggggcctg aggcctcctgagctcttgcttctcacctctaccccgtctccccggccaccagcccgggaa tcccagtcggtctccttgcagcccaaccaggcctcggtgccgcaccccacagtgagcagc tggggggtgcccgcccggcacagtccccccagagtgtccctgcggcctcccttctacccg cgtcatcagcggagtgggcagagcccccgatgggaccaggtacgcgcggccgggcagccc aacaggcggcggccagcagagggccgccgggagcttcttcagcaagttcaagttcaagtt cacaagctggactcagctgccgtatttaaacgattgtcggtggaagttcacgtcgtgctc ctctttcgacagccgttgggtaacggatcaaaacccttaacggccgcggctggagcgcga gctcggggcgccgaggggctccgggcccgagcgtccaccgctcatctcctgggctcggga gagatggtcataaagaaattccctgacctatctggcttgacagtaggtcataagaccccc attccaggatcctgttccatacctgagaggaagaaatgctacaacagagacaccaagaag aacctcaacagacaggtcttgctgggcttcctcctctcggtctactacattagatcacac cctttgccacagttgtacatgactgtccactcttcattgcatcgaagcatacaaatggac agtttttcctatatctctgggtcttcattttga