GENSCAN 1.0 Date run: 8-Nov-116 Time: 02:37:59 Sequence gi568815592r:46900225_47129061 : 228837 bp : 40.64% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 1863 1858 6 1.05 1.06 Term - 2684 2437 248 0 2 75 43 160 0.706 5.17 1.05 Intr - 3149 2987 163 2 1 50 33 95 0.338 -1.17 1.04 Intr - 6562 6437 126 1 0 82 97 56 0.076 5.86 1.03 Intr - 9248 9163 86 0 2 89 97 -17 0.101 -2.08 1.02 Intr - 13599 13496 104 2 2 35 69 126 0.278 4.30 1.01 Init - 16305 16259 47 0 2 62 92 44 0.233 2.41 1.00 Prom - 21170 21131 40 -4.95 2.00 Prom + 28796 28835 40 -3.65 2.01 Init + 28973 29052 80 1 2 83 66 39 0.435 1.78 2.02 Term + 33381 33723 343 0 1 49 49 189 0.202 4.10 2.03 PlyA + 34334 34339 6 1.05 3.06 PlyA - 34615 34610 6 1.05 3.05 Term - 44280 44103 178 2 1 36 49 143 0.163 1.38 3.04 Intr - 47728 47563 166 2 1 60 77 147 0.538 8.90 3.03 Intr - 54899 54749 151 2 1 106 15 56 0.318 -1.09 3.02 Intr - 55608 55464 145 2 1 98 97 96 0.729 10.86 3.01 Init - 62673 62495 179 0 2 52 80 101 0.486 4.48 3.00 Prom - 78130 78091 40 -2.75 4.03 PlyA - 78304 78299 6 1.05 4.02 Term - 85001 84808 194 0 2 106 55 86 0.757 3.70 4.01 Init - 87571 87529 43 1 1 86 96 13 0.662 2.53 4.00 Prom - 92547 92508 40 -3.45 5.17 PlyA - 92711 92706 6 1.05 5.16 Term - 100071 99998 74 1 2 133 48 81 0.952 5.69 5.15 Intr - 101343 101277 67 0 1 61 113 8 0.136 -1.84 5.14 Intr - 105652 105593 60 1 0 94 86 26 0.135 1.01 5.13 Intr - 110094 108721 1374 0 0 109 94 582 0.488 48.99 5.12 Intr - 111971 111783 189 2 0 60 64 143 0.978 7.96 5.11 Intr - 114620 114457 164 0 2 99 100 109 0.995 11.97 5.10 Intr - 116544 116393 152 2 2 68 90 149 0.999 11.99 5.09 Intr - 120565 120507 59 1 2 61 123 35 0.962 1.06 5.08 Intr - 121834 121734 101 2 2 78 82 64 0.967 3.71 5.07 Intr - 123993 123820 174 1 0 52 82 73 0.690 2.09 5.06 Intr - 125779 125630 150 2 0 106 110 34 0.778 6.61 5.05 Intr - 127994 127929 66 0 0 54 106 39 0.011 0.26 5.04 Intr - 132615 132534 82 0 1 71 91 44 0.036 1.29 5.03 Intr - 135673 135611 63 1 0 135 93 28 0.206 6.00 5.02 Intr - 186033 185954 80 1 2 98 46 57 0.193 0.85 5.01 Init - 186336 186264 73 0 1 42 116 84 0.914 8.16 5.00 Prom - 188225 188186 40 -7.05 6.00 Prom + 189301 189340 40 -3.15 6.01 Init + 196004 196137 134 1 2 59 107 73 0.726 5.42 6.02 Intr + 202405 202462 58 1 1 128 98 1 0.875 3.07 6.03 Term + 205264 205551 288 0 0 71 41 249 0.861 12.89 6.04 PlyA + 208169 208174 6 1.05 7.02 PlyA - 209577 209572 6 1.05 7.01 Term - 222838 222668 171 1 0 16 32 224 0.479 6.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:46900225_47129061|GENSCAN_predicted_peptide_1|257_aa MTISKPMRGSEHISSRESYAYNITYCVDVKAKQLTCGKNLHLLLQFTNKFHRKIESEETR KLLRAARLILRARAYFPALANQLKLEDMKSPRRTTLCLMFIVIYSSKAALNWNYESTIHP LEIPSGQLCPLWLRQSPGDLYTDGAGRGGPIGSSVFLSEVNTLPSTGSPGKPVLPCPVTW QPDLCNSLNLGMAKSIYPSPSQLLRGFIAINVLFPPLSPMIVGLKIVSRLLPLDNEEAVG DRAVEAALAKMWPESSR >gi568815592r:46900225_47129061|GENSCAN_predicted_CDS_1|774_bp atgacaatatccaaaccgatgaggggctctgagcacatcagcagcagagaaagctatgcg tataacatcacctactgcgtggacgtgaaagctaagcagcttacatgtgggaagaacctt caccttctacttcaatttacaaacaaatttcacaggaaaatagagtcagaggagaccagg aagttgctcagagccgcaaggctaatactgagggctagggcatatttcccagctttggcc aaccaactcaaacttgaagacatgaaatccccaaggagaaccactttgtgcctcatgttt attgtgatttattcttccaaagctgcactgaactggaattacgagtctactattcatcct ttggaaattccctctggacagctctgtccactgtggctcagacagtcccctggggacctc tacacagatggagcaggcaggggaggtccaattggaagctctgttttcttgtctgaagta aatacactgccgagcactgggagccctggcaaacctgtgctcccttgtccggtgacttgg cagccagatctttgtaactccttgaatctggggatggccaagagcatttacccatctcca tcccaactcctcagaggcttcattgccatcaatgtgcttttccctccattatctccaatg attgtagggcttaaaatagtaagtaggctgctcccactggacaacgaagaggctgtaggt gacagagctgtggaagcggcacttgcaaagatgtggcctgaaagttccagatga >gi568815592r:46900225_47129061|GENSCAN_predicted_peptide_2|140_aa MDEAGNGHSQQTIARTKNQTPHVLTHRNLKGKRLKTKVFTVAATAIQNGLALYRLSSGLR EIAQPSTAPMAYLSTMRGSTFPSSRPFITTSPSIYQSLPLRTTKLFLDLKHSIQNIATYQ QLHDYSRPIALGFASSDLEY >gi568815592r:46900225_47129061|GENSCAN_predicted_CDS_2|423_bp atggatgaagctggaaacggtcattctcagcaaactatcgcaaggacaaaaaaccaaaca cctcatgttctcactcataggaatttaaagggtaagagattgaagacaaaggtgttcaca gtggctgctactgcaattcaaaatggtctagcattgtacagattgtcgtcgggcctcagg gaaattgctcagccctccacagctcccatggcctatctcagcacaatgagggggtccaca tttccaagttcaagacctttcataactacgtcaccttctatttatcagagtcttcctctc aggaccactaaactatttttggatctgaagcacagtatacaaaacatagctacatatcaa caattacacgattattccagacccatcgcccttggttttgcctcatctgacctggaatat taa >gi568815592r:46900225_47129061|GENSCAN_predicted_peptide_3|272_aa MWGWENKGESKAGYHTLDLNLLTRRMVGTENRRTGIKYFGKCCYLKRNEEEYPDEEGEDS SFQVCCCWRTDCHPAEHNGGSPGLSTTQLVKIHMAQSSHVLSTARTGLWPEDKFPVSSRE PGERHQAPNIGNHVERISYLLQSPYYFPPPSPRPPPPVTPSPPLPNQRPKEQGGPARTNT SLSRDWVVGLCWQVDVQVPPPRSADSGKCAVALWKMGVGAGQGAMLQNLHFSTSTLEQES PPCGPWTSASQCPVRNQVTQQDMSHGRASVIA >gi568815592r:46900225_47129061|GENSCAN_predicted_CDS_3|819_bp atgtggggctgggaaaataaaggagaaagtaaagctggttatcacactttagatttgaac ctgctaactagaagaatggtgggtactgaaaacaggagaacaggtatcaagtactttggg aaatgctgttatttaaagaggaatgaagaagaatatccagatgaggagggagaagacagc agcttccaagtgtgctgctgttggaggactgactgtcaccctgctgaacacaatggaggt agtccaggtctctcaacaactcagcttgtgaaaatccacatggcccagagttcccacgtt ctttctacagccaggactggcttgtggccagaggacaagtttcctgtttcctctagggag cctggagagaggcatcaggcaccaaatataggaaatcatgtggagaggatctcttattta ttacagtccccatattacttccctcccccaagcccccgccccccacctccagtcacccca tcaccacccctgcccaatcaaaggcccaaggagcaagggggacctgcacgtacaaacacc agcctgagcagggactgggttgttggcctctgctggcaggtggatgtccaggtcccccct ccaaggtccgcagattcaggaaaatgtgctgtagccctttggaagatgggggtaggagca ggacagggagcaatgctgcaaaatctgcatttttcaaccagcaccttagaacaagagtcc ccaccttgtgggccatggactagtgccagtcagtgtcctgttaggaaccaggtgacacag caggacatgagccatgggagagccagtgttattgcctga >gi568815592r:46900225_47129061|GENSCAN_predicted_peptide_4|78_aa MATRDWEGGRQMKKGGRRNSLESSLPKDGSLVSQEVIHPWHARPSRDCGNEAGEARTGAS DPSKECTACENIGHGVHG >gi568815592r:46900225_47129061|GENSCAN_predicted_CDS_4|237_bp atggctaccagagattgggaagggggaagacagatgaagaaaggtgggagaaggaatagc ctggaatcctcgcttcctaaggatgggtccttggtctctcaggaagttattcatccttgg catgctcgtccatccagagattgtggaaatgaggctggggaagcaaggactggagcttct gatccctcaaaagaatgcacagcctgcgaaaatattggccatggagttcatggctga >gi568815592r:46900225_47129061|GENSCAN_predicted_peptide_5|975_aa MCKKLLALSFEARFLQLVVPGLREDYSHTPLRYGFQVSMGYFKINSCGGMKESPVLIVSI FTFMCTQCLAATSDGHQLTQVHSEKSNSDTIQQVTIKTDEMHALLLCFSVLNGASGLSLL QSPVEEYQLLLQVTYRDSKEKRDLRNFLKLLKPPLLWSHGLIRIIRAKATTDCNSLNGVL QCTCEDSYTWFPPSCLDPQNCYLHTAGALPSCECHLNNLSQSVNFCERTKIWGTFKINER FTNDLLNSSSAIYSKYANGIEIQLKKAYERIQGFESVQVTQFRNGSIVAGYEVVGSSSAS ELLSAIEHVAEKAKTALHKLFPLEDGSFRVFGKAQCNDIVFGFGSKDDEYTLPCSSGYRG NITAKCESSGWQVIRETCVLSLLEELNKNFSMIVGNATEAAVSSFVQNLSVIIRQNPSTT VGNLASVVSILSNISSLSLASHFRVSNSTMEDVISIADNILNSASVTNWTVLLREEKYAS SRLLETLENISTLVPPTALPLNFSRKFIDWKGIPVNKSQLKRGYSYQIKMCPQNTSIPIR GRVLIGSDQFQRSLPETIISMASLTLGNILPVSKNGNAQVNGPVISTVIQNYSINEVFLF FSKIESNLSQPHCVFWDFSHLQWNDAGCHLVNETQDIVTCQCTHLTSFSILMSPFVPSTI FPVVKWITYVGLGISIGSLILCLIIEALFWKQIKKSQTSHTRRICMVNIALSLLIADVWF IVGATVDTTVNPSGVCTAAVFFTHFFYLSLFFWMLMLGILLAYRIILVFHHMAQHLMMAV GFCLGYGCPLIISVITIAVTQPSNTYKRKDVCWLNWSNGSKPLLAFVVPALAIVAVNFVV VLLVLTKLWRPTVGERLSRDDKATIIRVGKSLLILTPLLGLTWGFGIGTIVDSQNLAWHV IFALLNAFQLRQLLFNKLSALSSWKQTEKQNSSDLSAKPKFSKPFNPLQNKGHYAFSHTG DSSDNIMLTQFVSNE >gi568815592r:46900225_47129061|GENSCAN_predicted_CDS_5|2928_bp atgtgtaagaaactcctggctctctcctttgaagcccgcttcctgcagctggttgtccct ggacttagagaagattattcacatactcccttgagatacggtttccaagtcagcatgggc tatttcaaaatcaacagctgtggaggcatgaaggagtccccagtgttgattgtttccatc tttacgttcatgtgtacccagtgcttagctgccactagtgatggccatcagttaacccaa gtgcactcagagaagtcaaattctgacacaatccagcaagtaactataaaaactgatgaa atgcacgcattgcttctgtgcttctctgttctcaatggggcttcaggattgagcctgcta caaagcccagtcgaagaatatcagctgctgcttcaggtgacctatagagattccaaggag aaaagagatttgagaaattttctgaagctcttgaagcctccattattatggtcacatggg ctaattagaattatcagagcaaaggctaccacagactgcaacagcctgaatggagtcctg cagtgtacctgtgaagacagctacacctggtttcctccctcatgccttgatccccagaac tgctaccttcacacggctggagcactcccaagctgtgaatgtcatctcaacaacctcagc cagagtgtcaatttctgtgagagaacaaagatttggggcactttcaaaattaatgaaagg tttacaaatgaccttttgaattcatcttctgctatatactccaaatatgcaaatggaatt gaaattcaacttaaaaaagcatatgaaagaattcaaggttttgagtcggttcaggtcacc caatttcgaaatggaagcatcgttgctgggtatgaagttgttggctccagcagtgcatct gaactgctgtcagccattgaacatgttgccgagaaggctaagacagcccttcacaagctg tttccattagaagacggctctttcagagtgttcggaaaagcccagtgtaatgacattgtc tttggatttgggtccaaggatgatgaatataccctgccctgcagcagtggctacagggga aacatcacagccaagtgtgagtcctctgggtggcaggtcatcagggagacttgtgtgctc tctctgcttgaagaactgaacaagaatttcagtatgattgtaggcaatgccactgaggca gctgtgtcatccttcgtgcaaaatctttctgtcatcattcggcaaaacccatcaaccaca gtggggaatctggcttcggtggtgtcgattctgagcaatatttcatctctgtcactggcc agccatttcagggtgtccaattcaacaatggaggatgtcatcagtatagctgacaatatc cttaattcagcctcagtaaccaactggacagtcttactgcgggaagaaaagtatgccagc tcacggttactagagacattagaaaacatcagcactctggtgcctccgacagctcttcct ctgaatttttctcggaaattcattgactggaaagggattccagtgaacaaaagccaactc aaaaggggttacagctatcagattaaaatgtgtccccaaaatacatctattcccatcaga ggccgtgtgttaattgggtcagaccaattccagagatcccttccagaaactattatcagc atggcctcgttgactctggggaacattctacccgtttccaaaaatggaaatgctcaggtc aatggacctgtgatatccacggttattcaaaactattccataaatgaagttttcctattt ttttccaagatagagtcaaacctgagccagcctcattgtgtgttttgggatttcagtcat ttgcagtggaacgatgcaggctgccacctagtgaatgaaactcaagacatcgtgacgtgc caatgtactcacttgacctccttctccatattgatgtcaccttttgtcccctctacaatc ttccccgttgtaaaatggatcacctatgtgggactgggtatctccattggaagtctcatt ttatgcctgatcatcgaggctttgttttggaagcagattaaaaaaagccaaacctctcac acacgtcgtatttgcatggtgaacatagccctgtccctcttgattgctgatgtctggttt attgttggtgccacagtggacaccacggtgaacccttctggagtctgcacagctgctgtg ttctttacacacttcttctacctctctttgttcttctggatgctcatgcttggcatcctg ctggcttaccggatcatcctcgtgttccatcacatggcccagcatttgatgatggctgtt ggattttgcctgggttatgggtgccctctcattatatctgtcattaccattgctgtcacg caacctagcaatacctacaaaaggaaagatgtgtgttggcttaactggtccaatggaagc aaaccactcctggcttttgttgtccctgcactggctattgtggctgtgaacttcgttgtg gtgctgctagttctcacaaagctctggaggccgactgttggggaaagactgagtcgggat gacaaggccaccatcatccgcgtggggaagagcctcctcattctgacccctctgctaggg ctcacctggggctttggaataggaacaatagtggacagccagaatctggcttggcatgtt atttttgctttactcaatgcattccagctgcgacaacttctgttcaacaagttgtctgcc ttaagttcttggaagcaaacagaaaagcaaaactcatcagatttatctgccaaacccaaa ttctcaaagcctttcaacccactgcaaaacaaaggccattatgcattttctcatactgga gattcctccgacaacatcatgctaactcagtttgtctcaaatgaataa >gi568815592r:46900225_47129061|GENSCAN_predicted_peptide_6|159_aa MYISSLAFLACYVHGHKGFLSPQKARKQMDTDADSCKLNQQAQKRFSVTSIGSHGLQLSR LLLKAQQNVEMYLGGELNAHAFDGNSDSPSSHTNTPASPGVLAPNTFTQSPLRQPMPRVV TAIPNGSREEAKTPGIFGLRNYPPCTDRTGKDSPLKAAF >gi568815592r:46900225_47129061|GENSCAN_predicted_CDS_6|480_bp atgtacattagctccctggcatttctggcttgctatgtgcatggccacaaagggtttctc tctccacaaaaagccagaaaacaaatggacacagacgctgacagttgcaaactaaaccaa caagctcagaaacgcttttctgttacctccattggtagtcacggcttgcaattgtcaagg ttactcctaaaggcccagcaaaatgttgaaatgtatcttgggggggaactaaatgcccat gcttttgatggcaattcagactccccatcctcccacaccaacacccctgccagccctggt gtactggctccaaacactttcacacagtcccctttgagacagcccatgcctcgagttgtc acagccatccccaacggctcaagagaagaagccaagacaccaggtatctttggtctccgt aattacccgccgtgcactgacaggacaggaaaggattcgccattaaaagccgcgttttaa >gi568815592r:46900225_47129061|GENSCAN_predicted_peptide_7|56_aa ELLTNLTGQWWLKKFSKGDKSLEDEDHSGRPLKVDNNQLRATIEADPLTTAGEVAK >gi568815592r:46900225_47129061|GENSCAN_predicted_CDS_7|171_bp gaactgttaaccaacttaacaggacagtggtggctcaagaagtttagcaaaggagacaag agccttgaagatgaggatcatagtggccggccattgaaagttgacaacaaccagttgaga gcaaccatcgaagctgatcctcttacaactgcaggagaagttgccaaataa