GENSCAN 1.0    Date run: 3-Jul-120    Time: 20:46:16

Sequence gi568815596f:203606697_203834909 : 228213 bp : 41.69% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  15251  15331   81  1  0  105   98    58 0.537   9.63
 1.02 Term +  16608  16685   78  2  0   96   35    97 0.959   1.98
 1.03 PlyA +  17041  17046    6                                 1.05

 2.05 PlyA -  17158  17153    6                                 1.05
 2.04 Term -  19019  18896  124  1  1   96   44    68 0.215   0.08
 2.03 Intr -  20742  20715   28  1  1  101  108    -8 0.205  -1.24
 2.02 Intr -  28521  28238  284  2  2   31  100   315 0.462  23.14
 2.01 Init -  29160  28535  626  0  2   18   66   547 0.223  40.36
 2.00 Prom -  32934  32895   40                                -5.25

 3.07 PlyA -  32943  32938    6                                 1.05
 3.06 Term -  35483  35167  317  0  2  -49   43   359 0.221  11.82
 3.05 Intr -  39876  39682  195  1  0   71   92    93 0.332   6.46
 3.04 Intr -  44328  44151  178  0  1  111   72   119 0.438  11.17
 3.03 Intr -  61558  61444  115  0  1   73   92    34 0.008   1.83
 3.02 Intr -  67631  67511  121  0  1   92   34    59 0.001  -0.47
 3.01 Init -  75649  75571   79  1  1   29  119    82 0.734   6.67
 3.00 Prom -  86590  86551   40                                -3.15

 4.00 Prom +  93268  93307   40                                -3.55
 4.01 Init +  99828  99921   94  2  1   76   96   111 0.705  10.10
 4.02 Intr +  99935 100052  118  0  1   41   97   107 0.694   5.50
 4.03 Term + 109031 109214  184  2  1  108   43    77 0.645   1.23
 4.04 PlyA + 109495 109500    6                                 1.05

 5.02 PlyA - 109790 109785    6                                 1.05
 5.01 Sngl - 110881 110564  318  1  0   38   48   227 0.913   9.42
 5.00 Prom - 115071 115032   40                                -4.85

 6.00 Prom + 116800 116839   40                                -6.25
 6.01 Init + 116987 117060   74  1  2   76   72    47 0.249   2.49
 6.02 Intr + 118152 118198   47  0  2   66   58    42 0.662  -3.87
 6.03 Intr + 119937 120293  357  1  0  103   87   201 0.897  15.80
 6.04 Intr + 122952 123076  125  1  2  113   86     6 0.912   2.28
 6.05 Term + 128088 128216  129  2  0  131   47    85 0.871   5.90
 6.06 PlyA + 128957 128962    6                                 1.05

 7.07 PlyA - 132108 132103    6                                 1.05
 7.06 Term - 139773 139697   77  1  2  105   43    97 0.439   3.92
 7.05 Intr - 154964 154855  110  1  2   66   99    64 0.059   4.31
 7.04 Intr - 157980 157638  343  1  1  -27  -16   395 0.023  11.16
 7.03 Intr - 159004 158185  820  1  1   56   61   936 0.586  77.76
 7.02 Intr - 166157 166064   94  1  1  -34  108    75 0.016  -3.65
 7.01 Init - 166615 166242  374  1  2   80   -5   668 0.039  53.18
 7.00 Prom - 168965 168926   40                                -4.35

 8.08 PlyA - 170370 170365    6                                 1.05
 8.07 Term - 178908 178760  149  1  2   54   32   187 0.007   6.78
 8.06 Intr - 182040 181930  111  1  0  113   95   -15 0.001   1.13
 8.05 Intr - 203788 203633  156  2  0  115   26    95 0.074   5.06
 8.04 Intr - 206026 205876  151  1  1   77   97     0 0.116  -1.39
 8.03 Intr - 206636 206521  116  0  2   46   68    96 0.421   2.65
 8.02 Intr - 212908 212722  187  1  1   71   35   107 0.292   2.04
 8.01 Init - 215827 215603  225  1  0   56   44   193 0.261  10.22
 8.00 Prom - 215883 215844   40                                -2.45

 9.03 PlyA - 216122 216117    6                                 1.05
 9.02 Term - 224018 223625  394  1  1   16   42   205 0.408   2.22
 9.01 Intr - 227094 227009   86  0  2   58   98    73 0.360   3.00

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  61495  61589   95  0  2   75   84    90 0.851   7.20
S.002 Sngl - 166615 166238  378  1  0   80   33   661 0.947  55.71

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:203606697_203834909|GENSCAN_predicted_peptide_1|52_aa
MANSQPEEGMLRLQSLLLHAQLDQETGEEAALVMSVAEEELSHTSLRISFME

>gi568815596f:203606697_203834909|GENSCAN_predicted_CDS_1|159_bp
atggccaattctcagcctgaggaggggatgctgaggctccagagcctgttgcttcatgct
cagcttgaccaagaaactggggaagaggctgctttagtgatgagtgtggcagaggaggag
ctgtctcatactagcctgcggatttcctttatggagtaa

>gi568815596f:203606697_203834909|GENSCAN_predicted_peptide_2|353_aa
MQVHKEIIDAVPNAIPGITDIELEIYGMEGIPEKDMDEGRQVLEQKTQESQKKKEQDDSD
EYDDDDSAGSTSFHPQPVQLQQGYIPPVAQPGLPPVPGAPGMPLGIAPLMPGVPPLMPGM
PPVMPAMPPGLHHQRKCTQSFFGENIMMPMGGIYHLDQEYTTWTRNTPPGPGIPPLMPGM
SPGIPSTPRPPFHILESSSDPNTGCFSTRPPAPTATVPAPQPPVTKPLFSSAGQAQAAVQ
GPVGTDFKPLNSTPEATTASPKPMFSAYTQSADSTTSTTNSTAAKPAASITSKPATLVTT
NATTSFMYKLGVNIHLLGSPNSRSAAPYGFHIEPVFCSWLPRTVSQVPSLPSM
>gi568815596f:203606697_203834909|GENSCAN_predicted_CDS_2|1062_bp
atgcaggtgcataaagaaataatagatgctgtaccaaatgcaatacctgggataacagac
atagaattggaaatatatggtatggaaggtattccagaaaaagacatggatgaaggacga
caagttcttgaacagaaaacacaagaaagtcaaaaaaagaaggaacaagatgattctgat
gaatacgatgacgacgactctgcaggctcaacttcatttcacccacaacctgttcaactt
caacaaggttatattcctccagtggcacagccaggactgccgccagtaccaggagcacca
ggaatgcctctaggcatagctccattaatgccaggtgttcctcctctgatgccaggaatg
ccaccagttatgccagccatgccacctggattgcatcatcaaagaaaatgcacccagtca
tttttcggtgaaaacataatgatgccaatgggtggaatataccacctggaccaggaatac
accacctggaccaggaatacaccacctggaccaggaataccacctctgatgcctggaatg
tcaccaggtatcccctccactccccgacccccattccacatcctagaatcctccagtgac
ccaaacacaggctgtttcagcaccagaccacctgcaccaacagcaactgtacctgctcca
cagcctccagttacgaagcctcttttctccagtgctggacaggctcaggcagctgtccaa
ggacctgttggtacagatttcaaacccttaaatagtacccctgaagcaactacagcatcc
ccaaagcctatgttctctgcttatacacagtctgcagattcaaccactagtacaacaaat
agtactgcagctaaaccagcagcttcaataacaagtaagcctgctacacttgtaacaacc
aatgcaaccaccagttttatgtacaaactaggggtcaatatccacttgcttggctctcca
aattcccgctcagctgcgccttatggatttcatattgaaccagtcttctgctcctggtta
cccaggactgtgtctcaggtgcccagtctgcccagtatgtga

>gi568815596f:203606697_203834909|GENSCAN_predicted_peptide_3|334_aa
MRKCGGEKAIPSNALNSLLSKAAEKPGKRGGAAQNPSLRLDVLQIFRNRNLRKTLKELPV
LKQQVLRHTQDQYLVSFNPIKLTLGISHHSCLVRNWIDLGLRTWWRVPFTAAGLQYTAAT
MTVHGEMSITCKQSVGSHPGSKAYKTFIQTLVFWDKSPQCYPQAGHWTPPQEFACKTPGP
ATSPAQSPLLCRHLEQLARPALACSHTLLPPGAECTVAVTTGSMLEYKPAVTVVAKVCSF
TPEASEATDPPGGTNNSRRAAIRAVTLTTKVRSLTSEPARARTHRKEETPNTSEHQKEQT
PDTPPLRTVTLTARVHGFILEVIETKNPPIPDTS

>gi568815596f:203606697_203834909|GENSCAN_predicted_CDS_3|1005_bp
atgagaaaatgtggtggagaaaaggcaatcccctccaacgcccttaattcactcctctcc
aaagcagcagaaaagcctggaaagcgaggaggagcagcacaaaaccccagtttaaggctt
gatgtgcttcagatatttagaaacagaaatttaagaaagaccctgaaagagcttccagtc
ctaaagcaacaagttctgagacacactcaggatcaataccttgtatccttcaatccaatc
aagttgacactcggtattagccatcacagctgtctagtcagaaattggattgatctaggg
ttaaggacttggtggcgtgtcccctttacagctgcaggactccagtacacagcagccacc
atgactgttcatggggagatgagtatcacgtgtaagcaatcagttggctcccatccagga
agtaaggcttacaaaacatttatccagacactagtcttctgggacaaaagtccacagtgc
tatccacaggcaggacactggactccaccacaggagtttgcttgcaaaacacctggtcca
gccacaagccctgcacagagcccactcttgtgtcggcacttggagcagctggccaggcct
gcactcgcttgctcacacaccctcctcccaccaggggctgagtgcacagttgcagtgacc
acgggatccatgctggagtacaagccagctgtaacagtcgtcgcgaaggtctgcagcttc
actcctgaagccagcgaggccacagacccaccgggaggaacgaacaactccagacgcgcc
gccataagagctgtaacactcaccacaaaggtccgcagcttgacttctgagccagcgaga
gcacgaacccaccggaaagaagaaactccgaacacatccgaacatcagaaggaacaaact
ccggacacgccgcctttaagaactgtaacactcaccgcgagggtccacggcttcattctt
gaagtcattgagaccaagaacccaccaattccggacacatcctga

>gi568815596f:203606697_203834909|GENSCAN_predicted_peptide_4|131_aa
MPCGLSALIMCPKGMVAVVVAVDDGDSQALAVPLTLRVPRGGGAGTLAHRQDKDAQAALG
SQLIPFNSSNRMNKFPLNLPHFVNGASTHTLTQCVNSGSVSIPVHHQSVKLSTNALFDPF
IPAATSTAQAL

>gi568815596f:203606697_203834909|GENSCAN_predicted_CDS_4|396_bp
atgccttgtggtttgagtgccttgatcatgtgccctaaggggatggtggcggtggtggtg
gccgtggatgacggagactctcaggccttggcagttcccctcacacttcgggttcctcgg
ggaggaggggctggaaccctagcccatcgtcaggacaaagatgctcaggctgctcttggc
tctcaacttattcccttcaattcaagtaacaggatgaacaaatttcctcttaacttacct
cattttgtcaatggtgcctccactcatacccttacccagtgcgtgaactctgggtcggtt
tccattcctgttcaccatcagtctgtaaaactgagcaccaatgccttgtttgaccctttc
ataccagctgccacctccacagctcaggccttatga

>gi568815596f:203606697_203834909|GENSCAN_predicted_peptide_5|105_aa
MAHTKLGKSRKVIYERTVGKGVARAEPRGVVPEPVTCSSRIMTTLRSKETRGQRSYQNQK
ARELCSGGHHDRSNDFHQEKQTKQPALAKIVYITTISYTKHINCS

>gi568815596f:203606697_203834909|GENSCAN_predicted_CDS_5|318_bp
atggcacacacaaaattgggcaagagtaggaaggttatttatgaaaggactgttggcaaa
ggtgtggcaagagcagaaccacgaggggtagtgccagagcctgtgacttgcagcagcaga
attatgaccacccttaggtcgaaagagaccaggggacagcgaagttaccagaaccagaag
gcaagagagttgtgtagtggaggccaccatgacagaagcaatgacttccatcaagagaag
cagaccaagcaaccagctttggccaagattgtgtatatcacaactatcagttataccaag
cacattaactgttcttga

>gi568815596f:203606697_203834909|GENSCAN_predicted_peptide_6|243_aa
MMRYHFTSTRMAIIEKMNNNKCRQGVGIRDVASNKLKRNARNKILVKQSPMLVAYDNAVN
LSCKYSYNLFSREFRASLHKGLDSAVEVCVVYGNYSQQLQVYSKTGFNCDGKLGNESVTF
YLQNLYVNQTDIYFCKIEVMYPPPYLDNEKSNGTIIHVKGKHLCPSPLFPGPSKPFWVLV
VVGGVLACYSLLVTVAFIIFWVRSKRSRLLHSDYMNMTPRRPGPTRKHYQPYAPPRDFAA
YRS

>gi568815596f:203606697_203834909|GENSCAN_predicted_CDS_6|732_bp
atgatgagatatcacttcacgtccaccaggatggctataatagaaaagatgaacaataac
aagtgtcggcaaggtgtgggtatacgggatgttgcttcaaataaacttaaaagaaatgca
agaaacaagattttggtgaagcagtcgcccatgcttgtagcgtacgacaatgcggtcaac
cttagctgcaagtattcctacaatctcttctcaagggagttccgggcatcccttcacaaa
ggactggatagtgctgtggaagtctgtgttgtatatgggaattactcccagcagcttcag
gtttactcaaaaacggggttcaactgtgatgggaaattgggcaatgaatcagtgacattc
tacctccagaatttgtatgttaaccaaacagatatttacttctgcaaaattgaagttatg
tatcctcctccttacctagacaatgagaagagcaatggaaccattatccatgtgaaaggg
aaacacctttgtccaagtcccctatttcccggaccttctaagcccttttgggtgctggtg
gtggttggtggagtcctggcttgctatagcttgctagtaacagtggcctttattattttc
tgggtgaggagtaagaggagcaggctcctgcacagtgactacatgaacatgactccccgc
cgccccgggcccacccgcaagcattaccagccctatgccccaccacgcgacttcgcagcc
tatcgctcctga

>gi568815596f:203606697_203834909|GENSCAN_predicted_peptide_7|605_aa
MNDEGSPIKVTLATLKMSVQPTASLGGFEITPPVVLQLKCGSGPVHISRQCLVAVEEDAE
SEDEEEEDVNLLSISGKRSASGGGSKVPQKKGKLAADEDDDDDDDDDDDENDNDFDDEEA
EEKHESKGQESFKKQEKTPKTPKGPSSVEDIIAKMQIRKLNRRSETPSQNNQQQSLQPSL
ICPVTLPPAGTVCASPLVSPPTWFCPGAQLPRPLVRSVASVYAGPGSSGSRISMSRSTSF
QGGLGSRSMAAGMAGGLAGMGGIQNEKETMQSLNDRLASYLDRVRSLEMGNWKLESKIWE
KLESKIWEHLEKKGPQVRDWGHYFKTIEEDLTQIFTSTVDNTCIILQIDNAHLAADDFRV
KYETEPATCQSVENDIHGLHKVIDDTSVTQLQLETEIEALKEELLFLKKNHEEEVKGLQA
QIASSVDHGGLGDRPGLDEKSGGQLGEQPEGGVEACYTLQMEQLNGILLHLESELAQTWA
EGQGQAQEYQALPNIKVKLEAEIATYRRLLEDGEDFNFGDALDSSNSTQTTQKTPTCQQW
MAKWEGMKIESHNLICQAYVGTNCVSKESTQRYQLRSSRAACSQANSSPGRKEAISDAAC
RHNIL

>gi568815596f:203606697_203834909|GENSCAN_predicted_CDS_7|1818_bp
atgaatgatgaaggcagtccaattaaagtcacactggcaactttgaaaatgtctgtacag
ccaactgcttcccttgggggctttgaaatcacaccaccagtggtcttacagttgaagtgt
ggttcagggccagtgcatattagtagacagtgcttagtagctgtggaggaagatgcagag
tcagaagatgaagaggaggaggatgtgaacctcttaagcatatctggaaagcggtctgcc
tctggaggtggtagcaaggttccacagaaaaaaggaaaacttgctgctgatgaagatgat
gatgatgatgatgatgatgatgatgatgaaaatgataatgattttgatgatgaggaagct
gaagaaaagcacgaatcaaaaggacaagaatccttcaaaaaacaagaaaaaactcccaaa
acaccaaaaggacctagttctgtagaagacattatagcaaaaatgcaaataaggaaactg
aaccgtaggagtgaaactccatctcaaaacaaccaacaacagagccttcagccaagcctg
atctgtcctgtcactctaccccctgccggcacagtatgcgcttcaccacttgtttcccca
ccaacttggttctgtccaggcgcccagctaccgcgcccactggtcagaagtgtggccagc
gtctatgcaggccctgggagctcaggttcccggatctccatgtcccgctccaccagcttc
cagggtggcttggggtccaggagcatggccgcaggtatggccgggggtctggcaggaatg
ggaggcatccagaacgagaaggagaccatgcaaagcctgaatgaccgcctggcctcctac
ctggacagagtgaggagcctggagatgggaaactggaagctggagagcaaaatctgggag
aagctggagagcaaaatctgggagcacctggagaagaagggaccccaggtcagagactgg
ggccactacttcaagaccatcgaggaggacctgactcagatcttcacaagtactgtggac
aatacctgcatcattctgcagatcgacaatgcccatcttgctgctgatgactttagagtc
aagtatgagacagagccggccacgtgccagtctgtggagaacgacatccatgggctccac
aaggtcattgatgacaccagtgtcactcagctgcagctagagacagagatcgaggctctc
aaggaggagctgctcttcctgaagaagaaccacgaagaggaagtaaaaggcctacaagcc
cagattgccagctctgttgaccatggaggccttggagatcgacctggactggatgaaaaa
tctggaggccagcttggagaacagcctgaagggggggtggaggcctgctacaccctgcag
atggaacaactcaatgggatcctgctgcacctggagtcagagctggcacagacctgggca
gagggacagggccaggcccaggagtaccaggccctgccaaacatcaaggtcaagctggag
gctgagatcgccacctaccgccgcctgctggaagatggcgaggacttcaattttggtgat
gctctggacagcagcaactccacgcaaaccacccaaaagactcccacctgccaacagtgg
atggcaaagtgggagggaatgaagattgaaagtcataatctaatctgccaagcatatgtg
ggcaccaactgtgtttctaaggaatctacacaaaggtatcagctcagaagctccagagca
gcatgttctcaggccaattcctcccctggtcgcaaagaggctatatcagatgctgcttgc
agacataacattctatag

>gi568815596f:203606697_203834909|GENSCAN_predicted_peptide_8|364_aa
MGTGSRIHQSEAPTLESESDASDTKKQRLRPFLEVAAVAARLLFHGSSGCGAVPSLHLEQ
TFYCDSNRIQHLDRPLPTSTQGPATKIFWQNSPKTYLVNNDVFLRFPSPVFAQKVTPQAP
PHWYKRTTSRLDMIWRKGIYTKDLKAWAQTDTGTPTFLAAYPQEPKAGNNPNVYRQNLDL
VATAMSTAKNLFPRLPADRGGHMTISCPARCKLPVLEWGFWDSSLKGLIFQWALVDWISS
SPPTLCLWPFSSHQSHEQLSTHEHPRADQPWLMGIGYTALWSMCNMMFFKEINAFLFPIM
AGPFFRPILSLKINEVPSTSQGLRSAGGRHLDWQAAPPAAPVRDPLGEASWAPESGGEVE
NLYV

>gi568815596f:203606697_203834909|GENSCAN_predicted_CDS_8|1095_bp
atgggcacaggatccaggatccaccaatcagaagcacccactctggagtctgaatcagat
gctagtgacacaaagaaacagcgactcagaccctttctagaggtggcggcagtggcagca
agattgcttttccacggcagcagtggctgcggtgctgtccccagcctgcacctggagcag
acattctactgtgatagcaacagaatacagcatctggacaggccactgcccacgtctaca
cagggccccgcaaccaagatattctggcagaactctccgaaaacttatttagtcaataac
gatgtatttttaaggtttccaagcccagtttttgctcagaaagtaaccccacaggctcct
ccacactggtacaagaggaccacctcaagattggacatgatctggagaaagggtatatac
acaaaagacttgaaagcatgggctcaaacagatactggtacaccaacattcctggcagcg
tatccacaagagccaaaagctggaaataacccgaatgtctaccgacagaaccttgatttg
gtggcgacagcaatgagcacggctaaaaatctgtttcccaggctccctgcagataggggt
ggccacatgactataagctgtccagcgagatgtaagcttccagtgcttgagtggggtttc
tgggacagttcattaaaaggccttatctttcagtgggcactggtggactggatttccagc
agcccaccaactctttgtctgtggcctttctctagccaccagagtcatgagcagctctcc
acccatgagcaccccagagcagaccaaccatggctgatgggaattgggtacactgccctt
tggtcaatgtgcaacatgatgttcttcaaagagattaatgcttttctatttccaataatg
gcaggcccgtttttcagaccaatcctttcactgaaaataaatgaagtcccatcgaccagc
caagggctgaggagtgccggcggacgacacctggactggcaggcagctccacctgcagcc
ccggtgcgggatccactgggtgaagccagctgggctcctgaatctggtggggaagtggag
aacctttatgtctag

>gi568815596f:203606697_203834909|GENSCAN_predicted_peptide_9|159_aa
LMWFLSSAWTLADTGRYCSSETNGDKGGRIPDVSHNGIPTIEMLWDSEAWAWSGHVTSYN
WEMTIVLTGSFSPHDREWMSQHPPLLVLPSGPPGNEMSPPTPNPKPMRGAAFLALFIPNS
ALWKQDCWLAKKPEITFPLLVEANAERHLERLLSSRFST

>gi568815596f:203606697_203834909|GENSCAN_predicted_CDS_9|480_bp
ttgatgtggtttctgtcttctgcctggaccctagcagatacaggtcgctactgcagctct
gaaacaaacggggataaaggtggcagaataccagatgtcagtcataatggaatccccacc
atcgaaatgctctgggacagtgaagcgtgggcatggtcagggcatgtgacctcatacaac
tgggaaatgactattgttctcaccgggtcattcagtccccatgatcgtgaatggatgagc
cagcaccctcctctcctagtgttgccgagtgggcctccaggaaatgaaatgtctccacca
acaccaaaccccaaaccaatgagaggagcagcatttctggccttattcatacccaactca
gctctgtggaaacaagactgctggcttgccaagaagccagaaataacatttcctctactg
gttgaggcaaatgctgagagacaccttgagaggttactgagttcacgcttttctacctag