GENSCAN 1.0	Date run: 3-Nov-116	Time: 05:04:14

Sequence gi568815578r:47110764_47410206 : 299443 bp : 46.94% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   8412   8519  108  1  0   29   22   128 0.182   0.56
 1.02 Intr +  15140  15230   91  0  1   60   59    97 0.098   3.05
 1.03 Intr +  26454  26484   31  0  1   59  103    17 0.269  -1.57
 1.04 Intr +  32296  32385   90  0  0  126   68   118 0.930  13.79
 1.05 Term +  42494  42532   39  1  0   79   43    65 0.112  -1.71
 1.06 PlyA +  43275  43280    6                               1.05

 2.04 PlyA -  44050  44045    6                               1.05
 2.03 Term -  50289  50175  115  2  1   41   39   144 0.539   2.84
 2.02 Intr -  50365  50314   52  2  1   62   37    34 0.383  -6.33
 2.01 Init -  50908  50560  349  1  1   63   80   407 0.831  34.75
 2.00 Prom -  51043  51004   40                              -1.16

 3.00 Prom +  52231  52270   40                              -4.56
 3.01 Init +  55921  56002   82  0  1   56   90    75 0.751   3.85
 3.02 Intr +  57751  57842   92  2  2  100   92   -20 0.924  -0.69
 3.03 Intr +  58376  58434   59  1  2  102   81    90 0.984   7.38
 3.04 Intr +  61944  62104  161  0  2  141   89   310 0.896  36.03
 3.05 Intr +  69035  69206  172  0  1  118  -16   126 0.928   4.20
 3.06 Intr +  70052  70173  122  2  2  106   91   117 0.996  13.94
 3.07 Intr +  72528  72628  101  1  2   91   75   134 0.904  12.23
 3.08 Intr +  85813  85964  152  0  2   70   91    35 0.156   0.96
 3.09 Intr +  95052  95193  142  0  1   51   25   132 0.194   3.56
 3.10 Term +  97859  97966  108  1  0   99   43    51 0.230   0.21
 3.11 PlyA +  98475  98480    6                              -0.45

 4.31 PlyA -  98698  98693    6                              -0.45
 4.30 Term - 100134  99998  137  1  2  106   43   107 0.994   6.18
 4.29 Intr - 101962 101834  129  2  0   80   72    97 0.738   7.97
 4.28 Intr - 105409 105283  127  1  1   90   42    52 0.499   1.05
 4.27 Intr - 110711 110551  161  0  2  105   85   140 0.997  15.11
 4.26 Intr - 113793 113554  240  1  0   77   95   413 0.967  38.42
 4.25 Intr - 116518 116440   79  1  1   90   82    63 0.961   5.02
 4.24 Intr - 119043 118963   81  0  0   99   55    38 0.653   1.43
 4.23 Intr - 125753 125563  191  0  2   61   87   261 0.942  22.60
 4.22 Intr - 128369 127995  375  0  0   40   81   321 0.671  21.59
 4.21 Intr - 135754 135245  510  2  0   41  115   517 0.887  42.64
 4.20 Intr - 138676 138524  153  2  0   61   42   145 0.059   7.24
 4.19 Intr - 151665 151525  141  1  0   56   52    87 0.000   2.22
 4.18 Intr - 166032 165551  482  2  2  142   76   645 0.136  61.38
 4.17 Intr - 169814 169702  113  2  2   59   67    64 0.733   0.58
 4.16 Intr - 170344 170231  114  1  0   84    5    90 0.520   0.94
 4.15 Intr - 172885 172808   78  1  0  101   75    11 0.319   0.85
 4.14 Intr - 176521 176466   56  2  2   88   87    22 0.917   0.80
 4.13 Intr - 179511 179424   88  0  1  102   84    79 0.991   8.44
 4.12 Intr - 181125 181033   93  0  0   88   58    98 0.991   6.96
 4.11 Intr - 184016 183903  114  2  0   41   78   121 0.718   7.04
 4.10 Intr - 199441 199293  149  2  2  115  103    82 0.498  12.35
 4.09 Intr - 200602 200495  108  2  0   79   85    12 0.494   0.26
 4.08 Intr - 206831 206712  120  0  0   85   66    45 0.477   2.47
 4.07 Intr - 208615 208482  134  0  2  102   21    84 0.488   3.39
 4.06 Intr - 220273 220103  171  0  0  103   93    25 0.533   3.56
 4.05 Intr - 227067 226941  127  1  1   16   89   107 0.246   3.34
 4.04 Intr - 237163 237093   71  0  2   90  113    52 0.082   6.73
 4.03 Intr - 237284 237254   31  0  1   68   93    13 0.019  -3.01
 4.02 Intr - 250022 249909  114  0  0   55   71    73 0.037   2.72
 4.01 Init - 258354 258279   76  0  1   56   67    85 0.073   4.47
 4.00 Prom - 272098 272059   40                              -1.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 138659 138524  136  2  1   77   42   130 0.929   7.61
S.002 Intr + 148907 148975   69  0  0   96   63    64 0.871   3.85
S.003 Term + 150786 150856   71  1  2    5   48   182 0.913   4.00
S.004 Intr - 166032 165468  565  2  1  142   99   630 0.804  62.30

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:47110764_47410206|GENSCAN_predicted_peptide_1|119_aa
XFFQRPKIAGDEVSPAKRQEGVNAIVFGVDFDREIGERGEEISIPGLVLKHMCPEVQDVR
VIDTEEKICNASENGFHDTTTSVRIGLMMEEMIFNLADTHLFFNDLEGKNDQYIKPVMS

>gi568815578r:47110764_47410206|GENSCAN_predicted_CDS_1|360_bp
ntgttcttccagcgtccaaagattgctggtgatgaggtgtcacctgctaaacggcaggaa
ggtgtcaacgccattgtatttggggttgactttgacagggaaataggagagagaggggag
gagatctccattccaggattggtgctgaagcacatgtgtcctgaagttcaggacgtcagg
gtaattgacacagaagaaaaaatatgtaatgcttcagaaaatgggtttcatgacaccacg
acgtccgtgcgcattggccttatgatggaagagatgatcttcaaccttgcagatacacat
ctgttcttcaatgacctggaggggaaaaatgaccagtacatcaaaccagtgatgtcatag

>gi568815578r:47110764_47410206|GENSCAN_predicted_peptide_2|171_aa
MGNHGGFCGGFSRGIGGWGHSRGQGPGQGRGAHRGKTKDKEWTSIPKLGHLVKDMKIKFL
EEIYLFFLAIKEFEIIDFFLGASLKDEVLKIMPVQKQTHASQRTRLKGFIVIGDYNGALA
SSRSPCPKAAADGGCTATLGNFAKATFDAISKTYSYPTPNLWKETVFTKSP

>gi568815578r:47110764_47410206|GENSCAN_predicted_CDS_2|516_bp
atggggaaccacggtggcttctgtggaggcttcagcaggggcatcgggggctggggtcat
agccgcggacagggcccgggccaaggccgaggagctcacagaggcaagaccaaggacaag
gagtggacgtctatccccaagctgggccacctggtcaaggacatgaagatcaagttcctg
gaggagatctatctcttcttcctggccatcaaagagtttgagatcattgactttttcctg
ggggcttctctcaaggatgaggttttgaagatcatgccagtacaaaagcagacccacgcc
agccagcgcaccaggctcaagggatttattgtcatcggtgactataatggggcactggca
tcatctcggtccccatgcccaaaagctgctgctgatggcggctgcactgccaccctggga
aacttcgccaaggccacctttgatgctatctctaagacctacagctacccgacccccaac
ctctggaaggagactgtattcaccaagtctccctag

>gi568815578r:47110764_47410206|GENSCAN_predicted_peptide_3|396_aa
MYAVATVHPTLFSVLSCSISLLLTSSGERGVSPRKLSGKQHGIGVCSMASPGDSAWHIDC
DQIHVDDVSSDDNGQDLSTYNFSADGFHSSAPGANLCLGSGVHGGVDWMRKLAFRYRRVK
EMYNTYKNNVGGLIGTPKRETWLQLRAELEALTDLWLTHSLKALNLINSRQVAALIFLCA
TELSRSRQWPNCVNVLVTTTQLIPALAKVLLYGLGSVFPIENIYSATKTGKESCFERIMQ
RFGRKAVYVVIGDGVEEEQGAKKVPLEVGPGKSNNASLPRKDSQGVCFGLRRATGIALMI
LQHRRRYVRPKRSCGPDLSTGCSFAYGHQIASLLDCMKRSEPVPAPVIAKGNQQGTGSFC
QDASLMKAGTRLLDGTMGSSVPGQCLTRCVFADWQE

>gi568815578r:47110764_47410206|GENSCAN_predicted_CDS_3|1191_bp
atgtatgcagtagccactgtgcacccgacactgttctcggtgctctcatgcagcatctcg
cttctcctcacctcatctggagagagaggtgtttccccaaggaaactgagtgggaaacag
catggcattggagtttgttcaatggcaagccctggggatagtgcctggcacatagattgt
gaccagatccacgttgatgacgtctcatcagatgacaatggccaagatttaagcacatac
aacttctccgctgacggcttccacagttcggccccaggagccaacctgtgcctgggctct
ggcgtgcacggcggcgtggactggatgaggaagctggccttccgctaccggcgggtgaag
gagatgtacaatacctacaagaacaacgttggtgggttgataggcactcccaaaagggag
acctggctacagctccgagctgagctggaagctctcacagacctctggctgacccactcc
ctgaaggcactaaacctcatcaactcccggcaagtggcagccctcattttcctctgtgcc
actgaactttctcgcagtagacagtggcccaactgtgtcaatgtgctggtcaccaccact
caactaattcctgccctggccaaagtcctgctatatggcctggggtctgtgtttcctatt
gagaacatctacagtgcaaccaagacagggaaggagagctgcttcgagaggataatgcag
agattcggcagaaaagctgtctacgtggtgatcggtgatggtgtggaagaggagcaagga
gcgaaaaaggtgccacttgaagtgggacccggaaagagcaataacgcaagccttccgaga
aaggattctcaaggcgtctgctttgggctgcgccgggctacagggattgctctgatgata
ctgcagcaccgtcgacgatatgttaggcccaagaggtcatgtgggccagatctctccaca
ggctgcagttttgcctatggccaccagatagcgagcttgctggactgcatgaaaaggtct
gagccagttcctgccccggtcattgccaaaggaaaccagcaaggaactggcagcttctgc
caggatgcaagtttgatgaaggcagggacccgactccttgatggcaccatgggatcctca
gtgcctggacagtgcctgacgcgatgtgtgtttgctgattggcaggaatga

>gi568815578r:47110764_47410206|GENSCAN_predicted_peptide_4|1520_aa
MVALEPSTCAEWSYAAAVGSGRRSEGSRTGYLGMQLVGPLLLLLAWLEASGTGEPFQGGQ
ASAPLDKDEGMSLSLAEEEIKTEQEVVEGMDISTRSKERFSSQETKLSSFQDVRLTFVSE
GNDPAERCSYINGKCHLDASHTDCADPSKPQGLCPYLEGPLLAFLSLLPINFKAQNEDEF
CLLFLLILFYYHLACCWLPSDIVTSASRTTAVPRIYSQSHEINGSSVHCTGDFQLSRAAV
EGPSRLAVDIIKEIKHPQKHKKLPAFQLRPFPLREETIPCNGAGVYRCEPPCLAEVYMCL
NEMLKDEKKGCMNKSGADPGSAERTAQKRKFPSPPHSSNGHSPQDTSTSPIKKKKKPGLL
NSNNKEQKITVAECIETQSKAMTMLTIEQLSYLLKFAIQKMKQPGTDAFQKPVPLEQHPD
YAEYIFHPMDLCTLEKNAKKKMYGCTEAFLADAKWILHNCIIYNGGNHKLTQIAKVVIKI
CEHEMNEIEVCPECYLAACQKRDNWFCEPCKYIFDDCCVPACVVEAGDSNVSAHVKADIL
AGKKDDTQTLMVDGCSLSAVTALQGPPSLTRQTATEGGLQSMKDGVAWVPINNCYLMSKE
IPFSVKKTKSIFNSAMQEMEVYVENIRRKFGVFNYSPFRTPYTPNSQYQMLLDPTNPSAG
TAKIDKQEKVKLNFDMTASPKILMSKPVLSGGTGRRISLSDMPRSPMSTNSSVHTGSDVE
QDAEKKATSSHFSASEESMDFLDKSTASPASTKTGQAGSLSGSPKPFSPQLSAPITTKTD
KTSTTGSILNLNLDRSKAEMDLKELSESVQQQSTPVPLISPKRQIRSRFQLNLDKTIESC
KAQLGINEISEDVYTAVEHSDSEDSEKSDSSDSEYISDDEQKSKNEPEDTEDKEGCQMDK
EPSAVKKKPKPTNPVEIKEELKSTSPASEKADPGAVKDKASPEPEKDFSEKAKPSPHPIK
DKLKGKDETDSPTVHLGLDSDSESELVIDLGEDHSGREGRKNKKEPKEPSPKQDGKTPPS
TTVGSHSPPETPVLTRSSAQTSAAGATATTSTSSTVTVTAPAPAATGSPVKKQRPLLPKE
TAPAVQRVVWNSSSKFQTSSQKWHMQKMQRQQQQQQQQNQQQQPQSSQGTRYQTRQAVKA
VQQKEITQSPSTSTITLVTSTQSSPLVTSSGSMSTLVSSVNADLPIATASADVAADIAKY
TSKMMDAIKGTMTEIYNDLSKNTTGSTIAEIRRLRIEIEKLQWLHQQELSEMKHNLELTM
AEMRQSLEQERDRLIAEVKKQLELEKQQAVDETKKKQWCANCKKEAIFYCCWNTSYCDYP
CQQAHWPEHMKSCTQSATAPQQEADAEVNTETLNKSSQGSSSSTQSAPSETASASKEKET
SAEKSKESGSDSIVHDGYFSSSLLCFGPPCISPCVTIPGRSGLLLCPLQGLRVSKRCDKQ
PAYAPTTTDHQPHPNYPAQKCKYEPVLRKLPGLLPDHSRSNKSSWSSSDEKRGSTRSDHN
TSTSTKSLLPKESRLDTFWD

>gi568815578r:47110764_47410206|GENSCAN_predicted_CDS_4|4563_bp
atggtggctctggagccaagcacctgtgcagaatggagctacgccgccgcagtcggcagt
ggaaggcgctcagaaggcagccgaacaggctatctcgggatgcagcttgtggggccgctc
ctgctgctgctcgcctggctggaggccagcggaacgggcgagccatttcagggaggccag
gcctcagctcccctggataaggatgagggaatgtctttaagcttggctgaagaggaaata
aaaacagaacaggaggtggtagagggcatggatatctctactcgctccaaagagcgtttt
agctctcaggagacaaagctttcgagctttcaagatgtcaggttgacatttgtgtccgag
ggtaatgatcctgctgaaagatgctcctacatcaatgggaagtgccatctcgatgccagc
cacactgactgtgcagatccatcaaagcctcagggcctttgcccataccttgaaggtccc
ctccttgccttcctctccttactccccatcaattttaaggctcaaaatgaggatgagttc
tgccttttgtttttgctcatcctgttttactaccaccttgcctgctgctggctcccttct
gatattgtcacctcggccagcagaacaactgcagttcctcgtatttattcacaatcccac
gaaatcaatggctcttcggtgcactgcaccggagactttcagctcagccgagctgctgtt
gagggccccagcaggcttgctgtcgacatcattaaggaaattaaacacccacaaaaacac
aagaagctcccggcctttcagctgcggccctttcccttgagggaggagacaattccctgt
aatggtgctggggtttataggtgtgagccaccgtgcctggctgaagtgtacatgtgttta
aatgagatgctgaaagatgaaaagaaggggtgcatgaacaagagtggggctgatcctggc
tctgcagagagaacagcccagaaaagaaagttccccagccctccacattcttccaatggc
cactcgccgcaggacacatcaacaagccccattaaaaagaaaaagaaacctggcttactg
aacagtaacaataaggagcagaaaattacagtagcagaatgcatcgagacccagagtaaa
gccatgacaatgctcaccattgaacagttatcctacctgctcaagtttgccattcagaaa
atgaaacagccagggacagatgcattccagaagcccgttccattggaacagcaccctgac
tatgcggaatacatcttccatccaatggacctttgtacattggaaaagaatgcgaaaaag
aaaatgtatggctgcacagaagccttcctggctgatgcaaagtggattttgcacaactgc
atcatttataatgggggaaatcacaaattgacgcaaatagcgaaagtagtcatcaaaatc
tgtgaacatgagatgaatgaaatcgaagtatgtccagaatgttatctagctgcttgccaa
aaacgagataactggttttgtgagccttgtaaatacatatttgatgactgctgcgtgcca
gcctgtgtagtggaagctggtgattcaaatgtttctgcccatgtgaaggctgacatccta
gcggggaagaaggatgatacacaaacccttatggttgatggctgtagtctctcagctgtg
actgctctgcagggacctccatctctcacccgtcagacagcgactgaaggtggcttacag
agcatgaaggacggcgtggcctgggttccaataaataattgctacctcatgtctaaagaa
attcctttttctgtgaaaaagactaagagcatcttcaacagtgccatgcaagagatggag
gtttacgtggagaacatccgcaggaagtttggggtttttaattactctccatttaggaca
ccctacacacccaacagccagtatcaaatgctgctcgatcccaccaaccccagcgccggc
actgccaagatagacaagcaggagaaggtcaagctcaactttgacatgacggcatccccc
aagatcctgatgagcaagcctgtgctgagtgggggcacaggccgccggatttccttgtcg
gatatgccgcgctcccccatgagcacaaactcttctgtgcacacgggctccgacgtggag
caggatgctgagaagaaggccacgtcgagccacttcagtgcgagcgaggagtccatggac
ttcctggataagagcacagcttcaccagcctccaccaagacgggacaagcagggagttta
tccggcagcccaaagcccttctctcctcaactgtcagctcctatcacgacgaaaacggac
aaaacctccaccaccggcagcatcctgaatcttaacctggatcgaagcaaagctgagatg
gatttgaaggagctgagcgagtcggtccagcaacagtccacccctgttcctctcatctct
cccaagcgccagattcgtagcaggttccagctgaatcttgacaagaccatagagagttgc
aaagcacaattaggcataaatgaaatctcggaagatgtctatacggccgtagagcacagc
gattcggaggattctgagaagtcagatagtagcgatagtgagtatatcagtgatgatgag
cagaagtctaagaacgagccagaagacacagaggacaaagaaggttgtcagatggacaaa
gagccatctgctgttaaaaaaaagcccaagcctacaaacccagtggagattaaagaggag
ctgaaaagcacgtcaccagccagcgagaaggcagaccctggagcagtcaaggacaaggcc
agccctgagcctgagaaggacttttccgaaaaggcaaaaccttcacctcaccccataaag
gataaactgaagggaaaagatgagacggattccccaacagtccatttgggcctggactct
gattcagagagcgaacttgtcatagatttaggagaagaccattctgggcgggagggtcga
aaaaataagaaggaacccaaagaaccatctcccaaacaggatggtaaaactccaccatcc
acgacggtgggcagccattctcccccggaaacaccggtgctcacccgctcttccgcccaa
acttccgcggctggcgccacagccaccaccagcacgtcctccacggtcaccgtcacggcc
ccggcccccgccgccacaggaagcccagtgaaaaagcagaggccgcttttaccgaaggag
actgccccggccgtgcagcgggtcgtgtggaactcatcaagtaagtttcaaacgtcctcc
caaaagtggcacatgcagaagatgcagcgtcagcagcagcagcagcagcagcaaaaccag
cagcagcagcctcagtcttcccaggggacgagatatcagaccagacaggctgtgaaagct
gtccagcagaaggagatcacacagagcccatccacgtccaccatcaccctggtgaccagc
acacagtcatcgcccctggtcaccagctcggggtccatgagcacccttgtgtcctcagtc
aacgctgacctgcccatcgccactgcctcagctgatgtcgccgctgatattgccaagtac
actagcaaaatgatggatgcaataaaaggaacaatgacagaaatatacaacgatctttct
aaaaacactactggaagcacaatagctgagattcgcaggctgaggatcgagatagagaag
ctccagtggctgcaccagcaagagctctccgaaatgaaacacaacttagagctgaccatg
gcggagatgcggcagagcctggagcaggagcgggaccggctcatcgccgaggtgaagaag
cagctggagttggagaagcagcaggcggtggatgagaccaagaagaagcagtggtgcgcc
aactgcaagaaggaggccatcttttactgctgttggaacaccagctactgtgactacccc
tgccagcaagcccactggcctgagcacatgaagtcctgcacccagtcagctactgctcct
cagcaggaagcggatgctgaggtgaacacagaaacactaaataagtcctcccaggggagc
tcctcgagcacacaatcagcaccttcagaaacggccagcgcctccaaagagaaggagacg
tcagctgagaaaagcaaggagagtggctcggattccatcgtccatgatggctatttctca
agttctctgctctgttttggacctccctgcatctctccctgcgtcaccatccctgggcgc
tcagggctgctcctctgccctctgcagggcctccgtgttagcaaaaggtgtgacaagcaa
cctgcctatgccccaaccaccacagaccaccagccgcaccccaactaccccgcccagaag
tgtaagtatgaacctgtcttacgaaagctgccggggcttcttcctgaccattcccggagt
aataaatccagttggagcagcagtgatgagaagaggggatcgacacgttccgatcacaac
accagtaccagcacgaagagcctcctcccgaaagagtctcggctggacaccttctgggac
tag