GENSCAN 1.0 Date run: 6-Nov-116 Time: 20:00:32 Sequence gi568815578f:43414545_43641125 : 226581 bp : 46.38% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12806 12906 101 1 2 90 53 64 0.161 2.01 1.02 Intr + 15197 15290 94 2 1 79 113 -19 0.072 -0.33 1.03 Term + 26167 26289 123 0 0 86 54 154 0.948 10.18 1.04 PlyA + 29025 29030 6 1.05 2.00 Prom + 36474 36513 40 -6.46 2.01 Init + 43490 43596 107 1 2 91 92 141 0.959 14.49 2.02 Intr + 43817 43965 149 2 2 90 73 368 0.929 35.38 2.03 Intr + 45227 45351 125 0 2 39 95 56 0.995 1.70 2.04 Term + 45489 45701 213 2 0 98 32 178 0.324 10.43 2.05 PlyA + 45741 45746 6 1.05 3.05 PlyA - 45863 45858 6 -1.75 3.04 Term - 46077 45950 128 1 2 32 45 164 0.316 4.94 3.03 Intr - 46325 46125 201 0 0 92 27 204 0.169 13.96 3.02 Intr - 72006 71981 26 2 2 78 76 28 0.063 -1.73 3.01 Init - 79498 79437 62 1 2 45 103 73 0.545 4.32 3.00 Prom - 80068 80029 40 -2.06 4.00 Prom + 80785 80824 40 -2.86 4.01 Init + 98942 99095 154 1 1 99 109 67 0.277 10.30 4.02 Intr + 99294 99517 224 1 2 66 60 153 0.786 8.15 4.03 Intr + 100091 100232 142 1 1 29 57 163 0.806 7.23 4.04 Intr + 100465 100615 151 2 1 57 91 107 0.997 7.12 4.05 Intr + 100748 100871 124 2 1 67 58 152 0.996 10.69 4.06 Intr + 101549 101633 85 1 1 12 68 64 0.089 -3.81 4.07 Intr + 114113 114201 89 0 2 91 86 95 0.983 9.29 4.08 Intr + 114720 114824 105 2 0 48 83 166 0.827 12.51 4.09 Intr + 115740 115875 136 2 1 47 116 169 0.998 15.74 4.10 Intr + 116254 116345 92 2 2 -36 94 123 0.974 0.21 4.11 Intr + 118229 118380 152 1 2 99 27 293 0.797 23.26 4.12 Intr + 119721 119850 130 0 1 52 34 226 0.402 14.30 4.13 Intr + 120284 120398 115 1 1 76 74 105 0.998 7.92 4.14 Intr + 121293 121392 100 1 1 116 117 -44 0.598 0.47 4.15 Intr + 121553 121750 198 2 0 102 72 134 0.833 11.67 4.16 Intr + 121865 121914 50 2 2 75 89 52 0.990 2.32 4.17 Intr + 125607 125764 158 1 2 91 78 141 0.976 13.13 4.18 Intr + 126209 126271 63 1 0 61 94 67 0.842 3.51 4.19 Intr + 129993 130067 75 2 0 55 72 55 0.468 0.31 4.20 Intr + 133567 133697 131 0 2 77 -4 202 0.277 9.39 4.21 Intr + 137170 137259 90 1 0 36 34 125 0.141 1.01 4.22 Intr + 152532 152573 42 0 0 63 87 69 0.113 1.66 4.23 Intr + 153121 153178 58 1 1 116 113 177 0.774 21.89 4.24 Intr + 153372 153455 84 2 0 76 66 118 0.826 8.42 4.25 Intr + 154841 154972 132 1 0 82 90 294 0.878 29.84 4.26 Intr + 156073 156185 113 0 2 120 94 199 0.980 22.88 4.27 Intr + 156480 156516 37 0 1 102 77 28 0.956 1.36 4.28 Intr + 157507 157593 87 0 0 85 99 99 0.985 10.87 4.29 Intr + 160365 160460 96 2 0 65 105 121 0.999 11.71 4.30 Intr + 161680 161835 156 0 0 103 28 217 0.905 17.41 4.31 Intr + 165428 165517 90 1 0 78 83 48 0.836 3.49 4.32 Intr + 170046 170063 18 2 0 105 70 37 0.503 0.41 4.33 Term + 170308 170472 165 0 0 60 43 182 0.993 8.82 4.34 PlyA + 173101 173106 6 1.05 5.00 Prom + 179463 179502 40 -0.76 5.01 Init + 180155 180273 119 1 2 105 47 153 0.747 12.57 5.02 Intr + 199306 199432 127 1 1 99 87 10 0.193 2.68 5.03 Intr + 204396 204482 87 2 0 72 103 49 0.947 4.97 5.04 Intr + 206313 206381 69 2 0 81 59 44 0.437 0.18 5.05 Term + 209347 209523 177 0 0 75 52 287 0.815 21.49 5.06 PlyA + 210591 210596 6 -1.75 6.00 Prom + 210664 210703 40 -5.56 6.01 Init + 210900 210905 6 2 0 104 101 10 0.098 4.21 6.02 Intr + 222601 222709 109 0 1 132 84 62 0.738 10.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 35407 35552 146 2 2 128 41 86 0.811 5.97 S.002 Intr + 152524 152573 50 0 2 63 87 76 0.833 2.48 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:43414545_43641125|GENSCAN_predicted_peptide_1|105_aa SNLRECQHRHPPLAKAIILDIITLSYSHIQSLTSIALGTITLYFNLYQWSLQYTVPLGFK LPSQKEKFANSWSTPCQALTMHHHTYFSTRLWKVSYYDVHATDEK >gi568815578f:43414545_43641125|GENSCAN_predicted_CDS_1|318_bp tccaatctcagagaatgtcagcatcgccacccaccacttgctaaagccataatcttggat atcattactctctcctactcccacatccagtcgctcaccagcatagcactgggcacaata actctttattttaatctttatcagtggagtctacagtacactgttcccctcggcttcaag ctcccaagccagaaggaaaagtttgccaactcctggtccacaccgtgccaggcactgacg atgcaccaccacacttacttctcaactcgactctggaaggtgtcctactacgatgtccat gctacagatgagaaatga >gi568815578f:43414545_43641125|GENSCAN_predicted_peptide_2|197_aa MPRVYIGRLSYNVREKDIQRFFSGYGRLLEVDLKNGYGFVEFEDSRDADDAVYELNGKEL CGERVIVEHARGPRRDRDGYSYGSRSGGGGYSSRRTSGRDKYGPPVRTEYRLIVENLSSR CSWQDLKDFMRQAGEVTYADAHKERTNEGVIEFRSYSDMKRALDKLDGTEINGRNIRLIE DKPRTSHRRSYSGSRSR >gi568815578f:43414545_43641125|GENSCAN_predicted_CDS_2|594_bp atgccgcgcgtctacataggacgcctgagctacaacgtccgggagaaggacatccagcgc tttttcagtggctatggccgcctcctcgaagtagacctcaaaaatgggtacggcttcgtg gagttcgaggactcccgcgacgccgacgacgccgtttacgagctgaacggcaaggagctc tgcggcgagcgcgtgatcgtagagcacgcccggggcccgcgtcgcgatcgcgacggctac agctacggaagccgcagtggtggaggtggatacagcagtcggagaacatctggcagagac aaatacggaccacctgttcgtacagaatacaggcttattgtagaaaatctttctagtcgg tgcagttggcaagatttaaaggattttatgcgacaagcaggtgaagtaacctatgcggat gcccacaaggaacgaacaaatgagggtgtaattgagtttcgctcctactctgacatgaag cgtgctttggacaaactggatggcacagaaataaatggcagaaatattaggcttattgaa gataagccacgcacaagccataggcgatcttactctggaagcagatccaggtaa >gi568815578f:43414545_43641125|GENSCAN_predicted_peptide_3|138_aa MRMFLKCVPQKLVAAQEAQLMCCGRDLVGGDRDRDLLRDFSYSSLDLLRECEWEPRSDLG LDLLFDLDFLPFDRERDRPLLRDLELDKERPIFSILMTQHNVIYRERLFEILRDLLRLLL RLLLRDRLLDRDLLGEKQ >gi568815578f:43414545_43641125|GENSCAN_predicted_CDS_3|417_bp atgaggatgtttctgaagtgtgtgccccagaagctcgtggcagctcaggaggctcagctc atgtgttgtgggagggacctggtgggaggggatcgggaccgagacctgcttcgagatttc tcatactcatccttagatctgcttcgagaatgtgaatgggagccccgatcagacttgggc ttagatttgctctttgatctagatttcctgccttttgatcgagaacgtgatcgacctttg ctccgcgacctggaactggacaaggaaagaccgattttctcaatacttatgactcaacac aacgttatttaccgggagcgactttttgagatacttcgagatctactgcggctgctcctg cgactcctacttcgtgaccgtcttctagatcgagacctattaggagaaaaacaatga >gi568815578f:43414545_43641125|GENSCAN_predicted_peptide_4|1213_aa MEGHAGMEGHAEMEMLRTLKGPSTGEVSMHLVAGDSPGSGPHLPATAFIIPASSATLGLP SSALDVSCFPREPIHVGAPEQVAGCEPVSATVLPQLSAGPASSSTSTVRLLEWTEAAAPP PGGGLRFRISEYKPLNMAGVEQPPSPELRQEGVTEYEDGGAPAGDGEAGPQQAEDHPQNP PEDPNQDPPEDDSTCQCQACGPHQAAGPDLGSSNDGCPQLFQERSVIVENSSGSTSASEL LKPMKKRKRREYQSPSEEESEPEAMEKQEEGKDPEGQPTASTPESEEWSSSQPATGEKKE CWSWESYLEEQKAITAPVSLFQDSQAVTHNKNGFKLGMKLEGIDPQHPSMYFILTVAEVC GYRLRLHFDGYSECHDFWVNANSPDIHPAGWFEKTGHKLQPPKGYKEEEFSWSQYLRSTR AQAAPKHLFVSQSHSPPPLGFQVGMKLEAVDRMNPSLVCVASVTDVVDSRFLVHFDNWDD TYDYWQLSPLQRPPHSFLVNMKLEAVDRRNPALIRVASVEDVEDHRIKIHFDGWSHGYDF WIDADHPDIHPAGWCSKTGHPLQPPLGPREPSSASPGGCPPLSYRSLPHTRTSKYSFHHR KCPTPGCDGSGHVTGKFTAHHCLSGCPLAERNQSRLKAELSDSEASARKKNLSGFSPRKK PRHHGRIGRPPKYRKIPQEDFQTLTPDVVHQSLFMSALSAHPDRSLSVCWEQHCKLLPGV AGISASTVAKWTIDEVFGFVQTLTGCEDQARLFKDESQTARPQTLAFALKNLEPGLWQLP WMIDGEAFLLLTQADIVKIMSVKLGPALKIYNAILMFKNADDTLKSSNHGAVRERILSTP QLGTYLTEIGAMIDWANGNINLGPSANPNAQPTDFDFLKVIGKGNYGKVLLAKRKSDGAF YAVKVLQKKSILKKKEQSHIMAERSVLLKNVRHPFLVGLRYSFQTPEKLYFVLDYVNGGE LFFHLQRERRFLEPRARFYAAEVASAIGYLHSLNIIYRDLKPENILLDCQGHVVLTDFGL CKEGVEPEDTTSTFCGTPEYLAPEVLRKEPYDRAVDWWCLGAVLYEMLHGLPPFYSQDVS QMYENILHQPLQIPGGRTVAACDLLQSLLHKDQRQRLGSKADFLEIKNHVFFSPINWDDL YHKRLTPPFNPNVRHEQYLTGPADLKHFDPEFTQEAVSKSIGCTPDTVASSSGASSAFLG FSYAPEDDDILDC >gi568815578f:43414545_43641125|GENSCAN_predicted_CDS_4|3642_bp atggaggggcatgctgggatggaggggcatgctgaaatggagatgctgaggacactgaag gggccttccacaggggaggtcagcatgcacttggtggccggagacagccccggttctggt cctcacctgcccgcaactgccttcatcattccagccagttcggccaccctcggcctgccc agcagtgccctggatgtgtcttgctttccccgggagccaatccatgtgggtgccccggag caagtggccggctgcgaaccagtttctgccaccgtcctgccgcagcttagcgccgggccg gccagctccagcaccagcacagtgcggcttctggaatggacagaggccgcggccccgccc ccagggggcggcctgcggttccggataagcgagtataagccgctgaacatggcgggagtg gagcagcccccgagccccgagctgcggcaggaaggcgtgaccgaatacgaagatggcggg gccccggcgggagatggcgaggcgggcccccaacaggcggaggaccacccccagaatcct ccagaagatcccaatcaggaccccccagaggatgatagcacctgtcagtgccaggcgtgc gggcctcaccaagccgcgggtccagatcttggttcctctaatgatggctgccctcagctg ttccaggagcggtcagtcatagtggagaactcctcaggctctaccagcgcttctgagctc ctcaaacccatgaagaagaggaagcgcagggaataccagagcccatcagaggaggagtcg gagccagaggccatggagaagcaagaagaaggaaaggacccagagggacaacccactgct agcaccccagagagtgaggagtggagcagcagccagcctgcaacaggtgagaagaaggaa tgctggtcgtgggagtcctacctagaggagcagaaggccattactgctccagtcagcctc ttccaggactcccaggcagtcactcacaacaagaatggcttcaaactgggcatgaagttg gaaggcattgaccctcaacacccgtccatgtacttcatcctcaccgtggctgaggtatgt ggctatcgcctacgcctgcactttgatgggtattctgagtgccatgacttctgggtcaat gccaactcccctgacattcaccctgctggctggttcgagaagacgggccacaagctgcag cctcccaaaggttacaaggaggaggagttcagctggagccagtacctgcgcagcacaaga gctcaggctgcccccaagcacctgtttgtgagccagagccacagtcccccacccctgggc ttccaggtgggcatgaagctggaggctgttgaccgcatgaacccgtcccttgtctgcgtg gccagtgtgaccgatgtggtggacagccgcttcctggtgcactttgacaactgggatgat acttatgactactggcagctgtcccctctgcagcgaccccctcacagcttcctggtcaat atgaagctggaggctgtggaccgcaggaacccagccctgattcgcgtggccagcgtggag gatgtggaggaccatcggataaagatccactttgatggctggagtcatggctatgatttc tggatcgacgctgaccacccagacatccaccctgccggctggtgctccaagacaggacat cccctgcagcctcctctcggacccagagagcccagctctgcctcccctgggggctgtccc cctctcagctataggagcctgccccacactaggacctccaaatacagctttcaccaccgg aagtgccccactcctggttgcgacggctctggccatgtcacaggcaagttcacagctcac cattgcctctcaggctgcccactggctgagaggaaccagagccggctgaaagcggagctg tctgactcggaggcctcagcccgcaagaagaacctctcaggcttctccccaaggaagaag cctcgccatcacggccgaattggacgccctccgaagtatcgaaagattccgcaggaagat ttccagaccctcacgcccgatgtcgtgcaccagtccctcttcatgtcagccctgtcggcc caccctgaccgctcactctcagtgtgctgggagcagcactgcaagctcctgccaggagta gcgggcatctcagcctcgacagtcgccaagtggaccatcgatgaggtcttcggctttgtt cagaccctgacaggttgtgaggaccaagcacgcctcttcaaagacgagtctcagactgcc agaccacagactctggcctttgctttgaagaacctggaacctgggctttggcagttaccg tggatgattgacggcgaggccttccttttgctgacacaggcggacattgtgaagatcatg agcgtcaagctgggcccagccttgaagatctataacgccattctcatgttcaaaaacgct gatgacaccttaaaaagctccaaccatggagctgtccgagagcgcatcctctccacacca caactgggaacctacctgaccgagattggtgccatgatagactgggccaatgggaacatc aacctggggccttcagccaacccaaatgcccagcccacggacttcgacttcctcaaagtc atcggcaaagggaactacgggaaggtcctactggccaagcgcaagtctgatggggcgttc tatgcagtgaaggtactacagaaaaagtccatcttaaagaagaaagagcagagccacatc atggcagagcgcagtgtgcttctgaagaacgtgcggcaccccttcctcgtgggcctgcgc tactccttccagacacctgagaagctctacttcgtgctcgactatgtcaacgggggagag ctcttcttccacctgcagcgggagcgccggttcctggagccccgggccaggttctacgct gctgaggtggccagcgccattggctacctgcactccctcaacatcatttacagggatctg aaaccagagaacattctcttggactgccagggacacgtggtgctgacggattttggcctc tgcaaggaaggtgtagagcctgaagacaccacatccacattctgtggtacccctgagtac ttggcacctgaagtgcttcggaaagagccttatgatcgagcagtggactggtggtgcttg ggggcagtcctctacgagatgctccatggcctgccgcccttctacagccaagatgtatcc cagatgtatgagaacattctgcaccagccgctacagatccccggaggccggacagtggcc gcctgtgacctcctgcaaagccttctccacaaggaccagaggcagcggctgggctccaaa gcagactttcttgagattaagaaccatgtattcttcagccccataaactgggatgacctg taccacaagaggctaactccacccttcaacccaaatgtgcgccacgaacagtacctgaca ggacctgctgacttgaagcattttgacccagagttcacccaggaagctgtgtccaagtcc attggctgtacccctgacactgtggccagcagctctggggcctcaagtgcattcctggga ttttcttatgcgccagaggatgatgacatcttggattgctag >gi568815578f:43414545_43641125|GENSCAN_predicted_peptide_5|192_aa MEKELRSTILFNAYKKEIFTTNNGYKSMQKKLRSNWKIQRALTFVYPFGATLSVMKPAVA VLSTGSVCFPLNRPILAFYHSKNQGGKLAVLGSCHMFSDQYLDKEENSKIMDVVFQWLTT GDIHLNQIDAEDPEISDYMMLPYTATLSKRNRECLQESDEIPRDFTTLFDLSIFQLDTTS FHSVIESVPVGL >gi568815578f:43414545_43641125|GENSCAN_predicted_CDS_5|579_bp atggagaaagagctgcggagcaccattcttttcaatgcctacaaaaaggagatatttacc accaacaatggctacaaatccatgcagaaaaaacttcggagtaattggaagattcagagg gctctcacctttgtgtatccttttggtgccacattgagtgtcatgaaaccagcagtggcg gttctgtctacaggttctgtctgcttcccacttaacagacccattttggctttctatcac tcaaagaaccaaggtgggaagctggcagtgcttggttcatgtcacatgttcagtgatcaa tatttggacaaagaagaaaacagcaaaatcatggatgttgttttccagtggctcacgaca ggagacatccacctaaaccagattgatgctgaggacccagagatttctgactacatgatg ctgccctacacagccaccctatcaaagcggaatcgagagtgtctccaggagagtgatgag atcccaagggactttaccaccctcttcgacctgtccatcttccagctggataccacctcc ttccacagcgtcatcgagtcagtacctgtgggcctctga >gi568815578f:43414545_43641125|GENSCAN_predicted_peptide_6|39_aa MAVFPPSFRELPPPPLELFDLDETFSSEKARLAQITNKX >gi568815578f:43414545_43641125|GENSCAN_predicted_CDS_6|117_bp atggcggtttttcctcccagtttccgggagttaccacctcctcctctggagctatttgat ttagatgaaacgttctcctctgagaaggcacggctggctcagattaccaataagtnn