GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:03:26

Sequence gi568815578f:43466316_43685013 : 218698 bp : 46.92% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -    460    455    6                               1.05
 1.04 Term -  11421  11383   39  0  0  117   46    14 0.104  -2.71
 1.03 Intr -  15210  15107  104  1  2   87   65    50 0.079   2.49
 1.02 Intr -  20235  20210   26  2  2   78   76    28 0.071  -1.73
 1.01 Init -  27727  27666   62  1  2   45  103    73 0.542   4.32
 1.00 Prom -  28297  28258   40                              -2.06

 2.00 Prom +  29014  29053   40                              -2.86
 2.01 Init +  47171  47324  154  1  1   99  109    67 0.277  10.30
 2.02 Intr +  47523  47746  224  1  2   66   60   153 0.786   8.15
 2.03 Intr +  48320  48461  142  1  1   29   57   163 0.806   7.23
 2.04 Intr +  48694  48844  151  2  1   57   91   107 0.997   7.12
 2.05 Intr +  48977  49100  124  2  1   67   58   152 0.996  10.69
 2.06 Intr +  49778  49862   85  1  1   12   68    64 0.089  -3.81
 2.07 Intr +  62342  62430   89  0  2   91   86    95 0.983   9.29
 2.08 Intr +  62949  63053  105  2  0   48   83   166 0.827  12.51
 2.09 Intr +  63969  64104  136  2  1   47  116   169 0.998  15.74
 2.10 Intr +  64483  64574   92  2  2  -36   94   123 0.974   0.21
 2.11 Intr +  66458  66609  152  1  2   99   27   293 0.797  23.26
 2.12 Intr +  67950  68079  130  0  1   52   34   226 0.402  14.30
 2.13 Intr +  68513  68627  115  1  1   76   74   105 0.998   7.92
 2.14 Intr +  69522  69621  100  1  1  116  117   -44 0.598   0.47
 2.15 Intr +  69782  69979  198  2  0  102   72   134 0.833  11.67
 2.16 Intr +  70094  70143   50  2  2   75   89    52 0.990   2.32
 2.17 Intr +  73836  73993  158  1  2   91   78   141 0.976  13.13
 2.18 Intr +  74438  74500   63  1  0   61   94    67 0.842   3.51
 2.19 Intr +  78222  78296   75  2  0   55   72    55 0.468   0.31
 2.20 Intr +  81796  81926  131  0  2   77   -4   202 0.277   9.39
 2.21 Intr +  85399  85488   90  1  0   36   34   125 0.141   1.01
 2.22 Intr + 100761 100802   42  0  0   63   87    69 0.113   1.66
 2.23 Intr + 101350 101407   58  1  1  116  113   177 0.774  21.89
 2.24 Intr + 101601 101684   84  2  0   76   66   118 0.826   8.42
 2.25 Intr + 103070 103201  132  1  0   82   90   294 0.878  29.84
 2.26 Intr + 104302 104414  113  0  2  120   94   199 0.980  22.88
 2.27 Intr + 104709 104745   37  0  1  102   77    28 0.956   1.36
 2.28 Intr + 105736 105822   87  0  0   85   99    99 0.985  10.87
 2.29 Intr + 108594 108689   96  2  0   65  105   121 0.999  11.71
 2.30 Intr + 109909 110064  156  0  0  103   28   217 0.905  17.41
 2.31 Intr + 113657 113746   90  1  0   78   83    48 0.836   3.49
 2.32 Intr + 118275 118292   18  2  0  105   70    37 0.503   0.41
 2.33 Term + 118537 118701  165  0  0   60   43   182 0.993   8.82
 2.34 PlyA + 121330 121335    6                               1.05

 3.00 Prom + 127692 127731   40                              -0.76
 3.01 Init + 128384 128502  119  1  2  105   47   153 0.747  12.57
 3.02 Intr + 147535 147661  127  1  1   99   87    10 0.193   2.68
 3.03 Intr + 152625 152711   87  2  0   72  103    49 0.947   4.97
 3.04 Intr + 154542 154610   69  2  0   81   59    44 0.437   0.18
 3.05 Term + 157576 157752  177  0  0   75   52   287 0.814  21.49
 3.06 PlyA + 158820 158825    6                              -1.75

 4.00 Prom + 158893 158932   40                              -5.56
 4.01 Init + 159129 159134    6  2  0  104  101    10 0.106   4.21
 4.02 Intr + 170830 170938  109  0  1  132   84    62 0.985  10.06
 4.03 Intr + 176164 176309  146  2  2   73   92   140 0.871  12.90
 4.04 Term + 180621 180668   48  2  0   92   49    51 0.807  -1.10
 4.05 PlyA + 181317 181322    6                              -0.45

 5.00 Prom + 183797 183836   40                              -5.76
 5.01 Init + 186208 186383  176  0  2   83   52   126 0.991   7.32
 5.02 Intr + 200737 200892  156  1  0   68    1   117 0.445   0.13
 5.03 Intr + 203170 203241   72  1  0   92   97    10 0.599   0.82
 5.04 Intr + 207491 207584   94  2  1  105  105   137 0.991  17.07
 5.05 Intr + 215469 215540   72  2  0   92   76    98 0.891   8.60
 5.06 Term + 216479 216637  159  1  0   88   50    49 0.648  -0.96
 5.07 PlyA + 218524 218529    6                              -0.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 100753 100802   50  0  2   63   87    76 0.833   2.48
S.002 Term + 186784 186979  196  1  1   22   54   176 0.995   4.48
S.003 Init + 196317 196324    8  2  2  114   91     0 0.804   3.40

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:43466316_43685013|GENSCAN_predicted_peptide_1|76_aa
MRMFLKCVPQKLVAAQEAQLMCCGRDLVGETFPCKRADALTVAKRLLGNVFLSWTLLEKS
PVMEVFSDHAATLEFF

>gi568815578f:43466316_43685013|GENSCAN_predicted_CDS_1|231_bp
atgaggatgtttctgaagtgtgtgccccagaagctcgtggcagctcaggaggctcagctc
atgtgttgtgggagggacctggtgggagaaactttcccgtgtaagcgggctgatgctcta
acagtagctaaaaggttattaggaaatgtgtttctctcatggacattactggagaaatct
ccagtgatggaggtcttctctgaccatgcagcaaccctggagtttttctga

>gi568815578f:43466316_43685013|GENSCAN_predicted_peptide_2|1213_aa
MEGHAGMEGHAEMEMLRTLKGPSTGEVSMHLVAGDSPGSGPHLPATAFIIPASSATLGLP
SSALDVSCFPREPIHVGAPEQVAGCEPVSATVLPQLSAGPASSSTSTVRLLEWTEAAAPP
PGGGLRFRISEYKPLNMAGVEQPPSPELRQEGVTEYEDGGAPAGDGEAGPQQAEDHPQNP
PEDPNQDPPEDDSTCQCQACGPHQAAGPDLGSSNDGCPQLFQERSVIVENSSGSTSASEL
LKPMKKRKRREYQSPSEEESEPEAMEKQEEGKDPEGQPTASTPESEEWSSSQPATGEKKE
CWSWESYLEEQKAITAPVSLFQDSQAVTHNKNGFKLGMKLEGIDPQHPSMYFILTVAEVC
GYRLRLHFDGYSECHDFWVNANSPDIHPAGWFEKTGHKLQPPKGYKEEEFSWSQYLRSTR
AQAAPKHLFVSQSHSPPPLGFQVGMKLEAVDRMNPSLVCVASVTDVVDSRFLVHFDNWDD
TYDYWQLSPLQRPPHSFLVNMKLEAVDRRNPALIRVASVEDVEDHRIKIHFDGWSHGYDF
WIDADHPDIHPAGWCSKTGHPLQPPLGPREPSSASPGGCPPLSYRSLPHTRTSKYSFHHR
KCPTPGCDGSGHVTGKFTAHHCLSGCPLAERNQSRLKAELSDSEASARKKNLSGFSPRKK
PRHHGRIGRPPKYRKIPQEDFQTLTPDVVHQSLFMSALSAHPDRSLSVCWEQHCKLLPGV
AGISASTVAKWTIDEVFGFVQTLTGCEDQARLFKDESQTARPQTLAFALKNLEPGLWQLP
WMIDGEAFLLLTQADIVKIMSVKLGPALKIYNAILMFKNADDTLKSSNHGAVRERILSTP
QLGTYLTEIGAMIDWANGNINLGPSANPNAQPTDFDFLKVIGKGNYGKVLLAKRKSDGAF
YAVKVLQKKSILKKKEQSHIMAERSVLLKNVRHPFLVGLRYSFQTPEKLYFVLDYVNGGE
LFFHLQRERRFLEPRARFYAAEVASAIGYLHSLNIIYRDLKPENILLDCQGHVVLTDFGL
CKEGVEPEDTTSTFCGTPEYLAPEVLRKEPYDRAVDWWCLGAVLYEMLHGLPPFYSQDVS
QMYENILHQPLQIPGGRTVAACDLLQSLLHKDQRQRLGSKADFLEIKNHVFFSPINWDDL
YHKRLTPPFNPNVRHEQYLTGPADLKHFDPEFTQEAVSKSIGCTPDTVASSSGASSAFLG
FSYAPEDDDILDC

>gi568815578f:43466316_43685013|GENSCAN_predicted_CDS_2|3642_bp
atggaggggcatgctgggatggaggggcatgctgaaatggagatgctgaggacactgaag
gggccttccacaggggaggtcagcatgcacttggtggccggagacagccccggttctggt
cctcacctgcccgcaactgccttcatcattccagccagttcggccaccctcggcctgccc
agcagtgccctggatgtgtcttgctttccccgggagccaatccatgtgggtgccccggag
caagtggccggctgcgaaccagtttctgccaccgtcctgccgcagcttagcgccgggccg
gccagctccagcaccagcacagtgcggcttctggaatggacagaggccgcggccccgccc
ccagggggcggcctgcggttccggataagcgagtataagccgctgaacatggcgggagtg
gagcagcccccgagccccgagctgcggcaggaaggcgtgaccgaatacgaagatggcggg
gccccggcgggagatggcgaggcgggcccccaacaggcggaggaccacccccagaatcct
ccagaagatcccaatcaggaccccccagaggatgatagcacctgtcagtgccaggcgtgc
gggcctcaccaagccgcgggtccagatcttggttcctctaatgatggctgccctcagctg
ttccaggagcggtcagtcatagtggagaactcctcaggctctaccagcgcttctgagctc
ctcaaacccatgaagaagaggaagcgcagggaataccagagcccatcagaggaggagtcg
gagccagaggccatggagaagcaagaagaaggaaaggacccagagggacaacccactgct
agcaccccagagagtgaggagtggagcagcagccagcctgcaacaggtgagaagaaggaa
tgctggtcgtgggagtcctacctagaggagcagaaggccattactgctccagtcagcctc
ttccaggactcccaggcagtcactcacaacaagaatggcttcaaactgggcatgaagttg
gaaggcattgaccctcaacacccgtccatgtacttcatcctcaccgtggctgaggtatgt
ggctatcgcctacgcctgcactttgatgggtattctgagtgccatgacttctgggtcaat
gccaactcccctgacattcaccctgctggctggttcgagaagacgggccacaagctgcag
cctcccaaaggttacaaggaggaggagttcagctggagccagtacctgcgcagcacaaga
gctcaggctgcccccaagcacctgtttgtgagccagagccacagtcccccacccctgggc
ttccaggtgggcatgaagctggaggctgttgaccgcatgaacccgtcccttgtctgcgtg
gccagtgtgaccgatgtggtggacagccgcttcctggtgcactttgacaactgggatgat
acttatgactactggcagctgtcccctctgcagcgaccccctcacagcttcctggtcaat
atgaagctggaggctgtggaccgcaggaacccagccctgattcgcgtggccagcgtggag
gatgtggaggaccatcggataaagatccactttgatggctggagtcatggctatgatttc
tggatcgacgctgaccacccagacatccaccctgccggctggtgctccaagacaggacat
cccctgcagcctcctctcggacccagagagcccagctctgcctcccctgggggctgtccc
cctctcagctataggagcctgccccacactaggacctccaaatacagctttcaccaccgg
aagtgccccactcctggttgcgacggctctggccatgtcacaggcaagttcacagctcac
cattgcctctcaggctgcccactggctgagaggaaccagagccggctgaaagcggagctg
tctgactcggaggcctcagcccgcaagaagaacctctcaggcttctccccaaggaagaag
cctcgccatcacggccgaattggacgccctccgaagtatcgaaagattccgcaggaagat
ttccagaccctcacgcccgatgtcgtgcaccagtccctcttcatgtcagccctgtcggcc
caccctgaccgctcactctcagtgtgctgggagcagcactgcaagctcctgccaggagta
gcgggcatctcagcctcgacagtcgccaagtggaccatcgatgaggtcttcggctttgtt
cagaccctgacaggttgtgaggaccaagcacgcctcttcaaagacgagtctcagactgcc
agaccacagactctggcctttgctttgaagaacctggaacctgggctttggcagttaccg
tggatgattgacggcgaggccttccttttgctgacacaggcggacattgtgaagatcatg
agcgtcaagctgggcccagccttgaagatctataacgccattctcatgttcaaaaacgct
gatgacaccttaaaaagctccaaccatggagctgtccgagagcgcatcctctccacacca
caactgggaacctacctgaccgagattggtgccatgatagactgggccaatgggaacatc
aacctggggccttcagccaacccaaatgcccagcccacggacttcgacttcctcaaagtc
atcggcaaagggaactacgggaaggtcctactggccaagcgcaagtctgatggggcgttc
tatgcagtgaaggtactacagaaaaagtccatcttaaagaagaaagagcagagccacatc
atggcagagcgcagtgtgcttctgaagaacgtgcggcaccccttcctcgtgggcctgcgc
tactccttccagacacctgagaagctctacttcgtgctcgactatgtcaacgggggagag
ctcttcttccacctgcagcgggagcgccggttcctggagccccgggccaggttctacgct
gctgaggtggccagcgccattggctacctgcactccctcaacatcatttacagggatctg
aaaccagagaacattctcttggactgccagggacacgtggtgctgacggattttggcctc
tgcaaggaaggtgtagagcctgaagacaccacatccacattctgtggtacccctgagtac
ttggcacctgaagtgcttcggaaagagccttatgatcgagcagtggactggtggtgcttg
ggggcagtcctctacgagatgctccatggcctgccgcccttctacagccaagatgtatcc
cagatgtatgagaacattctgcaccagccgctacagatccccggaggccggacagtggcc
gcctgtgacctcctgcaaagccttctccacaaggaccagaggcagcggctgggctccaaa
gcagactttcttgagattaagaaccatgtattcttcagccccataaactgggatgacctg
taccacaagaggctaactccacccttcaacccaaatgtgcgccacgaacagtacctgaca
ggacctgctgacttgaagcattttgacccagagttcacccaggaagctgtgtccaagtcc
attggctgtacccctgacactgtggccagcagctctggggcctcaagtgcattcctggga
ttttcttatgcgccagaggatgatgacatcttggattgctag

>gi568815578f:43466316_43685013|GENSCAN_predicted_peptide_3|192_aa
MEKELRSTILFNAYKKEIFTTNNGYKSMQKKLRSNWKIQRALTFVYPFGATLSVMKPAVA
VLSTGSVCFPLNRPILAFYHSKNQGGKLAVLGSCHMFSDQYLDKEENSKIMDVVFQWLTT
GDIHLNQIDAEDPEISDYMMLPYTATLSKRNRECLQESDEIPRDFTTLFDLSIFQLDTTS
FHSVIESVPVGL

>gi568815578f:43466316_43685013|GENSCAN_predicted_CDS_3|579_bp
atggagaaagagctgcggagcaccattcttttcaatgcctacaaaaaggagatatttacc
accaacaatggctacaaatccatgcagaaaaaacttcggagtaattggaagattcagagg
gctctcacctttgtgtatccttttggtgccacattgagtgtcatgaaaccagcagtggcg
gttctgtctacaggttctgtctgcttcccacttaacagacccattttggctttctatcac
tcaaagaaccaaggtgggaagctggcagtgcttggttcatgtcacatgttcagtgatcaa
tatttggacaaagaagaaaacagcaaaatcatggatgttgttttccagtggctcacgaca
ggagacatccacctaaaccagattgatgctgaggacccagagatttctgactacatgatg
ctgccctacacagccaccctatcaaagcggaatcgagagtgtctccaggagagtgatgag
atcccaagggactttaccaccctcttcgacctgtccatcttccagctggataccacctcc
ttccacagcgtcatcgagtcagtacctgtgggcctctga

>gi568815578f:43466316_43685013|GENSCAN_predicted_peptide_4|102_aa
MAVFPPSFRELPPPPLELFDLDETFSSEKARLAQITNKCTEEDLEFYVRKCGDILGVTSK
LPKDQQDAKHILEHVFFQVVEFKKLNQEHDIDTSETAFQNNF

>gi568815578f:43466316_43685013|GENSCAN_predicted_CDS_4|309_bp
atggcggtttttcctcccagtttccgggagttaccacctcctcctctggagctatttgat
ttagatgaaacgttctcctctgagaaggcacggctggctcagattaccaataagtgtact
gaagaagacctggaattttatgtcaggaagtgtggtgatattcttggagtaaccagtaaa
ctaccaaaggaccaacaggatgccaaacatatccttgagcacgtcttcttccaagtggtg
gagttcaagaaattgaaccaggaacatgacatcgatacaagtgaaacagcattccagaac
aatttctga

>gi568815578f:43466316_43685013|GENSCAN_predicted_peptide_5|242_aa
MPSRLRKTQKLRGHVSHGHGRIGKLQKHPRGHSNAGGMHHHRINFNKYYPGYFGKVGMRT
GRRRRAWREIEKCFNPRRRRLQFLRARSAGPADTLTPSSAARGPERPEQPGAPLPALSLT
HPCPGAEPSTLKGKSEDLDELHYQDTDSDVPEQRDSKCKVKWTHEEDEQLRALVRQFGQQ
DWKFLASHFPNRTDQQCQYRWLRVLNPDLVKGPWTKEEDQKVTAGTVPCTQWVFTHQCTW
GL

>gi568815578f:43466316_43685013|GENSCAN_predicted_CDS_5|729_bp
atgccatccagactgaggaagacccagaaacttcggggccatgtgagccatggccatggc
cgcataggcaagctccaaaagcaccccagaggccacagtaatgctggtggaatgcatcac
cacaggatcaacttcaacaaatactacccaggttactttgggaaagttggtatgaggact
gggcggcgccgacgcgcttggcgggagatagaaaagtgcttcaacccgcgccggcggcga
ctgcagttcctgcgagcgaggagcgcgggacctgctgacacgctgacgccttcgagcgcg
gcccggggcccggagcggccggagcagcccggtgccccacttcccgctttgtctctgact
cacccctgtcctggagcagagcccagcacgttgaagggcaaaagcgaggatctggatgag
ctgcactaccaggacacagattcagatgtgccggagcagagggatagcaagtgcaaggtc
aaatggacccatgaggaggacgagcagctgagggccctggtgaggcagtttggacagcag
gactggaagttcctggccagccacttccctaaccgcactgaccagcaatgccagtacagg
tggctgagagttttgaatccagaccttgtcaaggggccatggaccaaagaggaagaccaa
aaagtaactgctgggacagtgccttgcacacagtgggtcttcacccaccagtgtacctgg
ggactgtga