GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:32:48 Sequence gi568815593r:157038733_157239431 : 200699 bp : 42.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5572 5997 426 0 0 19 42 198 0.416 1.36 1.02 Term + 7025 7180 156 1 0 59 35 150 0.638 3.75 1.03 PlyA + 10475 10480 6 1.05 2.05 PlyA - 10794 10789 6 1.05 2.04 Term - 13917 13634 284 1 2 10 46 566 0.959 39.20 2.03 Intr - 16801 16469 333 2 0 130 113 122 0.989 13.52 2.02 Intr - 19223 19166 58 2 1 101 111 2 0.550 1.34 2.01 Init - 21677 21567 111 2 0 58 61 93 0.455 3.86 2.00 Prom - 45732 45693 40 -3.65 3.08 PlyA - 46105 46100 6 -0.45 3.07 Term - 48530 48370 161 0 2 102 36 228 0.969 16.12 3.06 Intr - 56727 56520 208 0 1 88 -6 161 0.000 4.43 3.05 Intr - 60169 60126 44 2 2 102 98 -9 0.242 -1.46 3.04 Intr - 68230 67895 336 2 0 97 109 373 0.752 34.87 3.03 Intr - 94665 94621 45 1 0 63 87 73 0.036 2.16 3.02 Intr - 99057 98974 84 1 0 98 56 92 0.103 5.87 3.01 Init - 100699 100075 625 1 1 81 40 511 0.521 41.15 3.00 Prom - 104390 104351 40 -5.15 4.04 PlyA - 104984 104979 6 1.05 4.03 Term - 114094 113982 113 2 2 36 48 144 0.827 2.94 4.02 Intr - 118658 118610 49 2 1 70 115 8 0.052 -0.97 4.01 Init - 119894 119811 84 2 0 64 73 85 0.615 5.47 4.00 Prom - 120824 120785 40 -7.45 5.03 PlyA - 121683 121678 6 1.05 5.02 Term - 124926 123715 1212 0 0 70 43 995 0.988 83.85 5.01 Init - 127364 126726 639 2 0 75 6 445 0.379 30.38 5.00 Prom - 127788 127749 40 -4.95 6.00 Prom + 151298 151337 40 -2.15 6.01 Init + 152000 152221 222 1 0 61 37 139 0.174 4.70 6.02 Intr + 170157 170261 105 2 0 107 111 119 0.999 15.59 6.03 Intr + 172555 172636 82 0 1 112 86 80 0.998 8.49 6.04 Intr + 175459 175587 129 2 0 63 95 70 0.967 4.95 6.05 Intr + 179135 179175 41 0 2 123 96 60 0.944 7.32 6.06 Intr + 184131 184282 152 2 2 92 106 179 0.994 18.14 6.07 Intr + 193608 193662 55 0 1 132 98 60 0.966 9.46 6.08 Term + 199377 199505 129 2 0 96 45 132 0.409 6.90 6.09 PlyA + 200175 200180 6 -1.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 56727 56574 154 0 1 88 101 221 0.999 22.12 S.002 Init - 70251 70194 58 0 1 35 108 77 0.976 4.10 S.003 Sngl - 103990 103727 264 1 0 91 38 177 0.839 8.12 S.004 Init - 163457 163333 125 2 2 93 99 119 0.998 13.19 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:157038733_157239431|GENSCAN_predicted_peptide_1|193_aa PGQQSETEKEGRKGKGREKERTKEGRKEGRKEGRKEGRKEKERKKERKKERKKERKEGRK EGRKEGRKEEGRKERKERRKEEGRKERERKTKKEGKKERKKERKRKKERKKERKKERKKE RKKERREGGRKEGRRKESQKQYATEGQPRHSKNLAQEPWIHGQNGSSHTNPGDTAAPLSV PPGKLYFDQVSLT >gi568815593r:157038733_157239431|GENSCAN_predicted_CDS_1|582_bp cctgggcaacagagtgagactgagaaagaaggaaggaaagggaagggaagggagaaagaa aggacgaaggaaggaaggaaggaaggaaggaaggaaggaaggaaggaaggaaggaaggag aaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaaggaaggaaggaag gaaggaaggaaggaaggaaggaaggaggaaggaaggaaagaaagaaaggaaagaagaaag gaggaaggaaggaaggagagagaaagaaagacaaagaaagaaggaaagaaagaaagaaag aaagaaagaaagagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaa agaaagaaagaaaggagggagggaggaaggaaggaaggaaggagaaaagaaagccaaaag caatatgcaactgagggccaacccaggcacagtaagaacttggcccaagagccttggatc cacggccaaaatggcagcagccacacaaatcctggggatacagctgctccactgtctgtt ccacctggaaaactctactttgaccaggtttcactgacatag >gi568815593r:157038733_157239431|GENSCAN_predicted_peptide_2|261_aa MENTATEKALGLRQVGKKALVAGAESAGERVVEEIYGADPIMHPQVVILSLILHLADSVA GSVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNGIVWTNGTHVTYRKDTRYKL LGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDMKITVSLEIVPRHDYSNCHNCSNR HDCSNEHHCSNDNDCSNDDCSNDNCSNNNEHSNDNDCSDDNDCFNDNERSNDNEHSNNNK CSSDNNCLYLCSSNAFAQAEP >gi568815593r:157038733_157239431|GENSCAN_predicted_CDS_2|786_bp atggagaacacagccacggaaaaggccttagggttgaggcaagttggaaagaaagctcta gtagctggggctgagtcagcaggggagagagtggtagaagaaatctatggggctgatccc ataatgcatcctcaagtggtcatcttaagcctcatcctacatctggcagattctgtagct ggttctgtaaaggttggtggagaggcaggtccatctgtcacactaccctgccactacagt ggagctgtcacatccatgtgctggaatagaggctcatgttctctattcacatgccaaaat ggcattgtctggaccaatggaacccacgtcacctatcggaaggacacacgctataagcta ttgggggacctttcaagaagggatgtctctttgaccatagaaaatacagctgtgtctgac agtggcgtatattgttgccgtgttgagcaccgtgggtggttcaatgacatgaaaatcacc gtatcattggagattgtgccacgtcacgactactccaattgtcacaactgttccaaccgt cacgactgttcgaacgagcaccactgttccaacgacaacgactgttccaatgacgactgt tccaacgacaactgttccaacaacaatgagcattccaacgacaacgactgttctgacgac aatgactgtttcaacgacaacgagcgttccaacgacaacgagcattccaacaacaacaag tgttccagtgacaacaactgtctctacctttgttcctccaatgcctttgcccaggcagaa ccatga >gi568815593r:157038733_157239431|GENSCAN_predicted_peptide_3|500_aa MGEPQQVSALPPPPMQYIKEYTDENIQEGLAPKPPPPIKDSYMMFGNQFQCDDLIIRPLE SQGIERLHPMQFDHKKELRKLNMSILINFLDLLDILIRSPGSIKREEKLEDLKLLFVHVH HLINEYRPHQARETLRVMMEVQKRQRLETAERFQKHLERVIEMIQNCLASLPDDLPHSEA GMRVKTEPMDADDSNNCTGQNEHQRENSAAGWTVVSRISLEATEDGKDCQSGSLNDQQSC EEPSKSDGIYGWSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGACPVFECGNVVL RTDERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGIMNDEKFNLKLV IKPAETQTLGSLPDINLTQISTLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALA LIFGALIFKCKCFCFSLPLITNSSVRTRLANAVAEGIRSEENIYTIEENVYEVEEPNEYY CYVSSRQQPSQPLGCRFAMP >gi568815593r:157038733_157239431|GENSCAN_predicted_CDS_3|1503_bp atgggtgaaccacagcaagtgagtgcacttccaccacctccaatgcaatatatcaaggaa tatacggatgaaaatattcaagaaggcttagctcccaagcctccccctccaataaaagac agttacatgatgtttggcaatcagttccaatgtgatgatcttatcatccgccctttggaa agtcagggcatcgaacggcttcatcctatgcagtttgatcacaagaaagaactgagaaaa cttaatatgtctatccttattaatttcttggaccttttagatattttaataaggagccct gggagtataaaacgagaagagaaactagaagatcttaagctgctttttgtacacgtgcat catcttataaatgaataccgaccccaccaagcaagagagaccttgagagtcatgatggag gtccagaaacgtcaacggcttgaaacagctgagagatttcaaaagcacctggaacgagta attgaaatgattcagaattgcttggcttctttgcctgatgatttgcctcattcagaagca ggaatgagagtaaaaactgaaccaatggatgctgatgatagcaacaattgtactggacag aatgaacatcaaagagaaaattcagctgctggttggacggtcgtatccaggataagtttg gaagccaccgaagatggcaaagattgtcagtctggttctttgaatgaccaacaatcctgt gaagaaccttctaaatctgatggtatttatggctggtcctcagaagtggaatacagagcg gaggtcggtcagaatgcctatctgccctgcttctacaccccagccgccccagggaacctc gtgcccgtctgctggggcaaaggagcctgtcctgtgtttgaatgtggcaacgtggtgctc aggactgatgaaagggatgtgaattattggacatccagatactggctaaatggggatttc cgcaaaggagatgtgtccctgaccatagagaatgtgactctagcagacagtgggatctac tgctgccggatccaaatcccaggcataatgaatgatgaaaaatttaacctgaagttggtc atcaaaccagcagagacacagacactggggagcctccctgatataaatctaacacaaata tccacattggccaatgagttacgggactctagattggccaatgacttacgggactctgga gcaaccatcagaataggcatctacatcggagcagggatctgtgctgggctggctctggct cttatcttcggcgctttaattttcaaatgtaagtgtttttgtttctctctccctctgata acaaattcttcagtgagaaccagattggcaaatgcagtagcagagggaattcgctcagaa gaaaacatctataccattgaagagaacgtatatgaagtggaggagcccaatgagtattat tgctatgtcagcagcaggcagcaaccctcacaacctttgggttgtcgctttgcaatgcca tag >gi568815593r:157038733_157239431|GENSCAN_predicted_peptide_4|81_aa MHYQDPPSMKDLCPQLLRMLPADKFQLAWCPYGELELPPLPSSHVYNTKSEDQCELWTSG TNDASMSAHQNQMNHSGAGCR >gi568815593r:157038733_157239431|GENSCAN_predicted_CDS_4|246_bp atgcactatcaagaccctccttcaatgaaggacttgtgtcctcagttgctgagaatgctg ccggcagacaagtttcagctggcatggtgtccatatggggagttggaactcccacccttg cccagtagtcatgtgtataacaccaagagtgaagaccagtgtgaactatggacttcaggt actaatgatgcgtcaatgtcagctcatcaaaatcaaatgaaccactctggtgcaggatgt cgatga >gi568815593r:157038733_157239431|GENSCAN_predicted_peptide_5|616_aa MGDLQRQLYNRGEYNIFKYAPMFESNFIQINKKGEVIDVHNRVRMVTVGIVCTSPILPLP DVMVLAQPTKICEQHVRWGRFAKGRGRRPVKTLELTRLLPLKFVKISIHDHEKQQLRLKL ATGRTFYLQLCPSSDTREDLFCYWEKLVYLLRPPVESYCSTPTLLSGDAPPEDNKSLVVS LSKAREHGAGSLGEQFKASPTKPLVPSEAHLEKAAELHREGDQSETGLYKPCDVSAATSS AYAGGEGIQHASHGTASAASPSTSTPGAAEGGAARTAGGMAVAGTATGPRTDVAIAGAAM SPATGAMSIATTKSAGPGQVTTALAGAAIKNPGENESSKSMAGAANISSEGISLALVGAA STSLEGTSTSMAGAASLSQDSSLSAAFAGSITTSKCAAERTEGPAVGPLISTLQSEGYMS ERDGSQKVSQPSAEVWNENKERREKKDRHPSRKSSHHRKAGESHRRRAGDKNQKASSHRS ASGHKNTRDDKKEKGYSNVRGKRHGSSRKSSTHSSTKKESRTTQELGKNQSASSTGALQK KASKISSFLRSLRATPGSKTRVTSHDREVDIVAKMVEKQNIEAKVEKAQGGQELEMISGT MTSEKTEMIVFETKSI >gi568815593r:157038733_157239431|GENSCAN_predicted_CDS_5|1851_bp atgggggacctgcaacgacaattgtacaacagaggagagtacaacattttcaagtatgca ccaatgttcgagagtaattttattcagataaacaaaaagggagaggtgattgatgtacac aaccgtgtccgaatggtgacagtgggcatcgtctgcaccagccccatcctcccactgcct gacgtcatggttctggcccaaccaactaaaatctgtgaacagcatgtcagatggggccgg tttgccaaggggagaggtcgcaggcccgtcaagactctagagctcacgagactgcttccc ttgaaatttgtgaagatctccatccacgatcatgagaaacagcagctgcgcctgaaactc gccactggccgtactttttatctgcagttgtgtccctcttctgacacacgggaagatctc ttttgctattgggaaaaacttgtctatctcctgaggccaccagtagagagttactgcagt accccaacacttctatctggggacgcaccacccgaagacaacaaaagcctagtggtaagc ctctcaaaggctcgagaacatggggcaggttctttgggggagcagttcaaagccagtcct acaaagcctttggtgccttctgaggcccacctggaaaaagctgcagagctccacagagaa ggggatcagagtgagactgggctctacaagccttgtgatgtatctgcagccacctcttct gcttatgctgggggagagggaatccaacatgcctcccacggaacggctagtgcggcttct ccatccacgagcactccaggggctgctgaaggaggagcagcaaggacagcaggtggcatg gcagtggcaggaacagcaacaggacctagaacagatgtggcaatagcaggggcagcaatg agtcctgcaacaggtgctatgagcatagcaacaaccaaatctgcaggcccaggtcaagtg accacagcgctggcgggagcagctatcaaaaatccaggagaaaatgaatccagcaagtcc atggcaggtgctgccaacatatcctcagagggtattagcttggccttggtgggtgctgca agcacctccttggaaggtacttccacctcgatggcgggggccgccagtctctcccaagac agcagcttgagtgcggcgtttgcaggcagtattacgaccagcaagtgtgcagcagaaaga actgaaggaccagcagtgggacccctcatctccaccttgcaaagcgaaggctacatgagt gaacgagatggaagccagaaagtttcccagcccagtgctgaagtctggaatgaaaacaag gaaagaagagaaaagaaggacagacatcccagtaggaaaagttctcatcaccgcaaggca ggtgaaagtcaccgcaggagagcgggggacaagaatcagaaagcgtcttcccaccggtcc gcatctggccataaaaacacgagagatgacaaaaaagaaaaagggtacagcaacgtaagg ggcaagcgacatggctcctctcgcaagagctccacccacagctccaccaaaaaggagtcg agaacaactcaggaactggggaagaaccaatctgcatctagcacaggagctttacaaaag aaagccagtaagatcagctcttttttaaggagcctcagggccactcctggttcaaaaaca agggtcacatcacatgacagagaggtagatatcgtggctaagatggtggagaagcaaaac atagaggccaaagtggagaaagcccagggtggccaggagctggagatgatcagtggcact atgacatccgagaagacggagatgatcgtctttgaaaccaaatccatttaa >gi568815593r:157038733_157239431|GENSCAN_predicted_peptide_6|304_aa MGKSILGRGNGKYKDPEEGKSLACVRNRDLNMQSSREGHKTRGRSQTLQALVDHGKEFGF DSNSHEKFPRLKQWKKRTLKGSIELSRIKCVEIVKSDISIPCHYKYPFQVVHDNYLLYVF APDRESRQRWVLALKEETRNNNSLVPKYHPNFWMDGKWRCCSQLEKLATGCAQYDPTKNA SKKPLPPTPEDNRRPLWEPEETVVIALYDYQTNDPQELALRRNEEYCLLDSSEIHWWRVQ DRNGWYNKSISRDKAEKLLLDTGKEGAFMVRDSRTAGTYTVSVFTKAVVRYGANSAQQSG EKHF >gi568815593r:157038733_157239431|GENSCAN_predicted_CDS_6|915_bp atgggaaagagcattctgggcagagggaacggcaagtacaaagaccctgaggaaggaaaa agcttggcttgcgtgaggaacagagatttgaacatgcagagttcaagagaggggcacaag acgaggggcagaagccaaactctgcaggcccttgtggatcatggcaaggagtttggattt gattccaactcccatgaaaagtttccaaggttgaagcagtggaagaagcgcacgctgaag gggtccattgagctctcccgaatcaaatgtgttgagattgtgaaaagtgacatcagcatc ccatgccactataaatacccgtttcaggtggtgcatgacaactacctcctatatgtgttt gctccagatcgtgagagccggcagcgctgggtgctggcccttaaagaagaaacgaggaat aataacagtttggtgcctaaatatcatcctaatttctggatggatgggaagtggaggtgc tgttctcagctggagaagcttgcaacaggctgtgcccaatatgatccaaccaagaatgct tcaaagaagcctcttcctcctactcctgaagacaacaggcgaccactttgggaacctgaa gaaactgtggtcattgccttatatgactaccaaaccaatgatcctcaggaactcgcactg cggcgcaacgaagagtactgcctgctggacagttctgagattcactggtggagagtccag gacaggaatgggtggtacaataagagtatcagccgagacaaagctgaaaaacttcttttg gacacaggcaaagaaggagccttcatggtaagggattccaggactgcaggaacatacacc gtgtctgttttcaccaaggctgttgtaaggtatggagctaactctgctcagcaaagtgga gagaaacacttctga