GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:08:07 Sequence gi568815597r:39582539_39784603 : 202065 bp : 48.95% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2211 2350 140 2 2 57 43 129 0.478 4.91 1.02 Intr + 13871 14029 159 2 0 56 78 124 0.244 7.30 1.03 Term + 27046 27127 82 1 1 75 51 86 0.007 0.77 1.04 PlyA + 32005 32010 6 1.05 2.06 PlyA - 36564 36559 6 1.05 2.05 Term - 44642 43969 674 0 2 121 49 176 0.273 10.92 2.04 Intr - 47770 47689 82 1 1 127 80 34 0.718 5.71 2.03 Intr - 49041 48904 138 0 0 49 83 102 0.216 6.46 2.02 Intr - 50177 50111 67 1 1 104 84 3 0.169 0.41 2.01 Init - 57087 57008 80 0 2 85 94 146 0.977 15.43 2.00 Prom - 73866 73827 40 -3.66 3.15 PlyA - 73884 73879 6 -0.45 3.14 Term - 76948 76583 366 1 0 82 37 264 0.682 15.30 3.13 Intr - 78725 78541 185 0 2 68 47 427 0.996 36.11 3.12 Intr - 80896 80774 123 2 0 106 110 225 0.999 27.06 3.11 Intr - 83112 82983 130 0 1 95 89 313 0.977 32.37 3.10 Intr - 83698 83531 168 1 0 83 100 330 0.987 33.84 3.09 Intr - 86672 86596 77 0 2 94 44 40 0.938 -0.57 3.08 Intr - 87340 87081 260 0 2 71 -28 185 0.846 2.31 3.07 Intr - 89567 89349 219 1 0 89 67 144 0.236 9.82 3.06 Intr - 100195 100002 194 1 2 114 -1 330 0.019 24.99 3.05 Intr - 101614 101399 216 1 0 76 64 627 0.999 57.80 3.04 Intr - 102073 101904 170 2 2 89 86 381 0.418 37.67 3.03 Intr - 108632 108527 106 2 1 87 94 -1 0.013 0.19 3.02 Intr - 112467 112410 58 2 1 98 70 99 0.422 7.99 3.01 Init - 121123 121062 62 1 2 86 64 52 0.142 3.32 3.00 Prom - 145772 145733 40 -1.66 4.00 Prom + 155307 155346 40 -3.16 4.01 Init + 156363 156393 31 2 1 95 80 61 0.962 6.16 4.02 Intr + 157627 157725 99 2 0 57 109 138 0.995 12.88 4.03 Intr + 158828 158871 44 0 2 71 78 30 0.909 -1.74 4.04 Intr + 159357 159383 27 2 0 116 56 51 0.817 3.01 4.05 Intr + 160678 160759 82 0 1 67 109 46 0.887 3.81 4.06 Intr + 161286 161386 101 1 2 87 78 92 0.995 7.93 4.07 Intr + 162837 162960 124 2 1 76 99 228 0.954 22.86 4.08 Intr + 166365 166550 186 1 0 81 91 222 0.926 21.46 4.09 Intr + 170372 170514 143 0 2 96 87 154 0.997 16.17 4.10 Term + 170749 170817 69 0 0 66 54 194 0.985 11.74 4.11 PlyA + 171144 171149 6 1.05 5.09 PlyA - 171455 171450 6 1.05 5.08 Term - 178030 177881 150 1 0 115 47 249 0.998 21.41 5.07 Intr - 180664 180554 111 1 0 124 65 116 0.996 13.48 5.06 Intr - 181253 181174 80 0 2 97 51 130 0.888 9.47 5.05 Intr - 182279 182085 195 0 0 135 73 140 0.863 16.59 5.04 Intr - 182382 182311 72 1 0 51 86 50 0.376 0.48 5.03 Intr - 182857 182742 116 0 2 41 21 117 0.295 0.39 5.02 Intr - 185034 184997 38 0 2 106 86 -21 0.082 -3.44 5.01 Init - 188717 187176 1542 2 0 83 38 2872 0.299 272.11 5.00 Prom - 189183 189144 40 -9.46 6.05 PlyA - 190337 190332 6 1.05 6.04 Term - 191365 191328 38 2 2 101 49 1 0.503 -5.00 6.03 Intr - 191918 191770 149 1 2 109 89 214 0.939 23.48 6.02 Intr - 192500 192311 190 0 1 79 51 292 0.254 23.24 6.01 Init - 197207 197192 16 2 1 93 82 9 0.088 1.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 33766 33621 146 1 2 67 80 121 0.884 6.94 S.002 Init + 61359 61428 70 2 1 96 58 52 0.854 4.34 S.003 Term - 100195 99998 198 1 0 114 43 346 0.979 30.10 S.004 Init + 108793 108861 69 0 0 47 52 127 0.862 4.05 S.005 Term + 109700 109882 183 1 0 127 47 98 0.939 7.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:39582539_39784603|GENSCAN_predicted_peptide_1|126_aa MTKNQILKAETNVENPSSVEIRGMNEENHNCYSPVQVPRLNTGYKDDPWHVDTTQIVAAP VELGIASASAFEQGAPRCHFAPGLTNDVADPEQKIEEGGKNALILATVAEGREEWGEDDE TGPAVK >gi568815597r:39582539_39784603|GENSCAN_predicted_CDS_1|381_bp atgaccaagaaccagatattaaaagctgaaacaaatgtggaaaatccatcatctgtggag atcaggggtatgaacgaggagaatcataactgctattctcctgttcaagtccccaggctg aacactggctacaaagatgatccctggcatgttgacaccactcagattgtggctgcccct gtggagttaggaattgcctcggcatcagccttcgaacaaggggctccacgctgtcatttt gcacctggcctcacaaatgatgtagctgatcctgagcagaagattgaggaaggtggaaag aatgctctaatactggccacggtcgcagaagggagggaagaatggggcgaggacgatgaa acagggccagcggtgaagtga >gi568815597r:39582539_39784603|GENSCAN_predicted_peptide_2|346_aa MKRPKEPSGSDGESDGPIDVGQEGQLSQMARPLSTPSSSQMQARKKHRGIIEKRRRDRIN SSLSELRRLVPTAFEKQVCDIADCARGNTVWVADKGSSKLEKAEVLQMTVDHLKMLHATG GTGFFDARALAVDFRSIGFRECLTEVIRYLGVLEGPSSRADPVRIRLLSHLNSYAAEMEP SPTPTGPLAFPAWPWSFFHSCPGLPALSNQLAILGRVPSPVLPGVSSPAYPIPALRTAPL RRATGIILPARRNVLPSRGASSTRRARPLERPATPVPVAPSSRAARSSHIAPLLQSSSPT PPGPTGSAAYVAVPTPNSSSPGPAGRPAGAMLYHSWVSEITEIGAF >gi568815597r:39582539_39784603|GENSCAN_predicted_CDS_2|1041_bp atgaagcgacccaaggagccgagcggctccgacggggagtccgacggacccatcgacgtg ggccaagagggccagctgagccagatggccaggccgctgtccacccccagctcttcgcag atgcaagccaggaagaaacacagagggatcatagagaaacggcgtcgagaccgcatcaac agtagcctttctgaattgcgacgcttggtccccactgcctttgagaaacaggtatgtgac attgctgattgtgctagaggaaatactgtgtgggtggcagataagggctcttccaagctg gagaaagccgaggtcttgcagatgacggtggatcacttgaaaatgctccatgccactggt gggacaggattctttgatgcccgagccctggcagttgacttccggagcattggttttcgg gagtgcctcactgaggtcatcaggtacctgggggtccttgaagggcccagcagccgtgca gaccccgtccggattcgccttctctcccacctcaacagctacgcagccgagatggagcct tcgcccacgcccactggccctttggccttccctgcctggccctggtctttcttccatagc tgtccagggctgccagccctgagcaaccagctcgccatcctgggaagagtgcccagccct gtcctccccggtgtctcctctcctgcttaccccatcccagccctccgaaccgctcccctt cgcagagccacaggcatcatcctgccagcccggaggaatgtgctgcccagtcgaggggca tcttccacccggagggcccgccccctagagaggccagcgacccctgtgcctgtcgccccc agcagcagggctgccaggagcagccacatcgctcccctcctgcagtcttcctccccaaca ccccctggtcctacagggtcggctgcttacgtggctgttcccacccccaactcatcctcc ccagggccagctgggaggccagcgggagccatgctctaccactcctgggtctctgaaatc actgaaatcggggctttctga >gi568815597r:39582539_39784603|GENSCAN_predicted_peptide_3|777_aa MIVKRRNLYRKPFKELNQQERACSFNNVEQVFAKTPSLRTAAATEEPTPGQETPHLSPGP WVQRRRQSRLLIPSSGPAMGKTNSKLAPEVLEDLVQNTEFSEQELKQWYKGFLKDCPSGI LNLEEFQQLYIKFFPYGDASKFAQHAFRTFDKNGDGTIDFREFICALSVTSRGSFEQKLN WAFEMYDLDGDGRITRLEMLEIIEAIYKMVGTVIMMRMNQDGLTPQQRVDKIFKKMDQDK DDQITLEEFKEAAKSDPSIVLLLQCDMQNPAPATSASTPPPALAGPGQSRSMEPGQPREP QEPREPGPGAETAAAPVWEEAKIFYDNLAPKKKPKSVKAMHGKITLGWIGKGESGSKPTS EKAGASLKRKLDECQCCGLSDAEEGMETGDWGSESISAQNPFPPQAPRNRRAQAQLSVMT PGSAPDSHAGAISLTLKGRESCQHPGFLELRAPRPKPQNAVTIAVSSRALFRMDEEQQIY TEQGVEEYVRYQLEHENEPFSPGPAFPFVKALEAVNRRLRELYPDSEDVFDIVLMTNNHA QVGVRLINSINHYDLFIERFCMTGGNSPICYLKAYHTNLYLSADAEKVREAIDEGIAAAT IFSPSRDVVVSQSQLRVAFDGDAVLFSDESERIVKAHGLDRFFEHEKAHENKPLAQGPLK GFLEALGRLQKKFYSKGLRLECPIRTYLVTARSAASSGARALKTLRSWGLETDEALFLAG APKGPLLEKIRPHIFFDDQMFHVAGAQEMGTVAAHVPYGVAQTPRRTAPAKQAPSAQ >gi568815597r:39582539_39784603|GENSCAN_predicted_CDS_3|2334_bp atgattgtaaagaggagaaacctctacagaaaaccattcaaggaacttaatcagcaagaa agagcctgctccttcaacaacgtggaacaggtgtttgccaagactccaagcctgagaaca gcggcagcaactgaagagcccacccctgggcaggagaccccccacctcagtccaggccca tgggtgcagagaaggaggcagagccgcctacttatccccagctcaggccccgccatgggg aagaccaacagcaagctggcccccgaggtgctggaggaccttgttcagaacactgagttc agcgagcaggagctgaagcagtggtacaagggcttcctgaaggactgccccagcggcatc ctcaacctggaggagtttcagcagctctacatcaagttcttcccctacggcgacgcctcc aagttcgcgcagcacgctttccgcaccttcgacaagaacggcgacggcaccatcgacttc cgggagttcatctgcgccctgtcggtcacctcccgcggcagcttcgagcagaagctcaac tgggcctttgagatgtacgacctggacggcgacgggcgcatcacgcgcctggagatgctg gagatcatcgaggcaatctacaagatggtgggcaccgtgatcatgatgcgcatgaaccag gacgggctcacgccccagcagcgtgtggacaagatcttcaagaagatggaccaggataag gacgaccagattacattggaggagttcaaggaggcagccaagagtgacccatccattgtg ttgctgctgcagtgtgacatgcagaacccagccccagccacctccgcgtctacgccgccg cctgctctggccgggccgggtcagagccggagcatggaacctgggcagccccgggagccc caggagccccgcgagcccgggccaggagcggagaccgctgcggccccggtctgggaggaa gccaagattttctacgacaacctcgcgcccaagaagaaacccaaatcggtaaaagcaatg cacggaaagatcactctgggatggattggaaagggtgagtccggcagcaaacccaccagt gagaaggctggtgcctctctgaagaggaagctggatgaatgtcaatgctgtggcctgagt gatgcagaagaagggatggagactggggactggggcagtgaaagcatcagtgctcaaaat ccatttcctccccaggcaccaaggaaccggagagctcaggcccagttgtcagtcatgacc cctggatcagcccctgactctcatgccggagccatcagcctgaccctgaaaggcagagag agctgccagcaccctggtttcctggagctcagagctcccagacccaagcctcagaatgca gtcaccatcgctgtgtcctcccgagccttgtttcgcatggacgaggagcagcagatctac acggagcagggcgtggaggagtacgtgcgctaccagctggaacatgagaacgaacccttc agtcccgggccagccttcccttttgtgaaggctctggaggccgtgaacaggcggctgcgg gagctgtaccctgatagtgaggacgtcttcgacatcgtcctcatgactaacaaccatgct caagtgggtgtccgcctcatcaacagtatcaaccactatgacctgttcatcgagaggttc tgcatgacaggtgggaacagcccgatctgctacctcaaggcctatcacaccaacctctac ttgtcagccgatgcggaaaaagtgcgagaagccattgatgaggggatcgcagctgccacc atcttcagccccagcagggatgtggttgtgtcccagagtcagctgcgcgtggccttcgat ggggacgccgtgctcttctcggacgagtcggagcgcatcgtcaaggcccacgggctggac cgattcttcgagcatgagaaggcccacgagaacaaacctctggctcagggccccttaaag ggctttctggaggcactgggtaggttgcagaagaagttctactccaaaggcctgcggctg gagtgcccaattcgtacctacttggtgacagcacgcagtgcagccagttccggggcccgg gctctcaagaccctgcgcagctggggcctggagacagatgaagccttgttccttgctgga gcgcccaagggccctctccttgagaagatccgcccacacatcttctttgatgaccagatg ttccatgtggctggggctcaggagatgggcactgtggccgcccatgtgccttatggtgtg gcacagacaccccggcggactgcacctgcaaagcaggccccatctgcacagtag >gi568815597r:39582539_39784603|GENSCAN_predicted_peptide_4|301_aa MATTKRVLYVGGLAEEVDDKVLHAAFIPFGDITDIQIPLDYETEKHRGFAFVEFELAEDA AAAIDNMNESELFGRTIRVNLAKPMRIKEGSSRPVWSDDDWLKKFSGKTLEENKEEEGSE PPKAETQEGEPIAKKARSNPQVYMDIKIGNKPAGRIQMLLRSDVVPMTAENFRCLCTHEK GFGFKGSSFHRIIPQFMCQGGDFTNHNGTGGKSIYGKKFDDENFILKHTGPGLLSMANSG PNTNGSQFFLTCDKTDWLDGKHVVFGEVTEGLDVLRQIEAQGSKDGKPKQKVIIADCGEY V >gi568815597r:39582539_39784603|GENSCAN_predicted_CDS_4|906_bp atggccaccaccaagcgcgtcttgtacgtgggtggactggcagaggaagtggacgacaaa gttcttcatgctgcgttcattccttttggagacatcacagatattcagattcctctggat tatgaaacagaaaagcaccgaggatttgcttttgttgaatttgagttggcagaggatgct gcagcagctatcgacaacatgaatgaatctgagctttttggacgtacaattcgtgtcaat ttggccaaaccaatgagaattaaggaaggctcttccaggccagtttggtcagatgatgac tggttgaagaagttttctgggaagacgcttgaagagaataaagaggaagaagggtcagag cctcccaaagcagagacccaggagggagagcccattgctaaaaaggcccgctcaaatcct caggtgtacatggacatcaagattgggaacaagccggctggccgcatccagatgctcctg cgttctgatgtcgtgcccatgacagcagagaatttccgctgcctgtgcactcatgaaaag ggctttggctttaagggaagcagcttccaccgcatcatcccccagttcatgtgccagggc ggtgatttcacaaaccacaatggcactgggggcaagtccatctatgggaagaagttcgat gatgaaaactttatcctcaagcatacgggaccaggtctactatccatggccaactctggc ccaaacaccaatggctctcagttcttcctgacatgtgacaagacagactggctggatggc aagcatgtggtgtttggagaggtcaccgaaggcctagatgtcttgcggcaaattgaggcc cagggcagcaaggacgggaagccaaagcagaaggtgatcatcgccgactgtggggagtac gtgtga >gi568815597r:39582539_39784603|GENSCAN_predicted_peptide_5|767_aa MAALRLLASVLGRGVPAGGSGLALSQGCARCFATSPRLRAKFYADPVEMVKDISDGATVM IGGFGLCGIPENLIAALLRTRVKDLQVVSSNVGVEDFGLGLLLAARQVRRIVCSYVGENT LCESQYLAGELELELTPQGTLAERIRAGGAGVPAFYTPTGYGTLVQEGGAPIRYTPDGHL ALMSQPREVREFNGDHFLLERAIRADFALVKGWKADRAGNVVFRRSARNFNVPMCKAADV TAVEVEEIVEVGAFPPEDIHVPNIYVDRVIKGQKYEKRIERLTILKEEDGDAGKEEDART RIIRRAALEFEDGMYANLGIGIPLLASNFISPSMTVHLHSENGILGLGPFPTEDEVDADL INAGKQTVTVLPGGCFFASDDSFAMIRGGHIQLTMLGAMQVSKYGDLANWMIPGKKVKGM GGAMDLVSSQKTRVVVTMQHCTKDNTPKIMEKCTMPLTGKRCVDRIITEKAVFDVHRKKE LTLRELWEGLTVDDIKKSTGCAFAVSPNLRPMQQPAGARESCLRMGGETVNALLLGCAQQ NKHSLNLADATTSCLGDKFDIHTPETSTLLDLPRPRQRDRCCVCCRATARHSVDPGLAGL LGQRAPRSQQPFVVTFFRASPSPIRTPRAVRPLRRRQPKKSNELPQANRLPGIFDDVHGS HGRQVCRRHELYVSFQDLGWLDWVIAPQGYSAYYCEGECSFPLDSCMNATNHAILQSLVH LMMPDAVPKACCAPTKLSATSVLYYDSSNNVILRKHRNMVVKACGCH >gi568815597r:39582539_39784603|GENSCAN_predicted_CDS_5|2304_bp atggcggcgctgcggctcctggcgtcagtgctcgggcgcggggtccccgccggcggctca gggctcgcgctgtcccagggctgcgcccgctgctttgccaccagtccccggctccgtgcc aagttctacgcggacccggtggagatggtgaaggacatctctgacggggcgaccgtcatg atcgggggcttcgggctctgcgggatccccgagaacctgatcgccgcgctgctcaggacc cgcgtgaaagacctgcaggtggtcagcagcaacgtgggcgtggaggacttcggcctgggc ctcctgctggccgccaggcaggtccgtcgcatcgtctgttcctacgtgggcgagaacacc ctgtgcgagagccagtacctggcaggagagctggagctggagctcacgccccagggcacc ctggccgagcgcatccgcgcggggggcgccggggtgcccgccttctacacccccacgggc tacgggaccctggtccaggaagggggcgcccccatccgctacaccccggacggccacctg gcgctcatgagccagccccgagaggtgagggagttcaacggcgaccacttccttttggag cgcgccatccgggcagacttcgccctggtgaaagggtggaaggccgaccgggcaggaaac gtggtcttcaggagaagcgcccgcaatttcaacgtgcccatgtgcaaagctgcagacgtc acggcggtggaggtggaagagatcgtggaggtgggggctttccccccagaagacatccac gttcctaacatttatgtagatcgcgtgataaaggggcagaaatacgagaaacgaattgag cgcttaacgatcctgaaagaggaagatggagacgctggaaaggaagaggacgccaggacg cgcatcatcagacgcgcagctctggaatttgaggacggcatgtacgccaatctgggcata ggcatccccctgctggccagcaacttcatcagtcccagcatgactgtccatcttcacagt gagaacgggatcctgggcctgggcccgtttcccacggaagatgaggtggatgccgacctc atcaatgcaggcaagcagacggtcacggtgcttcccgggggctgcttcttcgccagcgac gactccttcgccatgatccgagggggacacatccaactaaccatgcttggagccatgcag gtttccaaatacggcgacctggcgaactggatgatccctggcaagaaggtgaaaggcatg ggcggtgccatggacttggtgtccagtcagaagaccagagtggtggtcaccatgcagcac tgcacaaaggacaacacccccaagatcatggagaaatgcaccatgccgctgaccgggaag cggtgcgtggaccgcatcatcaccgagaaggccgtgtttgacgtgcacaggaagaaagag ctgacgctgagggagctctgggagggcctgacggtggacgacatcaaaaagagcacgggg tgtgcctttgctgtgtccccgaacctcaggcccatgcagcagccagctggagccagggag tcctgcctaagaatgggtggggagacagttaatgccctgctcttgggctgtgcccagcag aacaagcactcgctcaacctggcagatgctaccacgagctgtctcggtgataagtttgac attcacaccccagaaacctccacgctcctggacctgcccaggccccggcagagagaccgc tgctgtgtctgctgcagggccactgcccggcacagcgtggatcctggcctggccggcctg ctgggtcaacgggccccacgctcccaacagcctttcgtggtcactttcttcagggccagt ccgagtcccatccgcacccctcgggcagtgaggccactgaggaggaggcagccgaagaaa agcaacgagctgccgcaggccaaccgactcccagggatctttgatgacgtccacggctcc cacggccggcaggtctgccgtcggcacgagctctacgtcagcttccaggacctcggctgg ctggactgggtcatcgctccccaaggctactcggcctattactgtgagggggagtgctcc ttcccactggactcctgcatgaatgccaccaaccacgccatcctgcagtccctggtgcac ctgatgatgccagacgcagtccccaaggcgtgctgtgcacccaccaagctgagcgccacc tctgtgctctactatgacagcagcaacaatgtcatcctgcgcaagcaccgcaacatggtg gtcaaggcctgcggctgccactga >gi568815597r:39582539_39784603|GENSCAN_predicted_peptide_6|130_aa MVPDTVERDRALGHQEPHWKEFRFDLTQIPAGEAVTAAEFRIYKVPSIHLLNRTLHVSMF QVVQEQSNRESDLFFLDLQTLRAGDEGWLVLDVTAASDCWLLKRHKDLGLRLYVETEDGD TGESLFYARQ >gi568815597r:39582539_39784603|GENSCAN_predicted_CDS_6|393_bp atggtgcctgacacagtggagcgagaccgtgccctgggccaccaggagccccattggaag gagttccgctttgacctgacccagatcccggctggggaggcggtcacagctgcggagttc cggatttacaaggtgcccagcatccacctgctcaacaggaccctccacgtcagcatgttc caggtggtccaggagcagtccaacagggagtctgacttgttctttttggatcttcagacg ctccgagctggagacgagggctggctggtgctggatgtcacagcagccagtgactgctgg ttgctgaagcgtcacaaggacctgggactccgcctctatgtggagactgaggacggtgac actggggagagcctcttttatgccaggcagtag