GENSCAN 1.0    Date run: 5-Nov-116    Time: 11:14:36

Sequence gi568815597r:202496760_202705772 : 209013 bp : 45.30% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -   3539   3534    6                               1.05
 1.04 Term -   3821   3757   65  0  2  117   54     8 0.168  -1.65
 1.03 Intr -   4691   4591  101  1  2   43   98    79 0.162   4.15
 1.02 Intr -  19157  19090   68  2  2   72   64    62 0.010  -0.10
 1.01 Init -  31630  31571   60  1  0   74   66    37 0.304   1.35
 1.00 Prom -  45877  45838   40                              -0.06

 2.00 Prom +  47573  47612   40                              -4.36
 2.01 Init +  64675  64692   18  0  0   83  103    29 0.974   4.00
 2.02 Intr +  67684  67788  105  0  0  106  110    91 0.900  13.51
 2.03 Intr +  72388  72438   51  0  0   65   86    88 0.892   5.40
 2.04 Term +  83715  83801   87  2  0  140   44   136 0.970  12.06
 2.05 PlyA +  86141  86146    6                               1.05

 3.13 PlyA -  86783  86778    6                               1.05
 3.12 Term -  95145  94981  165  0  0   40   36    91 0.058  -2.98
 3.11 Intr -  96728  96579  150  2  0  106   74    53 0.541   6.06
 3.10 Intr - 100204 100028  177  1  0  145   30   401 0.697  40.12
 3.09 Intr - 101760 101629  132  0  0  129  -48    96 0.617   0.94
 3.08 Intr - 102592 102459  134  2  2   83   53   340 0.989  30.36
 3.07 Intr - 103715 103598  118  2  1   83   98   203 0.999  20.84
 3.06 Intr - 105298 105131  168  1  0  124   94   247 0.996  29.04
 3.05 Intr - 105786 105619  168  0  0  136   48   210 0.770  21.94
 3.04 Intr - 106368 106240  129  0  0  137   85   151 0.998  20.59
 3.03 Intr - 107862 107696  167  1  2   67   91   271 0.994  24.98
 3.02 Intr - 109030 108836  195  2  0  116   76   208 0.841  21.79
 3.01 Init - 112479 112347  133  0  1   78   47    67 0.605   1.90
 3.00 Prom - 113258 113219   40                              -4.46

 4.00 Prom + 116181 116220   40                              -3.76
 4.01 Init + 131749 132055  307  0  1   78   53   120 0.033   4.96
 4.02 Intr + 142990 143094  105  2  0  102   16    81 0.022   2.49
 4.03 Term + 150147 150175   29  1  2   93   48    73 0.251   2.04
 4.04 PlyA + 150411 150416    6                               1.05

 5.00 Prom + 153592 153631   40                              -4.56
 5.01 Init + 159849 159942   94  2  1   85   97    53 0.774   6.44
 5.02 Intr + 160264 160396  133  2  1   97   71    51 0.971   4.00
 5.03 Intr + 160926 161013   88  0  1   86   94    45 0.673   4.87
 5.04 Intr + 161764 161788   25  0  1   78   78    25 0.294  -1.90
 5.05 Intr + 163630 163672   43  2  1   55   94    91 0.806   3.70
 5.06 Intr + 165601 165782  182  1  2   57   88   132 0.798   9.61
 5.07 Intr + 166615 166829  215  2  2  133  113    25 0.940   7.93
 5.08 Intr + 173165 173298  134  1  2   42   52    78 0.412  -0.96
 5.09 Term + 175517 175706  190  2  1   65   54   113 0.308   2.52
 5.10 PlyA + 175709 175714    6                               1.05

 6.00 Prom + 179960 179999   40                              -4.16
 6.01 Init + 185135 185249  115  1  1   61   93   149 0.848  13.07
 6.02 Term + 205517 205638  122  0  2  115   39    65 0.023   2.94
 6.03 PlyA + 206075 206080    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  78243  78377  135  2  0   89   32   171 0.943   9.62

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:202496760_202705772|GENSCAN_predicted_peptide_1|97_aa
MTELTGIQQPQIVLFEHKGHKLVQGSSSDPGKVNRIYQHYEARIGLENSSRANRENSKKA
IDKTQCKENGIQAKDADSSNHSPTLYFYELNFLRSYI

>gi568815597r:202496760_202705772|GENSCAN_predicted_CDS_1|294_bp
atgactgaattgactggcatacaacaaccacaaattgttctctttgagcacaaagggcat
aaattggttcagggaagttcatctgatccaggaaaggtcaatcggatataccaacactac
gaggccaggattggcttggaaaacagctccagagcaaacagggaaaacagtaagaaagca
atagataaaactcaatgtaaagaaaatggaatccaggccaaagatgcagactccagtaac
cacagtcctactctctacttctatgagctcaactttttaagatcctacatatga

>gi568815597r:202496760_202705772|GENSCAN_predicted_peptide_2|86_aa
MSTVIVLYESALTENQKLKTKLQEAQLELADIKSKLEKVAQERRALERKMSEMEEEMKVL
TELKSDNQRLKDENGALIRVISKLSK

>gi568815597r:202496760_202705772|GENSCAN_predicted_CDS_2|261_bp
atgtccacggtcatagtgctctatgagagtgctctgactgaaaaccaaaaactgaaaaca
aaacttcaggaagcccagctagagctagcagatataaagtccaagcttgagaaggtggcc
caggagaggcgagccttggagcgcaaaatgtcagaaatggaggaagaaatgaaggtgtta
acagaactgaaatccgacaaccagaggctgaaagatgaaaatggtgccctcatcagagtc
atcagcaaactgtccaagtag

>gi568815597r:202496760_202705772|GENSCAN_predicted_peptide_3|611_aa
MEYYAAIKNDEFMSFVGTWMKLETIILSKLSQGQKTKHRMFSLIAVSSATMRNIFKRNQE
PIVAPATTTATMPIGPVDNSTESGGAGESQEDMFAKLKEKLFNEINKIPLPPWALIAIAV
VAGLLLLTCCFCICKKCCCKKKKNKKEKGKGMKNAMNMKDMKGGQLPQDDDDAETGLTEG
EGEGEEEKEPENLGKLQFSLDYDFQANQLTVGVLQAAELPALDMGGTSDPYVKVFLLPDK
KKKYETKVHRKTLNPAFNETFTFKVPYQELGGKTLVMAIYDFDRFSKHDIIGEVKVPMNT
VDLGQPIEEWRDLQGGEKEEPEKLGDICTSLRYVPTAGKLTVCILEAKNLKKMDVGGLSD
PYVKIHLMQNGKRLKKKKTTVKKKTLNPYFNESFSFEIPFEQIQEGWGQGYKRIFRVGSV
IYGDVYDVIDGILQGNTGQLSDNVNPREKVQVVVTVLDYDKLGKNEAIGKIFVGSNATGT
ELRHWSDMLANPRRPIAQWHSLKPEEEESIHNLCLQNQCGMILLAQGFSEREKRGHYSPP
VGKSSSLINLPPCSGVGFSSISTLGEEQTAVGLHGSGYLSWGPCGGNQDFQEQPVSSSAR
DDRTIPAYSPL

>gi568815597r:202496760_202705772|GENSCAN_predicted_CDS_3|1836_bp
atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg
aagctggaaaccatcattctcagcaaactatcgcaaggacaaaaaaccaaacaccgcatg
ttctcactcatagctgtttcctctgccaccatgaggaacattttcaagaggaaccaggag
cctattgtggctcctgccaccaccaccgccacgatgcccattggacccgtggacaactcc
actgagagtgggggtgctggggagagccaggaggacatgtttgccaaactgaaggagaag
ttattcaatgagataaacaagattcccttaccaccctgggcactgatcgccattgctgtg
gttgctgggctcctgcttctcacctgctgcttctgcatctgcaagaaatgctgctgcaag
aagaagaagaacaagaaggagaagggcaaaggcatgaagaatgccatgaacatgaaggac
atgaaagggggtcagctcccacaggatgacgacgacgcagagacaggcctgactgagggg
gaaggtgaaggggaggaggagaaagagccagagaacctgggcaaactgcagttttccctg
gactatgattttcaggctaatcagcttactgtgggcgttctgcaggctgctgaactgcct
gccctggacatgggaggcacctcagacccttatgtcaaggtcttcctccttcctgacaag
aagaagaaatatgagaccaaagtccatcggaagacactgaaccctgccttcaatgaaacc
ttcaccttcaaggtgccataccaggagcttgggggcaaaactctggtgatggccatctat
gactttgaccgcttctccaaacatgacatcattggagaggtaaaggtgcctatgaacaca
gtggacctcggccagcccattgaggagtggagagacctgcaaggcggggaaaaggaggag
ccggagaagctgggcgacatctgcacctccctgcgctatgtgcccacggccgggaagctc
actgtctgcatcctggaggctaagaacctcaagaagatggacgtgggcggcctttcagac
ccgtacgtgaagatccacctgatgcagaatggcaagaggctcaagaagaagaagacaacc
gtgaagaagaagaccctgaacccatacttcaacgagtccttcagctttgagatccccttc
gagcagattcaggagggctgggggcaagggtacaagagaatcttcagagttggttctgtc
atctatggagatgtgtatgatgttattgatggaatacttcaagggaacactgggcagctc
agtgataatgttaatccaagagaaaaagtccaggtagtggtcaccgtgctggactatgac
aagctgggcaagaacgaagccataggcaagatcttcgtgggcagcaatgccacgggcaca
gagctgcggcactggtccgacatgctggccaacccccggaggcccatcgcccagtggcac
tcgctcaagcctgaggaggaggaaagcatacataacttgtgtctgcagaatcagtgtggg
atgattttgctggcccaaggcttcagcgagagggagaagagaggtcactacagccctcct
gtgggtaaaagcagctctcttataaacctgcctccatgcagtggggtggggttcagctcc
atctctacgctgggcgaggagcagacagcagtgggactccatggttctggatacctttcc
tggggtccctgtggaggcaaccaggattttcaggagcagccagtcagcagctcagccagg
gatgacagaaccatccctgcttactcacctctgtag

>gi568815597r:202496760_202705772|GENSCAN_predicted_peptide_4|146_aa
MPIITNTYDSSWSPETKVEHASPGGTIRCSKEASSLLPDRGPGQTWEIGARRDEPWPQNT
HGLEEGSSEGENSKLESLNLEFYVDPNLEDSSAEEEEPSETPGPMTKTRACKLLEPEASA
QQEEDTGRWGILGDRVHGFLDKVDSS

>gi568815597r:202496760_202705772|GENSCAN_predicted_CDS_4|441_bp
atgcccattatcaccaacacctatgacagctcatggtctccagaaactaaagtggagcat
gcaagtcccgggggcacaattaggtgctccaaagaagccagctctctcctgcctgaccgg
gggccgggacagacatgggagataggggccaggagggatgaaccatggcctcaaaacacc
catgggctggaggaggggagcagtgagggggagaattccaaattggaaagcctcaacttg
gaattctatgtggatccaaacctagaagacagctccgctgaggaagaggagccaagtgaa
acaccaggccccatgacgaagactagagcatgcaaactcctggagcctgaggccagtgcg
cagcaagaggaggacacaggacgttggggcattttgggggatcgtgtgcatggcttcctg
gacaaggtggacagcagctga

>gi568815597r:202496760_202705772|GENSCAN_predicted_peptide_5|367_aa
MVRSLDELDHMLTLTYHTYVEGRTSCHSHDIVFSIASALLAKFFRAQLIFVYVLFKMGCP
EKHSTPYIRANCPNSSFWVQKLSAVSIDNSPGIPSLSLTSTGSQQAMTAAARGANGGYNF
CYRYYPSRAPYIQGKKVNPLISNDAQGSIWKTAHQLLKLTEAKGSSIVYLDYDSADTGRA
KCYPSTGSGCWASPSAPSTQYFPKKTPKTFAFKPAPLPESPVSVHSTVVFLFTPLRNLSG
NILNRSFIMTISSNLHIGTKLILIGVKAEGVMNTKLAGPDPLNNSATHTPTFISTPVLTD
ICQHSVMVFGSGTFGRELGLDEAMKWGSSAMGRGCSQKADICEPGRGPSPRTTSASTLIL
DFQSPEL

>gi568815597r:202496760_202705772|GENSCAN_predicted_CDS_5|1104_bp
atggtgcgaagcttagatgagctggaccacatgttaactttgacataccacacatatgtg
gagggccgaacctcgtgccacagccatgacatagtgttttccattgcctctgccctgctg
gccaaattcttccgggcacagctcatttttgtgtatgtcctttttaaaatgggatgtcca
gagaagcacagcacaccctacataagagccaactgtccaaattccagtttctgggtccag
aaactgagtgcagtcagcatagacaacagccctgggatcccctcactgtccttgacaagc
actggaagccagcaggctatgacagcagctgccagaggggctaatggtggctacaacttc
tgttaccgctactaccccagcagagccccatacattcaaggcaagaaagttaaccctcta
attagtaatgatgctcagggcagcatctggaagacagcacatcaattactaaaactcact
gaggccaagggctcctccatagtctacctggactatgacagcgcagatactggcagggcc
aagtgctacccttccaccggctcaggttgctgggcgtcaccatctgcaccaagtacacaa
tacttccccaagaaaactccaaaaacctttgccttcaaaccggctcctcttcctgaatcc
cctgtgtctgtccacagcaccgttgtgtttctgttcactccacttcgaaacctcagcggc
aacattttaaatagatcttttattatgactatatcctcaaacttacacataggtacaaag
cttatcctcattggtgtaaaagcagagggcgtgatgaatacaaagcttgcaggtccagat
ccattgaacaacagtgccacccacaccccaacattcatcagcacacctgtactcactgac
atatgccagcactctgtgatggtatttggaagtgggacctttgggagagaattaggttta
gatgaggccatgaaatgggggtcctctgccatgggaagaggatgtagccagaaggctgac
atctgcgagccaggaagagggccctcacccagaaccacatctgccagcaccttgatctta
gacttccagtctccggaactgtga

>gi568815597r:202496760_202705772|GENSCAN_predicted_peptide_6|78_aa
MSLRGDQIQDRDLISYPFKKPTMELLTAPPDEARATVAAAAAATSTVPSIKELIQFLSCA
SPFASCNARIDLETGQPY

>gi568815597r:202496760_202705772|GENSCAN_predicted_CDS_6|237_bp
atgagcctccgtggggatcagatccaagaccgtgacctcatcagctaccccttcaaaaag
ccaaccatggagctgctgacagctccacctgatgaggccagggccacggtggctgctgca
gcagcagcaaccagcacagttcccagcatcaaagagctaatccaatttctcagctgtgct
tcccccttcgcctcctgcaatgccaggattgatttggaaacagggcagccttattaa