GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:21:11 Sequence gi568815597r:229331502_229533115 : 201614 bp : 43.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 291 286 6 1.05 1.05 Term - 9990 9839 152 1 2 102 42 95 0.899 4.37 1.04 Intr - 11012 10598 415 2 1 70 80 691 0.869 60.18 1.03 Intr - 11297 11162 136 1 1 86 76 -17 0.511 -2.53 1.02 Intr - 11544 11411 134 0 2 75 94 65 0.912 5.24 1.01 Init - 13876 13778 99 1 0 63 64 61 0.559 1.46 1.00 Prom - 22945 22906 40 -5.56 2.00 Prom + 34864 34903 40 -2.86 2.01 Init + 36720 36777 58 2 1 74 88 27 0.598 2.85 2.02 Intr + 45184 45223 40 2 1 104 101 4 0.293 0.58 2.03 Intr + 45996 46207 212 0 2 101 42 59 0.151 1.16 2.04 Term + 47423 47541 119 0 2 73 44 124 0.214 5.20 2.05 PlyA + 51699 51704 6 1.05 3.00 Prom + 63049 63088 40 -2.46 3.01 Init + 71803 71856 54 0 0 88 95 -28 0.153 -0.82 3.02 Intr + 76243 76367 125 0 2 42 116 143 0.890 11.88 3.03 Intr + 80007 80091 85 0 1 133 73 15 0.617 4.32 3.04 Term + 84164 84358 195 1 0 55 33 109 0.251 -0.49 3.05 PlyA + 85139 85144 6 1.05 4.27 PlyA - 85847 85842 6 1.05 4.26 Term - 100141 99998 144 1 0 88 42 316 0.999 24.91 4.25 Intr - 100401 100220 182 1 2 105 98 366 0.999 38.89 4.24 Intr - 100684 100493 192 2 0 128 97 522 0.999 56.56 4.23 Intr - 100930 100769 162 2 0 92 85 406 0.999 40.65 4.22 Intr - 101379 101055 325 0 1 85 117 668 0.999 64.75 4.21 Intr - 101626 101486 141 1 0 101 71 274 0.989 27.55 4.20 Intr - 103381 103266 116 2 2 62 40 90 0.015 1.77 4.19 Intr - 113501 113413 89 1 2 81 115 86 0.939 10.21 4.18 Intr - 117553 117489 65 1 2 16 81 52 0.292 -5.18 4.17 Intr - 119104 119024 81 1 0 51 107 57 0.435 3.73 4.16 Intr - 129268 129110 159 1 0 79 116 43 0.949 6.38 4.15 Intr - 132175 132042 134 2 2 92 64 160 0.954 14.36 4.14 Intr - 133374 133123 252 1 0 75 88 316 0.995 27.71 4.13 Intr - 134018 133919 100 2 1 88 57 34 0.985 -0.02 4.12 Intr - 135255 135133 123 0 0 88 82 77 0.991 7.88 4.11 Intr - 139281 139079 203 1 2 0 94 141 0.307 4.90 4.10 Intr - 146259 146096 164 2 2 90 90 54 0.516 5.42 4.09 Intr - 152644 152553 92 1 2 105 75 48 0.948 3.99 4.08 Intr - 155027 154870 158 0 2 109 80 32 0.645 4.23 4.07 Intr - 156112 155935 178 1 1 62 85 40 0.319 0.59 4.06 Intr - 164546 164391 156 2 0 89 86 1 0.513 0.21 4.05 Intr - 166805 166635 171 2 0 98 98 -14 0.514 0.74 4.04 Intr - 169362 169255 108 0 0 61 96 18 0.180 0.38 4.03 Intr - 170601 170498 104 1 2 80 90 52 0.328 4.49 4.02 Intr - 174657 174539 119 2 2 21 100 66 0.271 1.21 4.01 Init - 176748 176567 182 0 2 62 115 80 0.247 4.82 4.00 Prom - 184317 184278 40 -6.16 5.08 PlyA - 184962 184957 6 1.05 5.07 Term - 186909 186678 232 2 1 49 49 182 0.707 6.35 5.06 Intr - 187374 187340 35 0 2 116 94 14 0.989 1.82 5.05 Intr - 190134 190091 44 1 2 76 121 40 0.996 4.06 5.04 Intr - 194615 194435 181 2 1 76 57 155 0.974 10.64 5.03 Intr - 195807 195728 80 1 2 82 110 45 0.879 5.37 5.02 Intr - 198907 198698 210 2 0 83 107 80 0.980 8.28 5.01 Intr - 200230 200135 96 2 0 96 76 36 0.645 3.18 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 49535 49420 116 0 2 73 51 152 0.940 8.73 S.002 Term - 110539 110403 137 2 2 108 32 80 0.886 2.58 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:229331502_229533115|GENSCAN_predicted_peptide_1|311_aa MQQEMYMTLRCQSRLLEDVTKTAQPTAHPTASYLERGPPRRHLAQEESARESELRFSCCA TRASGLGAEDRPGAAGRSPSLLRSPRPSPQPPCLPPTEQPSSAARPSTPAAPVPASALGP GRRAAGSEERLALEAADGTMSPGSGVKSEYMKRYQEPRWEEYGPCYRELLHYRLGRRLLE QAHAPWLWDDWGPAGSSEDSASSESSGAGGPAPRCAPPSPPPPVEPATQEEAERRARGAP EEQDAEAGDAEAEDAEDAALPDPKAPVALRVFSRHGPFTFFPAQRLAGLGTQLNASVQPG VTNGSFQITGF >gi568815597r:229331502_229533115|GENSCAN_predicted_CDS_1|936_bp atgcagcaggagatgtatatgaccttgagatgccagtccaggcttctggaggatgtgacc aagacggcccagcccacagcccaccccactgccagttatctggagcgcgggccgccgcgg cgacatcttgctcaggaggaaagcgcgcgggagagcgagctgcgctttagctgctgcgcc acgcgcgcctcgggcctgggcgcagaggatcggccgggcgcggcgggaaggagccccagt ctcctgcggtcccctcgccccagcccgcagcctccctgccttccgcccactgagcagccc tcctcggcggcgcgcccctcgaccccagcagccccggtccccgcctctgctcttggtccc ggccgccgggctgcgggcagcgaggagcggctggcgctcgaggcggcggacggcaccatg tccccggggagcggggtgaagagcgagtacatgaagcgctaccaggagccgcgctgggag gagtacgggccgtgctaccgcgagctgctgcactaccgcctaggccgccggctgctggag caggcgcacgcgccctggctctgggacgactggggcccggccggctcctcggaggactcg gcgtcgtcagagtcgtcgggcgccgggggccccgcaccccggtgcgccccgccctcgccc ccgccgcccgtagagccggcgacccaggaggaggcggaacggcgggcgcgcggggccccg gaggagcaggacgcggaggccggggacgcggaggccgaggacgcggaggacgcggctctg ccagatccgaaagcgccggtggcgctgcgcgtgttttcaaggcatggccccttcaccttc tttcctgcccagaggcttgcagggctggggacccagttgaacgcctcagtacagcctgga gtaaccaatggttccttccagatcactggattttaa >gi568815597r:229331502_229533115|GENSCAN_predicted_peptide_2|142_aa MVPGSASGEGFRLLLFVVRGQKDKEEPPSRPLREKGSLSLTHGGALSGHLCTGPLQGQMR QPEQNPLPAWPTKEPTTLHLAQPQPKAAPTRCAFRDLPGPLECESGITVIFPCHFKTYEY MPPVATQQYIIEHKQYLTDPSC >gi568815597r:229331502_229533115|GENSCAN_predicted_CDS_2|429_bp atggtgccaggatctgcttctggggagggcttcaggctgctgctattcgtggtgagaggg cagaaagataaagaggagcctccctccaggcctctgcgggagaagggaagtctctccctc acccatgggggagcactgagtggccacctctgcacaggccctctgcaggggcagatgagg cagcccgaacagaaccctctgcccgcctggccaaccaaagagcccacgacactccatctg gcccaaccgcagccgaaggcggcaccaactcgctgcgccttcagggacttgccagggcca cttgagtgtgaatctggcatcacagtcattttcccatgtcacttcaaaacctatgagtat atgcctccagtggccacacagcagtacatcatagagcacaagcagtacctcacagaccca tcatgctag >gi568815597r:229331502_229533115|GENSCAN_predicted_peptide_3|152_aa MEHILSTSPHVPSAWPHLERHPSRIRPRTPQLGTLTRLSRGFSGIRLQGNQLFAQTFKPS PRAQAFKQTLGPARLFEQVSHIYHSLEAPLFTFTWTNPDTHQAQQITWAVLLQGFTDIPH YFSQAQISTSSVTYLGIILIKTLVLSLLIVSG >gi568815597r:229331502_229533115|GENSCAN_predicted_CDS_3|459_bp atggagcacatattaagcacctcaccccacgtgccctcagcatggccccacctggaaagg caccctagccggattcgccctcggacgccccagctgggcacactcacgcggctctcccgc ggcttctcgggaattcgcctccagggcaatcagctcttcgcacaaacgttcaaaccaagc cccagagcacaggccttcaagcagaccctgggtccagccaggctttttgagcaggtttct catatttatcattccctcgaggcacctctcttcactttcacttggactaatcctgacacc catcaggctcagcaaattacctgggctgtactgctgcaaggcttcacagacatcccccat tacttcagtcaagcccaaatttcaacctcatctgttacctatctcggcataattctcata aaaacactcgtgctctccctgctgatcgtgtccggctaa >gi568815597r:229331502_229533115|GENSCAN_predicted_peptide_4|1299_aa MFPAAPSPRTPGTGSRRGPLAGLGPGSTPRTASRKGLPLGSAVSSPVLFSPVGRRSSLSS RGTPTRMFPHHSITESVNYDVKTFGSSLPVKVMEALTLAEVDDQLTINIDEGGWACLVCK EKLIIWKIALSPITKLSVCKELQLPPSDFHWSADLVALSYSSPSGEAHSTQGGSFILSSS GSQLIRLIPESSGKIHQHILPQGQGMLSGIGRKVSSLFGILSPSSDLTLSSVLWDRERSS FYSLTSSNISKWELDDSSEKHAYSWDINRALKENITDAIWSEDLILCQLTVPNFSNQTAY LYNESAVYVCSTGTGKFSLPQEKIVFNAQGTVAVLESMIGDSVLGAGACGGVPIIFSRNS GLVSITSRENVSILAEDLEGSLASSVAGPNSESMIFETTTKNETIAQEDKIKLLKAAFLQ YCRKDLGHAQMVVDELFSSHSDLDSDSELDRAVTQISVDLMDDYPASDPRWAESVPEGSF PVRGTPMATRLLLCEHAEKLSAAIVLKNHHSRLSDLVNTAILIALNKREYEIPSNLTPAD VFFREVSQVDTICECLLEHEEQVLRDAPMDSIEWAEVVINVNNILKDMLQAASHYRQNRN SLYRREESLEKEPEYVPWTATSGPGGIRTVIIRQHEIVLKVAYPQADSNLRNIVTEQLVA LIDCFLDGYVSQLKSVDKSSNRERYDNLEMEYLQKRSDLLSPLLSLGQYLWAASLAEKYC DFDILVQMCEQTDNQSRLQRYMTQFADQNFSDFLFRWYLEKGKRGKLLSQPISQHGQLAN FLQAHEHLSWLHEINSQELEKLYICEENRRANEYDFKKALDLLEYIDESITEDFCDQTCW DFPPPTLGNHWSSSDGKDDPIEVSKDSIFVKILQKLLKDGLSPGAPAPPPRRSALASPRQ LSILRQLRALRPPVALCAKLDTMCDEDETTALVCDNGSGLVKAGFAGDDAPRAVFPSIVG RPRHQGVMVGMGQKDSYVGDEAQSKRGILTLKYPIEHGIITNWDDMEKIWHHTFYNELRV APEEHPTLLTEAPLNPKANREKMTQIMFETFNVPAMYVAIQAVLSLYASGRTTGIVLDSG DGVTHNVPIYEGYALPHAIMRLDLAGRDLTDYLMKILTERGYSFVTTAEREIVRDIKEKL CYVALDFENEMATAASSSSLEKSYELPDGQVITIGNERFRCPETLFQPSFIGMESAGIHE TTYNSIMKCDIDIRKDLYANNVMSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERK YSVWIGGSILASLSTFQQMWITKQEYDEAGPSIVHRKCF >gi568815597r:229331502_229533115|GENSCAN_predicted_CDS_4|3900_bp atgttcccagccgccccttctccgcggaccccgggtaccgggtcccgaaggggcccgctg gccggactcgggcccggctccacgccccggacggctagcaggaagggtctgcccctgggg tctgcagtcagctccccagtgctcttctcgccggtcggccggcgtagctcgctaagctcg cggggaacaccaacacgaatgttcccacaccactccataactgagtctgtgaactatgat gtgaaaacgtttggatcttctcttcctgttaaagtcatggaagccctaacattggctgaa gtcgatgaccagctgaccattaacatagatgaaggtggatgggcttgtctggtgtgcaaa gagaagctcattatttggaagattgctctgtcacctattactaagttatccgtttgcaaa gaacttcagctgccacctagtgatttccactggagtgccgacttagtggctctttcttac tcttctccctcaggtgaagcacattctactcagggaggaagttttattttgtcttcatca ggaagccaactaattcggttgatacctgagagctcaggaaagattcatcagcatatcctg cctcaggggcaaggcatgctttcaggaattggtcgaaaagtttcttctctttttggaatt ttatctcctagtagtgatctcacactttcaagtgttctctgggatagagagagatcaagc ttttatagcctgacgagttcaaacatcagtaaatgggaattagatgattcttcagaaaag catgcatacagttgggatataaatagagccctgaaggaaaacattaccgatgctatttgg tctgaagacctgattttgtgtcagttgacggtcccaaacttttcaaaccagactgcctat ctgtataacgaaagtgctgtctatgtgtgctccacaggaactgggaaattttctcttccc caggagaaaattgtctttaatgcacaaggtacagtagcagttttagaaagcatgattgga gatagtgttttaggtgctggtgcctgtggtggtgttcctatcattttttctagaaacagt ggactggtgtctattacttcaagggaaaatgtgtctatattggcagaagacttggaaggg tctttagcatcttcagttgctggaccaaacagtgagagtatgatttttgagaccactaca aagaatgaaactatagcccaggaagataaaatcaagttgctgaaagctgcctttctgcaa tactgcagaaaagatttaggtcatgctcaaatggtggttgatgagctcttttcctctcac tctgatttggattctgattctgaactagacagggcagttacccaaatcagtgtagacctg atggatgactacccagcatctgacccacggtgggctgagtctgtccctgagggcagtttt ccagttagagggacaccgatggccactcgactgttgctctgtgagcatgccgaaaagctg tcagccgccattgttctcaagaaccaccactcccggctttctgaccttgtcaacacagcc atattgattgctttgaacaagagggagtatgaaatcccatccaacctgactcctgcagat gtctttttcagggaggtatcccaagtagataccatctgtgagtgcttactggagcatgag gagcaagtcttgagggatgcacctatggattccattgaatgggctgaagtggtgatcaat gtgaacaatattctcaaggatatgctgcaggctgctagtcattatcgccaaaatagaaac tctttgtatagaagagaagaatcactagaaaaagaacctgaatatgttccatggacggca acaagtggtcctggtggcatccgaacggtaataatacgccagcatgagattgtcctgaag gtggcttatccacaggcagacagcaacctccgaaacatcgtgaccgagcagctggtagcc ctgatcgattgcttcctggatggttatgtttctcagcttaagtctgtggataaatccagt aatcgggaaagatatgacaatctggagatggaatacctacagaaaagatcagatctctta tctcctcttctttcactaggccagtacctgtgggctgcttctctagcagagaaatactgt gactttgatatattggtacaaatgtgtgagcagactgacaaccagagccgactccagcgc tacatgacccagtttgctgatcagaatttttcagactttctcttccgttggtatctggag aaaggaaagcgaggcaaattattatctcagcccatttctcagcatggacagttggcaaat tttttgcaagctcatgaacatctcagctggttacatgaaattaatagccaagaattagaa aagctatatatctgtgaagaaaatagaagagctaatgaatatgatttcaagaaagctttg gacttgttggaatatattgatgagtcaataacagaagacttctgtgaccagacgtgttgg gatttcccacctcctacactgggcaaccactggtccagttctgatggcaaagatgatcca attgaagtatctaaagacagtatatttgtgaagatcttacagaaacttttaaaagatggc ctatccccgggagcccccgcgcctcctccccggcgctccgccctcgcctccccccgccag ttgtctatcctgcgacagctgcgcgccctccggccgccggtggccctctgtgcgaaacta gacacaatgtgcgacgaagacgagaccaccgccctcgtgtgcgacaatggctccggcctg gtgaaagccggcttcgccggggatgacgcccctagggccgtgttcccgtccatcgtgggc cgcccccgacaccagggcgtcatggtcggtatgggtcagaaagattcctacgtgggcgac gaggctcagagcaagagaggtatcctgaccctgaagtaccctatcgagcacggcatcatc accaactgggatgacatggagaagatctggcaccacaccttctacaacgagcttcgcgtg gctcccgaggagcaccccaccctgctcaccgaggcccccctcaatcccaaggccaaccgc gagaagatgacccagatcatgtttgagaccttcaacgtgcccgccatgtacgtggccatc caggccgtgctgtccctctacgcctccggcaggaccaccggcatcgtgctggactccggc gacggcgtcacccacaacgtgcccatttatgagggctacgcgctgccgcacgccatcatg cgcctggacctggcgggccgcgatctcaccgactacctgatgaagatcctcactgagcgt ggctactccttcgtgaccacagctgagcgcgagatcgtgcgcgacatcaaggagaagctg tgctacgtggccctggacttcgagaacgagatggcgacggccgcctcctcctcctccctg gaaaagagctacgagctgccagacgggcaggtcatcaccatcggcaacgagcgcttccgc tgcccggagacgctcttccagccctccttcatcggtatggagtcggcgggcattcacgag accacctacaacagcatcatgaagtgtgacatcgacatcaggaaggacctgtatgccaac aacgtcatgtcggggggcaccacgatgtaccctgggatcgctgaccgcatgcagaaagag atcaccgcgctggcacccagcaccatgaagatcaagatcatcgccccgccggagcgcaaa tactcggtgtggatcggcggctccatcctggcctcgctgtccaccttccagcagatgtgg atcaccaagcaggagtacgacgaggccggcccttccatcgtccaccgcaaatgcttctag >gi568815597r:229331502_229533115|GENSCAN_predicted_peptide_5|292_aa XLSSFYSELMKGLGAGGRLWELLEREPKLPFNEGVILNEKSFQGALEFKNVHFAYPARPE VPIFQDFSLSIPSGSVTALVGPSGSGKSTVLSLLLRLYDPASGTISLDGHDIRQLNPVWL RSKIGTVSQEPILFSCSIAENIAYGADDPSSVTAEEIQRVAEVANAVAFIRNFPQGFNTV VGEKGVLLSGGQKQRIAIARALLKNPKILLLDEATSALDAENEYLVQEALDRLMDGRTVL VIAHRLSTIKNANMVAVLDQGKITEYGKHEELLSKPNGIYRKLMNKQSFISA >gi568815597r:229331502_229533115|GENSCAN_predicted_CDS_5|879_bp ngtctgagctctttctactcggagctgatgaaaggactgggtgcaggggggcgcctctgg gagctcctggagagagagcccaagctgccttttaacgagggggtcatcttaaatgagaaa agcttccagggtgctttggagtttaagaacgtgcattttgcctatccagctcgcccagag gtgcccatatttcaggatttcagcctttccattccgtcaggatctgtcacggcactggtt ggcccaagtggttctggcaaatcaacagtgctttcactcctgctgaggttgtacgaccct gcttctggaactattagtcttgatggccatgacatccgtcagctaaacccagtgtggctg agatccaaaattgggacagtgagtcaggaacccattttgttttcttgctctattgctgag aacattgcttatggtgctgatgacccttcctctgtgaccgctgaggaaatccagagagtg gctgaagtggccaatgcagtggccttcatccggaatttcccccaagggttcaacactgtg gttggagaaaagggtgttctcctctcaggtgggcagaaacagcggattgcgattgcccgt gctctgctaaagaatcccaaaattcttctcctagatgaagcaaccagtgcgctggatgcc gaaaatgagtaccttgttcaagaagctctagatcgactgatggatggaagaacggtgtta gttattgcccatcgtctgtccaccattaagaatgctaatatggttgctgttcttgaccaa ggaaaaattactgaatatggaaaacatgaagagctgctttcaaaaccaaatgggatatac agaaaactaatgaacaaacaaagttttatttcagcataa