GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:32:27 Sequence gi568815577r:6345561_6563871 : 218311 bp : 49.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.25 PlyA - 2094 2089 6 1.05 1.24 Term - 4202 4065 138 2 0 65 42 96 0.219 0.66 1.23 Intr - 7979 7913 67 1 1 91 99 43 0.156 4.61 1.22 Intr - 30464 30371 94 0 1 84 71 67 0.000 3.62 1.21 Intr - 84122 84000 123 0 0 90 58 66 0.009 4.36 1.20 Intr - 90940 90853 88 1 1 89 76 92 0.517 7.64 1.19 Intr - 92242 92155 88 0 1 43 77 63 0.272 0.67 1.18 Intr - 100101 100002 100 1 1 118 -4 234 0.005 16.37 1.17 Intr - 103056 102921 136 0 1 107 65 213 0.765 21.04 1.16 Intr - 104371 104077 295 0 1 122 85 188 0.514 18.91 1.15 Intr - 105086 104952 135 1 0 121 44 228 0.999 21.38 1.14 Intr - 105421 105344 78 0 0 118 99 147 0.997 17.47 1.13 Intr - 105928 105848 81 0 0 98 68 49 0.830 2.65 1.12 Intr - 106664 106559 106 0 1 100 66 212 0.999 19.47 1.11 Intr - 108513 108429 85 0 1 54 101 176 0.999 14.89 1.10 Intr - 108862 108629 234 1 0 11 64 131 0.634 0.89 1.09 Intr - 109196 109071 126 2 0 76 90 276 0.961 27.48 1.08 Intr - 110109 110018 92 1 2 40 96 138 0.741 9.51 1.07 Intr - 111390 111321 70 0 1 99 74 134 0.999 11.85 1.06 Intr - 111639 111505 135 0 0 90 94 164 0.999 17.96 1.05 Intr - 111813 111734 80 1 2 122 98 120 0.999 15.67 1.04 Intr - 112495 112361 135 2 0 81 80 289 0.999 27.94 1.03 Intr - 114532 114338 195 2 0 29 76 88 0.593 1.09 1.02 Intr - 114733 114627 107 0 2 89 65 155 0.845 13.16 1.01 Init - 118311 118103 209 0 2 86 105 105 0.550 10.35 1.00 Prom - 119040 119001 40 -9.26 2.00 Prom + 119874 119913 40 -9.26 2.01 Init + 120735 120848 114 2 0 50 116 124 0.338 10.94 2.02 Intr + 122864 122936 73 1 1 57 71 81 0.358 2.28 2.03 Intr + 125768 125996 229 0 1 59 64 202 0.791 11.83 2.04 Intr + 127656 127744 89 0 2 77 72 70 0.656 3.91 2.05 Intr + 129824 129873 50 0 2 75 80 24 0.235 -1.30 2.06 Term + 133700 133834 135 1 0 117 44 69 0.394 3.42 2.07 PlyA + 134529 134534 6 1.05 3.10 PlyA - 135938 135933 6 1.05 3.09 Term - 139356 139209 148 2 1 93 52 103 0.681 4.57 3.08 Intr - 140670 140578 93 2 0 60 81 123 0.983 7.88 3.07 Intr - 140895 140762 134 0 2 87 84 234 0.999 22.34 3.06 Intr - 141645 141547 99 0 0 82 97 174 0.987 18.01 3.05 Intr - 141852 141803 50 1 2 118 78 51 0.973 5.50 3.04 Intr - 147550 147484 67 1 1 113 93 11 0.695 2.58 3.03 Intr - 153263 153143 121 1 1 -23 41 144 0.405 -0.90 3.02 Intr - 153664 153569 96 0 0 57 89 207 0.686 16.82 3.01 Init - 167102 165976 1127 2 2 60 53 298 0.510 17.17 3.00 Prom - 168360 168321 40 -4.66 4.00 Prom + 183150 183189 40 -5.26 4.01 Init + 186447 186583 137 2 2 69 84 113 0.925 8.61 4.02 Intr + 188311 188440 130 1 1 67 41 75 0.835 1.40 4.03 Term + 189958 190038 81 0 0 96 33 96 0.770 2.59 4.04 PlyA + 191789 191794 6 1.05 5.00 Prom + 198766 198805 40 -5.66 5.01 Init + 203583 203650 68 2 2 69 86 68 0.409 5.25 5.02 Intr + 206889 207094 206 0 2 109 75 67 0.698 6.34 5.03 Intr + 212029 212135 107 2 2 90 65 53 0.359 3.13 5.04 Intr + 215133 215411 279 2 0 52 94 464 0.603 41.07 5.05 Intr + 216641 216763 123 1 0 61 99 333 0.761 32.48 5.06 Intr + 218195 218251 57 1 0 125 -10 108 0.345 3.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 26868 26806 63 0 0 102 44 81 0.960 2.99 S.002 Intr - 86598 86488 111 2 0 107 51 106 0.884 8.29 S.003 Term - 100101 99998 104 1 2 118 45 247 0.995 21.84 S.004 Sngl - 200353 199877 477 1 0 71 43 191 0.843 7.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577r:6345561_6563871|GENSCAN_predicted_peptide_1|998_aa MPSETPQAEVGPTGCPHRSGPHSAKGSLEKGSPEDKEAKEPLWIRPDAPSRCTWQLGRPA SESPHHHTAPAKSPKILPDILKKIGDTPMVRINKIGKKFGLKCELCKALGSENSLVNEWP RTYQLKFGKNTQQGIVSEGDQSDSPADGQGNEQVTFELRPEWKQRGCEGPVAKCEFFNAG GSVKDRISLRMIEDAERDGTLKPGDTIIEPTSGNTGIGLALAAAVRGYRCIIVMPEKMSS EKVDVLRALGAEIVRTPTNARFDSPESHVGVAWRLKNEIPNSHILDQYRNASNPLAHYDT TADEILQQCDGKLDMLVASVGTGGTITGIARKLKEKCPGCRIIGVDPEGSILAEPEELNQ TEQTTYEVEGIGYDFIPTVLDRTGCSPGGTGREQGKRVKIKAPEKSVKSQKADFGLRESR DRRAGVGHTDAQWGWRAVLLSAKRVGTGKSADVPIPGGEAGVVDKWFKSNDEEAFTFARM LIAQEGLLCGGSAGSTVAVAVKAAQELQEGQRCVVILPDSVRNYMVSSMACKLQRLHSSR ELSCPWRARPGWTKFLSDRWMLQKGFLKEEDLTEKKPWWWHLRVQELGLSAPLTVLPTIT CGHTIEILREKGFDQAPVVDEAGVILGMVTLGNMLSSLLAGKVQPSDQVGKVIYKQFKQV PSHLQAAQTDAQSPTGSSNRCPVTYTQLKQVRGHLQAAQTGERSPTGSSNRYPVTYRQLK QKTRGAMFPLPLTTLPLQIRLTDTLGRLSHILEMDHFALVVHEQIQYHSTGKSSQRQMVF GVVTAIDLLNFVAAQERDQNNGLNYALASSYLKDGDRAAVDTNKGFSPQALEPLVSAQTQ PLSGAGTHSSHKHRHLVACVHASSVTPAATGEAAVSVSAQVCATHDVSANNGLSRPRRSR CKPCRSWNQWEDVGNQRLKALQNHHANMELPHQSLLTPTIEGAFVTVRRYRALRPLCGCS YHSPVLAVVAGLVHLVFFQKEGGVGHIKKLLINLNAAL >gi568815577r:6345561_6563871|GENSCAN_predicted_CDS_1|2997_bp atgccttctgagaccccccaggcagaagtggggcccacaggctgcccccaccgctcaggg ccacactcggcgaaggggagcctggagaaggggtccccagaggataaggaagccaaggag cccctgtggatccggcccgatgctccgagcaggtgcacctggcagctgggccggcctgcc tccgagtccccacatcaccacactgccccggcaaaatctccaaaaatcttgccagatatt ctgaagaaaatcggggacacccctatggtcagaatcaacaagattgggaagaagttcggc ctgaagtgtgagctctgcaaggccctgggatcagaaaactcgctggttaatgagtggcca agaacatatcagcttaagtttgggaaaaacacacaacagggcattgtgtcagagggtgac caaagtgactccccagcagatggacaagggaacgaacaggtgacctttgaactgaggccg gagtggaagcagcgagggtgtgaagggccggtggccaagtgtgagttcttcaacgcgggc gggagcgtgaaggaccgcatcagcctgcggatgattgaggatgctgagcgcgacgggacg ctgaagcccggggacacgattatcgagccgacatccgggaacaccgggatcgggctggcc ctggctgcggcagtgaggggctatcgctgcatcatcgtgatgccagagaagatgagctcc gagaaggtggacgtgctgcgggcactgggggctgagattgtgaggacgcccaccaatgcc aggttcgactccccggagtcacacgtgggggtggcctggcggctgaagaacgaaatcccc aattctcacatcctagaccagtaccgcaacgccagcaaccccctggctcactatgacacc accgctgatgagatcctgcagcagtgtgatgggaagctggacatgctggtggcttcagtg ggcacgggcggcaccatcacgggcattgccaggaagctgaaggagaagtgtcctggatgc aggatcattggggtggatcccgaagggtccatcctcgcagagccggaggagctgaaccag acggagcagacaacctacgaggtggaagggatcggctacgacttcatccccacggtgctg gacaggacgggatgtagcccaggtggcacaggcagggagcagggaaaacgtgtgaaaatt aaggccccggaaaagtctgttaagagccaaaaagctgactttggcctgagggagagcaga gaccgaagggccggtgtgggtcacacggatgctcagtggggctggcgggcagttttgctg tctgcaaaacgtgttggaactggaaagtctgcagacgtgccgatcccagggggagaggct ggggtggtggacaagtggttcaagagcaacgatgaggaggcgttcacctttgcccgcatg ctgatcgcgcaagaggggctgctgtgcggtggcagtgctggcagcacggtggcggtggcc gtgaaggccgcgcaggagctgcaggagggccagcgctgcgtggtcattctgcccgactca gtgcggaactacatggtttcctccatggcctgcaagctgcagcggctacacagctcccga gagctcagctgcccttggagggccaggcctggctggaccaagttcctgagcgacaggtgg atgctgcagaagggctttctgaaggaggaggacctcacggagaagaagccctggtggtgg cacctccgtgttcaggagctgggcctgtcagccccgctgaccgtgctcccgaccatcacc tgtgggcacaccatcgagatcctccgggagaagggcttcgaccaggcgcccgtggtggat gaggcgggggtaatcctgggaatggtgacgcttgggaacatgctctcgtccctgcttgcc gggaaggtgcagccgtcagaccaagttggcaaagtcatctacaagcagttcaaacaggta cccagtcacctacaggcagctcaaacagatgcgcagtcacctacaggcagctcaaacagg tgcccggtcacctacacgcagctcaaacaggtgcgcggtcacctacaggcagctcaaaca ggtgagcggtcacctacaggcagctcaaacaggtacccggtcacctacaggcagctcaaa cagaaaactcgtggggccatgttccccctgccactgaccacgcttcccttgcagatccgc ctcacggacacgctgggcaggctctcgcacatcctggagatggaccacttcgccctggtg gtgcacgagcagatccagtaccacagcaccgggaagtccagtcagcggcagatggtgttc ggggtggtcaccgccattgacttgctgaacttcgtggccgcccaggagcgggaccagaac aatggcttgaattatgccctggcatcaagctacctaaaggatggagacagagccgctgtg gacacgaacaaaggcttttctccgcaggctctggagccgctggtgagcgcacagacacag cccctctccggggcaggcactcacagcagccacaagcaccgccacctcgtggcctgtgtc cacgcttcttctgtaacccctgccgccacaggcgaggctgcagtgagcgtctctgcacag gtctgtgctacccacgatgtttcagccaacaacggactgagcagaccaaggaggtcccgg tgtaaaccatgtcgttcgtggaaccagtgggaagatgttggcaaccaaaggttgaaagcc ttgcaaaaccatcatgcaaacatggagttgccacatcaaagtctgctcacaccaaccata gaaggagcctttgtcactgtcagaagatacagagctttgagacccctatgtggctgctct taccacagcccggtgctggctgtggttgctggcttagtgcacctggtctttttccaaaaa gagggaggagttggccacatcaagaagctcctcatcaatctgaatgcagctctgtaa >gi568815577r:6345561_6563871|GENSCAN_predicted_peptide_2|229_aa MLRRRIFQKAVDPRPRQTQAAQRLAFGPSPRPGALSRKRKTRTVPISSGQRNVLRLRSVP GSEYNTYAESAHVMSGQRINRYPTGGRAEGRGLGLHHPLVAGRLGPQNWRRLEEEEEEEE GKEEEEEGKEEEEEGKEERIFLTLKEGHDLTIPLQKRKTSVQEQGAELGTGPASLHAVWA QADPQEICPHCSTRQDVPSSSSFRTYLLAAAAHLYLHHYLLLYSLLFPD >gi568815577r:6345561_6563871|GENSCAN_predicted_CDS_2|690_bp atgctccgccgccgaatcttccagaaagccgtggacccgaggccccggcagacgcaggcg gcccagcgccttgcttttggcccctcgcctcgccctggagccctctctcgcaagcgaaag acacggacggtgcccatcagctctgggcagaggaacgtgctgcgcctccgcagcgtcccc ggctcagagtataacacgtatgcagaaagtgcacatgtcatgagtggacagaggataaac cgctaccccactggggggcgagcagaggggagaggcctgggcctgcaccatcccctggtg gccggaaggctgggccctcagaactggaggagattggaggaggaggaggaggaagaggaa gggaaggaggaggaagaggaagggaaggaggaggaagaggaagggaaggaggagaggatc ttcctaacactgaaggaggggcacgacctgaccatccctttacagaagagaaagacgagt gtgcaggaacagggagcagagctgggcacagggccagcgtccctgcatgctgtgtgggcc caagctgacccgcaggaaatctgccctcactgctccacgcgccaggacgtgccctcctcc tcctctttccgcacctatctcctggctgcggctgctcacttgtacctgcaccattactta ctgctgtattctttgctttttccagactga >gi568815577r:6345561_6563871|GENSCAN_predicted_peptide_3|644_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSCIGRINIV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRTHIAKTILSQKNKAGGITLP DFKLYYKATVTKTAWYWYQNRDIDKWNRTEPSETIPHIYNHLIFDKPDKNKKWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKD FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYSSDKGLISRI YKELKQTYKKKTNNPIKKWAKDMNRHFSKEDIYAANRHMKKCLSSLAIREMQIKTIMRYH LTPVRMAIIKKSGNNRVGGVGSSVDGSGGGGWEMAEYLASIFGTEKDNRRLPEAASSRAE SCLEHPGLLDVDTFVPTFGCECPVEQALTILIQNIYRNPQNSAQTADGSHCAVSDVEMQE HYDEFFEEVFTEMEEKYGEVEEMNVCDNLGDHLVGNVYVKFRREEDAEKAVIDLNNRWFN GQPIHAELSPVTDFREACCRQYEMGECTRGGFCNFMHLKPISRELRRELYGRRRKKHRSR SRSRERRSRSRDRGRGGGGGGGGGGGGRERDRRRSRDRERSGRF >gi568815577r:6345561_6563871|GENSCAN_predicted_CDS_3|1935_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagagatgtgaaggacctcttcaaggagaactacaaaccactgctcaaagaaataaaa gaggacacaaacaaatggaagaacattccatgctcatgcataggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagtttatatggaaccaaaaaaga acccacattgccaagacaatcctaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagacaaatggaatagaacagagccctcagaaacaataccacacatctacaac catctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcgagatggattaaagacttaaatgttagacctaaaacc ataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagagactaccatcagagtgaacaggcaacct acagaatgggagaaaatttttgcaatctactcatctgacaaagggctaatatccagaatc tacaaagaactcaaacaaacttacaagaaaaaaacaaacaaccccatcaaaaagtgggca aaggatatgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaa aaatgcttatcatcactggccatcagagaaatgcaaatcaaaaccataatgagataccat ctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacagggtcggcggcgtc ggcagcagtgtcgacggcagcggcggcggcgggtgggaaatggcggagtatctggcctcc atcttcggcaccgagaaagacaatcgccggctcccggaagccgcctcgagccgcgctgaa agttgcctggagcatccgggccttttggacgtggacacgtttgttccaactttcggctgt gaatgtcctgtcgagcaagctctgaccatcttgattcaaaacatctatcgtaatccccaa aacagtgcacagacggctgacggctcacactgtgccgtgagcgatgtggagatgcaggaa cactatgatgagttttttgaggaggtttttacagaaatggaggagaagtatggggaagta gaggagatgaacgtctgtgacaacctgggagaccacctggtggggaacgtgtacgtcaag tttcgccgtgaggaagatgcggaaaaggctgtgattgacttgaataaccgttggtttaat ggacagccgatccacgccgagctgtcacccgtgacggacttcagagaagcctgctgccgt cagtatgagatgggagaatgcacacgaggcggcttctgcaacttcatgcatttgaagccc atttccagagagctgcggcgggagctgtatggccgccgtcgcaagaagcatagatcaaga tcccgatcccgggagcgtcgttctcggtctagagaccgtggtcgtggcggtggcggtggc ggtggtggaggtggcggcggacgggagcgtgacaggaggcggtcgagagatcgtgaaaga tctgggcgattctga >gi568815577r:6345561_6563871|GENSCAN_predicted_peptide_4|115_aa MIYPIFGMSVSDGGGSGGGEITESMSAKHTLLGAPPPTAQLKDKRPQGDSRKGDGVTGQD PSYEWELDQELSVLSHLLPVLGPLSETLQAMELTILPLAAVVLFRGTVIAFISIL >gi568815577r:6345561_6563871|GENSCAN_predicted_CDS_4|348_bp atgatctacccaatatttgggatgtccgtgagtgatggcggtggtagtggtggtggtgaa atcacggaatcaatgtctgcaaagcacacgctcttgggagcacctcctcccaccgcacag ctgaaagacaagcgaccacagggagacagcagaaagggagatggagtcaccggccaggac cccagctatgagtgggagctggaccaggagttgagtgtcctgagccacctcctccctgtg ctgggtccactctcagaaacactccaggccatggagctgaccatcctgcctctggcagct gtggtgctgtttaggggaacggtcattgccttcatctccattttatag >gi568815577r:6345561_6563871|GENSCAN_predicted_peptide_5|280_aa MPTTLVRPEITEESEHPGDAACPSPMQPTLGPAQPLGRRGWSPCAVPSAAHPAERPQAAG LHTCDRPEAPAAGPTRLQFPSRQPMALRPLPDSGAGSLRSPLQARPILWQSLDVPRRPLS RQALQRVGGLGLGSTLRCPEAPLTPASLQVPVVPKLNMDVTIQHPWFKRTLGPFYPSRLF DQFFGEGLFEYDLLPFLSSTISPYYRQSLFRTVLDSGISEVRSDRDKFVIFLDVKHFSPE DLTVKVQDDFVEIHGKHNERQDDHGYISREFHRRYRLPSN >gi568815577r:6345561_6563871|GENSCAN_predicted_CDS_5|840_bp atgcccaccacgctcgtgaggccggagatcaccgaggaatccgagcacccgggggacgct gcatgcccgagccccatgcagcccacactgggtcctgcccagcccctgggccggagggga tggtccccctgcgccgtccccagcgctgcccaccctgcagaacgtcctcaggcggccggg ctccacacctgcgacaggcccgaggccccggctgcaggccccactcggctgcagttcccg tcgaggcagccaatggccttgaggcctctcccagactcaggcgctggctcactcagaagc cccctgcaggcccggccaatcctgtggcagagcctcgacgtcccacggcggcctctgagc cgccaggccctacagcgtgtgggagggctcggccttggctccacactgcgctgcccagag gccccgctgactcctgccagcctccaggtccccgtggtaccaaagctgaacatggatgtg accatccagcacccctggttcaagcgcaccctggggcccttctaccccagccggctgttc gaccagtttttcggcgagggcctttttgagtatgacctgctgcccttcctgtcgtccacc atcagcccctactaccgccagtccctcttccgcaccgtgctggactccggcatctctgag gttcgatccgaccgggacaagttcgtcatcttcctcgatgtgaagcacttctccccggag gacctcaccgtgaaggtgcaggacgactttgtggagatccacggaaagcacaacgagcgc caggacgaccacggctacatttcccgtgagttccaccgccgctaccgcctgccgtccaac