GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:34:18

Sequence gi568815597r:9004422_9229027 : 224606 bp : 49.52% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.11 Intr -    458    331  128  0  2  131  109   109 0.967  17.70
 1.10 Intr -   2964   2889   76  0  1  113   81   138 0.914  14.69
 1.09 Intr -   5823   5722  102  0  0  123   41   215 0.926  20.67
 1.08 Intr -   9214   9104  111  1  0   94  109   168 0.998  20.08
 1.07 Intr -  10447  10260  188  2  2   88   80   326 0.994  31.21
 1.06 Intr -  10821  10696  126  1  0   90   86   185 0.980  19.15
 1.05 Intr -  13954  13802  153  2  0   82   91   165 0.999  16.24
 1.04 Intr -  14912  14788  125  1  2   90   82   222 0.999  22.03
 1.03 Intr -  18682  18497  186  0  0   41   97    94 0.587   4.40
 1.02 Intr -  20653  20544  110  1  2   63   86   177 0.565  14.08
 1.01 Init -  21924  21874   51  0  0   80   98    -5 0.737   0.86
 1.00 Prom -  26971  26932   40                              -3.96

 2.15 PlyA -  28808  28803    6                               1.05
 2.14 Term -  33368  33165  204  2  0  107   41   335 0.997  28.07
 2.13 Intr -  33603  33476  128  1  2   99  109   166 0.999  20.20
 2.12 Intr -  34085  34010   76  2  1  112   62   114 0.780  10.29
 2.11 Intr -  34247  34155   93  2  0   45   52    79 0.392   0.16
 2.10 Intr -  34508  34407  102  2  0   97   94   114 0.999  13.27
 2.09 Intr -  35241  35131  111  0  0   88   78   351 0.841  34.68
 2.08 Intr -  35566  35379  188  2  2   76    6   486 0.629  38.61
 2.07 Intr -  35768  35643  126  0  0   74   92   221 0.977  21.75
 2.06 Intr -  37516  37364  153  2  0   97  101   202 0.998  22.44
 2.05 Intr -  43313  43189  125  1  2  110   97   -32 0.393   0.13
 2.04 Intr -  44849  44679  171  1  0   40   61    95 0.084   1.06
 2.03 Intr -  53187  53027  161  0  2   79  110    49 0.212   4.99
 2.02 Intr -  53829  53731   99  0  0   76   82   119 0.177  10.41
 2.01 Init -  67371  67252  120  0  0  100   51   145 0.062  10.19
 2.00 Prom -  71973  71934   40                              -2.16

 3.06 PlyA -  72226  72221    6                               1.05
 3.05 Term -  99823  99678  146  2  2   76   55    96 0.602   3.17
 3.04 Intr - 101023 100858  166  1  1   93   65   -11 0.266  -3.37
 3.03 Intr - 101259 101065  195  0  0   71   53   405 0.783  34.91
 3.02 Intr - 107068 106855  214  0  1  130   66   379 0.534  38.62
 3.01 Init - 124606 124224  383  1  2   85   94   685 0.903  64.14
 3.00 Prom - 132427 132388   40                              -3.06

 4.14 PlyA - 132475 132470    6                               1.05
 4.13 Term - 140932 140874   59  2  2  109   55    23 0.537  -1.05
 4.12 Intr - 142660 142576   85  1  1   45   82    81 0.233   2.59
 4.11 Intr - 147414 147349   66  0  0   98   68    19 0.058   0.00
 4.10 Intr - 153175 153005  171  1  0    2   74   110 0.013   1.14
 4.09 Intr - 157887 157674  214  2  1   94   71    59 0.021   3.52
 4.08 Intr - 177874 177586  289  2  1  157   97    96 0.908  13.70
 4.07 Intr - 178670 178422  249  0  0   71   34   118 0.026   2.11
 4.06 Intr - 188307 188186  122  2  2   84   75    80 0.268   6.44
 4.05 Intr - 191672 191597   76  0  1   80   81     4 0.223  -2.53
 4.04 Intr - 192600 192491  110  2  2    1  116    87 0.208   2.73
 4.03 Intr - 193128 193012  117  2  0    0  100    93 0.039   1.28
 4.02 Intr - 194571 194348  224  0  2   90    5   145 0.019   3.33
 4.01 Init - 197495 197304  192  2  0   54   76   155 0.018   9.90
 4.00 Prom - 203029 202990   40                              -2.86

 5.03 PlyA - 203828 203823    6                               1.05
 5.02 Term - 219045 218820  226  2  1 -238   50   512 0.601  11.05
 5.01 Intr - 219387 219059  329  0  2  -71   -1   851 0.919  54.80

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 117596 117671   76  1  1   89   56   100 0.866   6.20
S.002 Term + 117771 117856   86  1  2   95   47    77 0.964   2.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:9004422_9229027|GENSCAN_predicted_peptide_1|452_aa
MENKEAGTPPPIPSREGRLQPTLLLATLSAAFGSAFQYGYNLSVVNTPHKVGTSCGWGNV
FQVFKSFYNETYFERHATFMDGKLMLLLWSCTVSMFPLGGLLGSLLVGLLVDSCGRKGTL
LINNIFAIIPAILMGVSKVAKAFELIVFSRVVLGVCAGISYSALPMYLGELAPKNLRGMV
GTMTEVFVIVGVFLAQIFSLQAILGNPAGWPVLLALTGVPALLQLLTLPFFPESPRYSLI
QKGDEATARQALRRLRGHTDMEAELEDMRAEARAERAEGHLSVLHLCALRSLRWQLLSII
VLMAGQQLSGINAINYYADTIYTSAGVEAAHSQYVTVGSGVVNIVMTITSAVLVERLGRR
HLLLAGYGICGSACLVLTVVLLFQNRVPELSYLGIICVFAYIAGHSIGPSPVPSVVRTEI
FLQSSRRAAFMVDGAVHWLTNFIIGFLFPSIQ

>gi568815597r:9004422_9229027|GENSCAN_predicted_CDS_1|1356_bp
atggagaacaaagaggcgggaacccctccacccattccatccagggaggggcggctccag
ccgacgctgttgctggcgacactgagcgcggcctttggctcagccttccagtacggctac
aacctctctgtggtcaacacgccgcacaaggtgggcacaagctgtggatggggcaatgtt
ttccaggtcttcaagtcattttacaacgaaacctactttgagcgacacgcaacattcatg
gacgggaagctcatgctgcttctatggtcttgcaccgtctccatgtttcctctgggcggc
ctgttggggtcattgctcgtgggcctgctggttgatagctgcggcagaaaggggaccctg
ctgatcaacaacatctttgccatcatccccgccatcctgatgggagtcagcaaagtggcc
aaggcttttgagctgatcgtcttttcccgagtggtgctgggagtctgtgcaggcatctcc
tacagcgcccttcccatgtacctgggagaactggcccccaagaacctgagaggcatggtg
ggaacaatgaccgaggttttcgtcatcgttggagtcttcctagcacagatcttcagcctc
caggccatcttgggcaacccggcaggctggccggtgcttctggcgctcacaggggtgccc
gccctgctgcagctgctgaccctgcccttcttccccgaaagcccccgctactccctgatt
cagaaaggagatgaagccacagcgcgacaagctctgaggaggctgagaggccacacggac
atggaggccgagctggaggacatgcgtgcggaggcccgggccgagcgcgccgagggccac
ctgtctgtgctgcacctctgtgccctgcggtccctgcgctggcagctcctctccatcatc
gtgctcatggccggccagcagctgtcgggcatcaatgcgatcaactactatgcggacacc
atctacacatctgcgggcgtggaggccgctcactcccaatatgtaacggtgggctctggc
gtcgtcaacatagtgatgaccatcacctcggctgtccttgtggagcggctgggacggcgg
cacctcctgctggccggctacggcatctgcggctctgcctgcctggtgctgacggtggtg
ctcctattccagaacagggtccccgagctgtcctacctcggcatcatctgtgtctttgcc
tacatcgcgggacattccattgggcccagtcctgtcccctcggtggtgaggaccgagatc
ttcctgcagtcctcccggcgggcagctttcatggtggacggggcagtgcactggctcacc
aacttcatcataggcttcctgttcccatccatccag

>gi568815597r:9004422_9229027|GENSCAN_predicted_peptide_2|618_aa
MTASRHAPVPALGSLQGGSTGRALVTPWLCLRRPVTGRNRRLTLVLALATLIAAFGSSFQ
YGYNVAAVNSPALLMQQFYNETYYGRTGEFMEDFPLTLLWSVTVSMFPFGGFIGSLLVGP
LVNKFGRCAVSISGSHVWQVSQCLGMLELHSLTASSIMRVLIMRAPLREAPPEKFALLLS
GSPGKGALLFNNIFSIVPAILMGCSRVATSFELIIISRLLVGICAGVSSNVVPMYLGELA
PKNLRGALGVVPQLFITVGILVAQIFGLRNLLANVDGWPILLGLTGVPAALQLLLLPFFP
ESPRYLLIQKKDEAAAKKALQTLRGWDSVDREVAEIRQEDEAEKAAGFISVLKLFRMRSL
RWQLLSIIVLMGGQQLSGVNAIYYYADQIYLSAGVPEEHVQYVTAGTGAVNVVMTFCAVF
VVELLGRRLLLLLGFSICLIACCVLTAALALQLIAHVQLISQWLIEVALLGEEPGAILHP
PGQDTVSWMPYISIVCVISYVIGHALGPSPIPALLITEIFLQSSRPSAFMVGGSVHWLSN
FTVGLIFPFIQEGLGPYSFIVFAVICLLTTIYIFLIVPETKAKTFIEINQIFTKMNKVSE
VYPEKEELKELPPVTSEQ

>gi568815597r:9004422_9229027|GENSCAN_predicted_CDS_2|1857_bp
atgacggcctcccgccacgcccccgtccccgcgctcggctccctccagggcggaagcacg
ggtcgagcgttggtgacgccatggctgtgcttgcgacgccctgtcactggcaggaaccgg
aggctgacgcttgtgcttgccctggcaaccctgatagctgcctttgggtcatccttccag
tatgggtacaacgtggctgctgtcaactccccagcactgctcatgcaacaattttacaat
gagacttactatggtaggaccggtgaattcatggaagacttccccttgacgttgctgtgg
tctgtaaccgtgtccatgtttccatttggagggtttatcggatccctcctggtcggcccc
ttggtgaataaatttggcaggtgtgctgttagtataagcggctctcacgtgtggcaagtt
tctcagtgtttaggaatgctggagctccactcgctgacagcttcgtccatcatgagggtc
ctcatcatgagggctccattgcgagaggcccctccagagaagtttgctttgcttctgtca
ggatctccaggaaaaggggccttgctgttcaacaacatattttctatcgtgcctgcgatc
ttaatgggatgcagcagagtcgccacatcatttgagcttatcattatttccagacttttg
gtgggaatatgtgcaggtgtatcttccaacgtggtccccatgtacttaggggagctggcc
cctaaaaacctgcggggggctctcggggtggtgccccagctcttcatcactgttggcatc
cttgtggcccagatctttggtcttcggaatctccttgcaaacgtagatggctggccgatc
ctgctggggctgaccggggtccccgcggcgctgcagctccttctgctgcccttcttcccc
gagagccccaggtacctgctgattcagaagaaagacgaagcggccgccaagaaagcccta
cagacgctgcgcggctgggactctgtggacagggaggtggccgagatccggcaggaggat
gaggcagagaaggccgcgggcttcatctccgtgctgaagctgttccggatgcgctcgctg
cgctggcagctgctgtccatcatcgtcctcatgggcggccagcagctgtcgggcgtcaac
gctatctactactacgcggaccagatctacctgagcgccggcgtgccggaggagcacgtg
cagtacgtgacggccggcaccggggccgtgaacgtggtcatgaccttctgcgccgtgttc
gtggtggagctcctgggtcggaggctgctgctgctgctgggcttctccatctgcctcata
gcctgctgcgtgctcactgcagctctggcactgcagctcatagcccacgttcagctgatt
tcccagtggctcatcgaggtggcactgctgggggaggagccaggtgccatcctccaccca
ccagggcaggacacagtgtcctggatgccatacatcagcatcgtctgtgtcatctcctac
gtcataggacatgccctcgggcccagtcccatacccgcgctgctcatcactgagatcttc
ctgcagtcctctcggccatctgccttcatggtggggggcagtgtgcactggctctccaac
ttcaccgtgggcttgatcttcccgttcatccaggagggcctcggcccgtacagcttcatt
gtcttcgccgtgatctgcctcctcaccaccatctacatcttcttgattgtcccggagacc
aaggccaagacgttcatagagatcaaccagattttcaccaagatgaataaggtgtctgaa
gtgtacccggaaaaggaggaactgaaagagcttccacctgtcacttcggaacagtga

>gi568815597r:9004422_9229027|GENSCAN_predicted_peptide_3|367_aa
MQPSPPPTELVPSERAVVLLSCALSALGSGLLVATHALWPDLRSRARRLLLFLSLADLLS
AASYFYGVLQNFAGPSWDCVLQGALSTFANTSSFFWTVAIALYLYLSIVRAARGPRTDRL
LWAFHVVSWGVPLVITVAAVALKKIGYDASDVSVGWCWIDLEAKDHVLWMLLTGKLWEML
AYVLLPLLYLLVRKHINRAHTALSEYRPILSQEHRLLRHSSMADKKLVLIPLIFIGLRVW
STVRFVLTLCGSPAVQTPVLVVLHRRHSLIPSRSTVAAPQCRPRHSCREELGPTVLCLPL
CLRQDLEERRQSGISVVRGDPQAASSTGTSVLAHGWAFICIRVAVRFGKPVAHSCGHSAP
TQWTLMS

>gi568815597r:9004422_9229027|GENSCAN_predicted_CDS_3|1104_bp
atgcagccgtccccgccgcccaccgagctggtgccgtcggagcgcgccgtggtgctgctg
tcgtgcgcactctccgcgctcggctcgggcctgctggtggccacgcacgccctgtggccc
gacctgcgcagccgggcacggcgcctgctgctcttcctgtcgctggccgacctgctctcg
gccgcctcctacttctacggagtgctgcagaacttcgcgggcccgtcgtgggactgcgtg
ctgcagggcgcgctgtccaccttcgccaacaccagctccttcttctggaccgtggccatt
gcgctctacttgtacctcagcatcgtccgcgccgcgcgcgggcctcgcacagatcgcctg
ctttgggccttccatgtcgtcagctggggggtcccgttggtcatcactgtggcagccgtc
gccctgaagaagattggctatgacgcctcggacgtgtctgtgggctggtgctggatcgac
ctggaggccaaggaccatgtcctgtggatgctgctgacggggaagctgtgggagatgctg
gcatatgtgctgctgcctctgctgtacctcctggtccggaagcacatcaacagagcgcac
acggcactctctgagtaccggcccatcctctcccaggagcaccgcctgctgcgccactcc
tccatggcggacaagaagctggtgctcatcccgctcatcttcatcggcctcagggtctgg
agcaccgtgcggttcgtgctgaccctctgtggctccccggccgtgcagacgccggtgctg
gtggttctgcataggaggcacagcctgattccttcccgcagcacagtggctgcaccccag
tgtcggccaaggcacagctgcagggaggagctcggccccactgtgctgtgccttcctctc
tgcctgagacaggaccttgaagagagaaggcagagtgggatcagtgtggtccggggtgat
cctcaggcagcaagttccaccggtaccagcgtcctggcccacgggtgggcgttcatctgc
ataagggtagcagtgagattcgggaagccggtggcccacagctgtggccacagtgccccc
acccagtggaccttgatgtcctga

>gi568815597r:9004422_9229027|GENSCAN_predicted_peptide_4|657_aa
MPYCEIFLLQVTADRSSGRGSSGRGSSRRSQLEEEEEEEEEEEEEGEEEGRREGGREGDA
AHSQGQLEPKLQQALPRPPRPPARLPAARAKSAVLSAAGLQQQPAMRRPCTTAPGGARVP
TATSWRLRAAGAEIQTDARHSIAAASLPDKDKTCTFQAEPTGQLRSCDSCRDEGIRKRTQ
EAQGRPGIPWDMWDTDEIISAQPIRHPDSDAVGQAFPFGIYLCVLVRDLLFTVSVHFCVQ
KTWVMLSAQLTTVKGVMAANMYKNVAMSVRRIEGLVPVWQAPRDPRAVRGRASCPEARRA
KSPLQPAAPELGGAVSGGRAPGRRAVAVAGCRFLALEVLSHLVESGAALALRALRTGNVA
ETTGGGDAAVPSVWGQPSSPDPGLERRVAAPGPGGTSRKEDPAAGSAWACLGLFRAGLLL
GDHADRGHLEILRESCSQAPGQERPGSLHYLLAPSQHAPSFRFEIPVTPREMLQCLSGWG
AGHMVGAERGGHLCACTSWQVNAHLWRGLGAGSEPRSLRLCQAQRRQRAQQASPTHKDHW
DIVTCEGSARVRCALGAVGEMGAEDEVMTPHTRNLMFAVMVEACPWWGVLGRGRTGLSPE
SPPDAVDRPAQHYQVQSRGPPPRCLPFPGQDDPNFLMESAELASWVCSLDGHTGPGA

>gi568815597r:9004422_9229027|GENSCAN_predicted_CDS_4|1974_bp
atgccgtactgcgaaatctttctccttcaagtgactgcagaccgcagcagcggcagaggc
agcagcggcagaggcagcagcagaaggagtcagctggaggaggaggaggaggaggaggag
gaggaggaggaggaaggagaggaggagggaaggagggaaggagggagggaaggggatgca
gcccattctcaggggcagttagaacccaagctgcagcaggccctcccccgccctccccgc
ccgcctgcccgcctgcccgccgcccgggccaagtcagcagtgctgagcgctgcggggctg
cagcagcagccggcgatgcggagaccctgcaccactgctccgggcggcgctcgcgtcccg
acggccacctcctggcggctgcgcgcagccggcgccgagatccagacggacgcacggcac
agcattgcggctgccagtctccctgacaaggacaagacctgcaccttccaggcagagcca
actggccagctgaggtcctgcgactcctgcagggatgaaggtataaggaaaaggacacag
gaagctcagggcagacctgggattccctgggacatgtgggacacggatgagatcatcagc
gctcaacccatcaggcatcctgactcagatgcagtaggacaagcctttccctttgggatc
tatctctgtgtccttgttcgggacttgctgttcacagtctcagtgcacttttgtgtccaa
aagacatgggtgatgctgagtgcacagcttactacagtgaaaggagtgatggctgcaaac
atgtacaaaaatgtggccatgtctgtcagaaggatcgagggcttagtgccggtgtggcaa
gccccccgggacccgagagccgtgcggggccgagcctcgtgccccgaggcacgccgagcc
aagtcacccctgcagcccgcggcgccggaacttgggggcgcagtgagtgggggcagggcg
ccgggccggcgggcggtggccgtggctgggtgccggttcctggctttagaagtcctttcg
catcttgttgaatccggggctgcattggccctccgagctctccggactggcaacgtcgca
gagacaacaggtggaggagatgccgctgtcccgtcggtctggggacagcccagctccccg
gatcccgggctggagagacgcgtcgcggccccggggcctggtggcacgagcaggaaggag
gacccggcggcgggctctgcctgggcttgcctgggcttgttccgagccgggctgcttctc
ggtgaccacgcagatcgggggcatttggagattttgcgggagtcctgcagccaagctccg
gggcaggagaggcctggaagcctgcactacctgctcgccccgtcccagcatgcacccagc
ttcaggtttgaaattccagtgacaccgagggagatgctccagtgcctgtccggctggggg
gcaggtcacatggtgggagcagagaggggagggcacttgtgcgcctgcacctcttggcag
gtgaatgcacacctatggaggggcctgggtgctggcagcgagccccgctccctgcggctg
tgccaggcccagaggcggcagcgagcccaacaggcctctccaacacataaagatcattgg
gacatagtcacgtgtgaagggtctgcccgtgtccgctgtgcattaggtgctgttggggaa
atgggagcagaggatgaggtcatgaccccacacacccgcaacctgatgtttgctgtcatg
gtggaagcatgcccgtggtggggggtgctggggagaggcaggacaggcctgtcccccgag
tcccctccggatgccgtggaccggccagctcagcactaccaggtacagtccaggggcccc
ccaccaagatgcctgcccttccctggtcaagatgaccccaacttcctgatggaatcagca
gagctggcttcatgggtgtgcagcctggatggccacacagggcccggtgcctga

>gi568815597r:9004422_9229027|GENSCAN_predicted_peptide_5|184_aa
KKKKKRRKRKRKKRKRRKKKKKKRKKKKKKKSKKKTKKKKKEEKDEEVEEEKDEEEEEEE
EKDKEEVEEKDEEEEKKKEEEEKDEEEEVEEKDEEEDEKKKKKKRTKKKKRRRRRKEEEE
GRRRKGRRRRKRRRRRRRKKKKTKKKEEEKDKEEVEEKDEEEEDEEDDEEEDKDKEEEEE
EGHT

>gi568815597r:9004422_9229027|GENSCAN_predicted_CDS_5|555_bp
aagaagaagaagaagagaaggaagaggaagagaaagaagaggaagaggaggaagaagaag
aaaaagaagaggaagaagaagaagaagaaaaagtcaaagaagaagacaaagaagaagaag
aaagaagaaaaagatgaagaagtagaagaagaaaaagatgaagaagaagaagaagaagaa
gaaaaggacaaagaagaagtagaagaaaaagatgaagaagaagaaaagaagaaggaagaa
gaagaaaaggatgaagaagaagaagtagaagaaaaagatgaagaagaagatgaaaagaag
aagaagaagaaaaggacgaagaagaagaaacgaagaagaagaagaaaagaagaagaagaa
ggaagaagaagaaaaggacgacgaagaagaaaaagaagaagaagaagaagaaggaagaag
aaaaagacgaagaagaaggaagaagaaaaagacaaagaagaagtagaagaaaaagatgaa
gaagaagaagatgaagaagatgacgaagaagaagacaaagacaaagaagaagaagaagaa
gaaggccacacctag