GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:51:56 Sequence gi568815597r:8941624_9158251 : 216628 bp : 48.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 2525 2564 40 -1.76 1.01 Init + 4264 4342 79 0 1 96 99 164 0.928 17.54 1.02 Intr + 7640 7819 180 0 0 80 110 156 0.931 16.84 1.03 Intr + 15514 15662 149 2 2 102 67 177 0.916 16.95 1.04 Intr + 17287 17379 93 0 0 114 99 99 0.996 13.76 1.05 Intr + 20964 21033 70 2 1 132 110 83 0.998 13.65 1.06 Intr + 26036 26193 158 0 2 99 95 93 0.949 10.83 1.07 Intr + 29244 29358 115 2 1 99 96 142 0.965 16.12 1.08 Intr + 40431 40546 116 1 2 36 74 90 0.088 2.57 1.09 Intr + 43787 43925 139 1 1 95 -1 58 0.367 -2.36 1.10 Term + 50027 50217 191 0 2 86 36 129 0.674 5.11 1.11 PlyA + 51336 51341 6 1.05 2.13 PlyA - 52012 52007 6 1.05 2.12 Term - 61895 61677 219 2 0 103 40 235 0.820 17.24 2.11 Intr - 63256 63129 128 2 2 131 109 109 0.997 17.70 2.10 Intr - 65762 65687 76 2 1 113 81 138 0.915 14.69 2.09 Intr - 68621 68520 102 2 0 123 41 215 0.926 20.67 2.08 Intr - 72012 71902 111 0 0 94 109 168 0.998 20.08 2.07 Intr - 73245 73058 188 1 2 88 80 326 0.994 31.21 2.06 Intr - 73619 73494 126 0 0 90 86 185 0.980 19.15 2.05 Intr - 76752 76600 153 1 0 82 91 165 0.999 16.24 2.04 Intr - 77710 77586 125 0 2 90 82 222 0.999 22.03 2.03 Intr - 81480 81295 186 2 0 41 97 94 0.587 4.40 2.02 Intr - 83451 83342 110 0 2 63 86 177 0.565 14.08 2.01 Init - 84722 84672 51 2 0 80 98 -5 0.737 0.86 2.00 Prom - 89769 89730 40 -3.96 3.15 PlyA - 91606 91601 6 1.05 3.14 Term - 96166 95963 204 1 0 107 41 335 0.997 28.07 3.13 Intr - 96401 96274 128 0 2 99 109 166 0.999 20.20 3.12 Intr - 96883 96808 76 1 1 112 62 114 0.780 10.29 3.11 Intr - 97045 96953 93 1 0 45 52 79 0.392 0.16 3.10 Intr - 97306 97205 102 1 0 97 94 114 0.999 13.27 3.09 Intr - 98039 97929 111 2 0 88 78 351 0.841 34.68 3.08 Intr - 98364 98177 188 1 2 76 6 486 0.629 38.61 3.07 Intr - 98566 98441 126 2 0 74 92 221 0.977 21.75 3.06 Intr - 100314 100162 153 1 0 97 101 202 0.998 22.44 3.05 Intr - 106111 105987 125 0 2 110 97 -32 0.393 0.13 3.04 Intr - 107647 107477 171 0 0 40 61 95 0.084 1.06 3.03 Intr - 115985 115825 161 2 2 79 110 49 0.212 4.99 3.02 Intr - 116627 116529 99 2 0 76 82 119 0.177 10.41 3.01 Init - 130169 130050 120 2 0 100 51 145 0.062 10.19 3.00 Prom - 134771 134732 40 -2.16 4.06 PlyA - 135024 135019 6 1.05 4.05 Term - 162621 162476 146 1 2 76 55 96 0.602 3.17 4.04 Intr - 163821 163656 166 0 1 93 65 -11 0.266 -3.37 4.03 Intr - 164057 163863 195 2 0 71 53 405 0.783 34.91 4.02 Intr - 169866 169653 214 2 1 130 66 379 0.534 38.62 4.01 Init - 187404 187022 383 0 2 85 94 685 0.903 64.14 4.00 Prom - 195225 195186 40 -3.06 5.05 PlyA - 195273 195268 6 1.05 5.04 Term - 203730 203672 59 1 2 109 55 23 0.586 -1.05 5.03 Intr - 205458 205374 85 0 1 45 82 81 0.297 2.59 5.02 Intr - 210212 210147 66 2 0 98 68 19 0.226 0.00 5.01 Init - 215886 215803 84 0 0 69 74 61 0.276 3.63 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 180394 180469 76 0 1 89 56 100 0.866 6.20 S.002 Term + 180569 180654 86 0 2 95 47 77 0.964 2.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:8941624_9158251|GENSCAN_predicted_peptide_1|429_aa MRALVLLLSLFLLGGQAQHVSDWTYSEGALDEAHWPQHYPACGGQRQSPINLQRTKVRYN PSLKGLNMTGYETQAGEFPMVNNGHTVQISLPSTMRMTVADGTVYIAQQMHFHWGGASSE ISGSEHTVDGIRHVIEIHIVHYNSKYKSYDIAQDAPDGLAVLAAFVEVKNYPENTYYSNF ISHLANIKYPGQRTTLTGLDVQDMLPRNLQHYYTYHGSLTTPPCTENVHWFVLADFVKLS RTQVWKLENSLLDHRNKTIHNDYRRTQPLNHRVVESNFPNQGRQPLETLSSVNITEHEIR VHTVLLETQATEECVIRHEEGGLAYGSGCTVALERICPVGGKGCPLGGIMVVSRNLLHWI PGLEEPHLCETQRLLFVLPSSQSMCFEWDVSACYNLLATRIVFRDRCDQGAQSGPTPECG GTQPQHVQN >gi568815597r:8941624_9158251|GENSCAN_predicted_CDS_1|1290_bp atgagggccctggtgcttctgctgtccctgttcctgctgggtggccaggcccagcatgtg tctgactggacctactcagaaggggcactggacgaagcgcactggccacagcactacccc gcctgtgggggccagagacagtcgcctatcaacctacagaggacgaaggtgcggtacaac ccctccttgaaggggctcaatatgacaggctatgagacccaggcaggggagttccccatg gtcaacaatggccacacagtgcagatcagcctgccctccaccatgcgcatgacagtggct gacggcactgtatacatagcccagcagatgcactttcactggggaggtgcgtcctcggag atcagcggctctgagcacaccgtggacgggatcagacatgtgatcgagattcacattgtt cactacaattctaaatacaagagctatgatatagcccaagatgcgccggatggtttggct gtactggcagccttcgttgaggtgaagaattaccctgaaaacacttattacagcaacttc atttctcatctggccaacatcaagtacccaggacaaagaacaaccctgactggccttgac gttcaggacatgctgcccaggaacctccagcactactacacctaccatggctcactcacc acgcctccctgcactgagaacgtccactggtttgtgctggcagattttgtcaagctctcc aggacacaggtttggaagctggagaattccttactggatcaccgcaacaagaccatccac aacgattaccgcaggacccagcccctgaaccacagagtggtggaatccaacttcccgaat cagggccgacaacctttggaaacactgagcagtgttaacatcactgaacacgaaataaga gtacacactgtgcttctggaaactcaggccacagaagaatgtgtaattcgccatgaagaa ggagggctcgcctatggaagcggctgcacagtggctctagaaagaatatgtcctgtagga gggaagggctgccctctgggtggaatcatggtggtctccagaaatctcctccactggatt cctgggctggaggagccacatctctgtgagacacagcggctgctcttcgtcctccccagt tctcagtccatgtgttttgagtgggatgtgtcagcgtgctacaacctgttggctacaagg attgttttccgggatagatgtgatcaaggagcccaatcaggtcccaccccagaatgtggg gggactcaaccacaacatgtacaaaactaa >gi568815597r:8941624_9158251|GENSCAN_predicted_peptide_2|524_aa MENKEAGTPPPIPSREGRLQPTLLLATLSAAFGSAFQYGYNLSVVNTPHKVGTSCGWGNV FQVFKSFYNETYFERHATFMDGKLMLLLWSCTVSMFPLGGLLGSLLVGLLVDSCGRKGTL LINNIFAIIPAILMGVSKVAKAFELIVFSRVVLGVCAGISYSALPMYLGELAPKNLRGMV GTMTEVFVIVGVFLAQIFSLQAILGNPAGWPVLLALTGVPALLQLLTLPFFPESPRYSLI QKGDEATARQALRRLRGHTDMEAELEDMRAEARAERAEGHLSVLHLCALRSLRWQLLSII VLMAGQQLSGINAINYYADTIYTSAGVEAAHSQYVTVGSGVVNIVMTITSAVLVERLGRR HLLLAGYGICGSACLVLTVVLLFQNRVPELSYLGIICVFAYIAGHSIGPSPVPSVVRTEI FLQSSRRAAFMVDGAVHWLTNFIIGFLFPSIQEAIGAYSFIIFAGICLLTAIYIYVVIPE TKGKTFVEINRIFAKRNRVKLPEEKEETIDAGPPTASPAKETSF >gi568815597r:8941624_9158251|GENSCAN_predicted_CDS_2|1575_bp atggagaacaaagaggcgggaacccctccacccattccatccagggaggggcggctccag ccgacgctgttgctggcgacactgagcgcggcctttggctcagccttccagtacggctac aacctctctgtggtcaacacgccgcacaaggtgggcacaagctgtggatggggcaatgtt ttccaggtcttcaagtcattttacaacgaaacctactttgagcgacacgcaacattcatg gacgggaagctcatgctgcttctatggtcttgcaccgtctccatgtttcctctgggcggc ctgttggggtcattgctcgtgggcctgctggttgatagctgcggcagaaaggggaccctg ctgatcaacaacatctttgccatcatccccgccatcctgatgggagtcagcaaagtggcc aaggcttttgagctgatcgtcttttcccgagtggtgctgggagtctgtgcaggcatctcc tacagcgcccttcccatgtacctgggagaactggcccccaagaacctgagaggcatggtg ggaacaatgaccgaggttttcgtcatcgttggagtcttcctagcacagatcttcagcctc caggccatcttgggcaacccggcaggctggccggtgcttctggcgctcacaggggtgccc gccctgctgcagctgctgaccctgcccttcttccccgaaagcccccgctactccctgatt cagaaaggagatgaagccacagcgcgacaagctctgaggaggctgagaggccacacggac atggaggccgagctggaggacatgcgtgcggaggcccgggccgagcgcgccgagggccac ctgtctgtgctgcacctctgtgccctgcggtccctgcgctggcagctcctctccatcatc gtgctcatggccggccagcagctgtcgggcatcaatgcgatcaactactatgcggacacc atctacacatctgcgggcgtggaggccgctcactcccaatatgtaacggtgggctctggc gtcgtcaacatagtgatgaccatcacctcggctgtccttgtggagcggctgggacggcgg cacctcctgctggccggctacggcatctgcggctctgcctgcctggtgctgacggtggtg ctcctattccagaacagggtccccgagctgtcctacctcggcatcatctgtgtctttgcc tacatcgcgggacattccattgggcccagtcctgtcccctcggtggtgaggaccgagatc ttcctgcagtcctcccggcgggcagctttcatggtggacggggcagtgcactggctcacc aacttcatcataggcttcctgttcccatccatccaggaggccatcggtgcctacagtttc atcatctttgccggaatctgcctcctcactgcgatttacatctacgtggttattccggag accaagggcaaaacatttgtggagataaaccgcatttttgccaagagaaacagggtgaag cttccagaggagaaagaagaaaccattgatgctgggcctcccacagcctctcctgccaag gaaacttccttttag >gi568815597r:8941624_9158251|GENSCAN_predicted_peptide_3|618_aa MTASRHAPVPALGSLQGGSTGRALVTPWLCLRRPVTGRNRRLTLVLALATLIAAFGSSFQ YGYNVAAVNSPALLMQQFYNETYYGRTGEFMEDFPLTLLWSVTVSMFPFGGFIGSLLVGP LVNKFGRCAVSISGSHVWQVSQCLGMLELHSLTASSIMRVLIMRAPLREAPPEKFALLLS GSPGKGALLFNNIFSIVPAILMGCSRVATSFELIIISRLLVGICAGVSSNVVPMYLGELA PKNLRGALGVVPQLFITVGILVAQIFGLRNLLANVDGWPILLGLTGVPAALQLLLLPFFP ESPRYLLIQKKDEAAAKKALQTLRGWDSVDREVAEIRQEDEAEKAAGFISVLKLFRMRSL RWQLLSIIVLMGGQQLSGVNAIYYYADQIYLSAGVPEEHVQYVTAGTGAVNVVMTFCAVF VVELLGRRLLLLLGFSICLIACCVLTAALALQLIAHVQLISQWLIEVALLGEEPGAILHP PGQDTVSWMPYISIVCVISYVIGHALGPSPIPALLITEIFLQSSRPSAFMVGGSVHWLSN FTVGLIFPFIQEGLGPYSFIVFAVICLLTTIYIFLIVPETKAKTFIEINQIFTKMNKVSE VYPEKEELKELPPVTSEQ >gi568815597r:8941624_9158251|GENSCAN_predicted_CDS_3|1857_bp atgacggcctcccgccacgcccccgtccccgcgctcggctccctccagggcggaagcacg ggtcgagcgttggtgacgccatggctgtgcttgcgacgccctgtcactggcaggaaccgg aggctgacgcttgtgcttgccctggcaaccctgatagctgcctttgggtcatccttccag tatgggtacaacgtggctgctgtcaactccccagcactgctcatgcaacaattttacaat gagacttactatggtaggaccggtgaattcatggaagacttccccttgacgttgctgtgg tctgtaaccgtgtccatgtttccatttggagggtttatcggatccctcctggtcggcccc ttggtgaataaatttggcaggtgtgctgttagtataagcggctctcacgtgtggcaagtt tctcagtgtttaggaatgctggagctccactcgctgacagcttcgtccatcatgagggtc ctcatcatgagggctccattgcgagaggcccctccagagaagtttgctttgcttctgtca ggatctccaggaaaaggggccttgctgttcaacaacatattttctatcgtgcctgcgatc ttaatgggatgcagcagagtcgccacatcatttgagcttatcattatttccagacttttg gtgggaatatgtgcaggtgtatcttccaacgtggtccccatgtacttaggggagctggcc cctaaaaacctgcggggggctctcggggtggtgccccagctcttcatcactgttggcatc cttgtggcccagatctttggtcttcggaatctccttgcaaacgtagatggctggccgatc ctgctggggctgaccggggtccccgcggcgctgcagctccttctgctgcccttcttcccc gagagccccaggtacctgctgattcagaagaaagacgaagcggccgccaagaaagcccta cagacgctgcgcggctgggactctgtggacagggaggtggccgagatccggcaggaggat gaggcagagaaggccgcgggcttcatctccgtgctgaagctgttccggatgcgctcgctg cgctggcagctgctgtccatcatcgtcctcatgggcggccagcagctgtcgggcgtcaac gctatctactactacgcggaccagatctacctgagcgccggcgtgccggaggagcacgtg cagtacgtgacggccggcaccggggccgtgaacgtggtcatgaccttctgcgccgtgttc gtggtggagctcctgggtcggaggctgctgctgctgctgggcttctccatctgcctcata gcctgctgcgtgctcactgcagctctggcactgcagctcatagcccacgttcagctgatt tcccagtggctcatcgaggtggcactgctgggggaggagccaggtgccatcctccaccca ccagggcaggacacagtgtcctggatgccatacatcagcatcgtctgtgtcatctcctac gtcataggacatgccctcgggcccagtcccatacccgcgctgctcatcactgagatcttc ctgcagtcctctcggccatctgccttcatggtggggggcagtgtgcactggctctccaac ttcaccgtgggcttgatcttcccgttcatccaggagggcctcggcccgtacagcttcatt gtcttcgccgtgatctgcctcctcaccaccatctacatcttcttgattgtcccggagacc aaggccaagacgttcatagagatcaaccagattttcaccaagatgaataaggtgtctgaa gtgtacccggaaaaggaggaactgaaagagcttccacctgtcacttcggaacagtga >gi568815597r:8941624_9158251|GENSCAN_predicted_peptide_4|367_aa MQPSPPPTELVPSERAVVLLSCALSALGSGLLVATHALWPDLRSRARRLLLFLSLADLLS AASYFYGVLQNFAGPSWDCVLQGALSTFANTSSFFWTVAIALYLYLSIVRAARGPRTDRL LWAFHVVSWGVPLVITVAAVALKKIGYDASDVSVGWCWIDLEAKDHVLWMLLTGKLWEML AYVLLPLLYLLVRKHINRAHTALSEYRPILSQEHRLLRHSSMADKKLVLIPLIFIGLRVW STVRFVLTLCGSPAVQTPVLVVLHRRHSLIPSRSTVAAPQCRPRHSCREELGPTVLCLPL CLRQDLEERRQSGISVVRGDPQAASSTGTSVLAHGWAFICIRVAVRFGKPVAHSCGHSAP TQWTLMS >gi568815597r:8941624_9158251|GENSCAN_predicted_CDS_4|1104_bp atgcagccgtccccgccgcccaccgagctggtgccgtcggagcgcgccgtggtgctgctg tcgtgcgcactctccgcgctcggctcgggcctgctggtggccacgcacgccctgtggccc gacctgcgcagccgggcacggcgcctgctgctcttcctgtcgctggccgacctgctctcg gccgcctcctacttctacggagtgctgcagaacttcgcgggcccgtcgtgggactgcgtg ctgcagggcgcgctgtccaccttcgccaacaccagctccttcttctggaccgtggccatt gcgctctacttgtacctcagcatcgtccgcgccgcgcgcgggcctcgcacagatcgcctg ctttgggccttccatgtcgtcagctggggggtcccgttggtcatcactgtggcagccgtc gccctgaagaagattggctatgacgcctcggacgtgtctgtgggctggtgctggatcgac ctggaggccaaggaccatgtcctgtggatgctgctgacggggaagctgtgggagatgctg gcatatgtgctgctgcctctgctgtacctcctggtccggaagcacatcaacagagcgcac acggcactctctgagtaccggcccatcctctcccaggagcaccgcctgctgcgccactcc tccatggcggacaagaagctggtgctcatcccgctcatcttcatcggcctcagggtctgg agcaccgtgcggttcgtgctgaccctctgtggctccccggccgtgcagacgccggtgctg gtggttctgcataggaggcacagcctgattccttcccgcagcacagtggctgcaccccag tgtcggccaaggcacagctgcagggaggagctcggccccactgtgctgtgccttcctctc tgcctgagacaggaccttgaagagagaaggcagagtgggatcagtgtggtccggggtgat cctcaggcagcaagttccaccggtaccagcgtcctggcccacgggtgggcgttcatctgc ataagggtagcagtgagattcgggaagccggtggcccacagctgtggccacagtgccccc acccagtggaccttgatgtcctga >gi568815597r:8941624_9158251|GENSCAN_predicted_peptide_5|97_aa MGAEDEVMTPHTRNLMFAVMVEACPWWGVLGRGRTGLSPESPPDAVDRPAQHYQVQSRGP PPRCLPFPGQDDPNFLMESAELASWVCSLDGHTGPGA >gi568815597r:8941624_9158251|GENSCAN_predicted_CDS_5|294_bp atgggagcagaggatgaggtcatgaccccacacacccgcaacctgatgtttgctgtcatg gtggaagcatgcccgtggtggggggtgctggggagaggcaggacaggcctgtcccccgag tcccctccggatgccgtggaccggccagctcagcactaccaggtacagtccaggggcccc ccaccaagatgcctgcccttccctggtcaagatgaccccaacttcctgatggaatcagca gagctggcttcatgggtgtgcagcctggatggccacacagggcccggtgcctga