GENSCAN 1.0 Date run: 2-Nov-116 Time: 20:08:55 Sequence gi568815596f:24991324_25245093 : 253770 bp : 48.10% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 27023 27085 63 1 0 53 72 66 0.018 2.65 1.02 Intr + 36101 36194 94 1 1 35 80 87 0.013 2.14 1.03 Intr + 40823 41068 246 0 0 75 85 92 0.036 5.03 1.04 Intr + 43950 44061 112 1 1 88 80 65 0.072 5.14 1.05 Term + 52478 52508 31 2 1 125 48 56 0.181 2.63 1.06 PlyA + 55783 55788 6 1.05 2.02 PlyA - 56174 56169 6 1.05 2.01 Sngl - 89987 89475 513 2 0 66 45 353 0.993 24.84 2.00 Prom - 93336 93297 40 -3.46 3.00 Prom + 95664 95703 40 -5.36 3.01 Init + 97663 97710 48 0 0 107 46 70 0.591 3.73 3.02 Intr + 99923 100078 156 1 0 98 90 19 0.327 3.31 3.03 Intr + 101680 101807 128 0 2 91 106 204 0.157 21.98 3.04 Intr + 112314 112464 151 0 1 102 99 301 0.250 32.76 3.05 Intr + 130350 130471 122 2 2 86 89 194 0.993 18.59 3.06 Intr + 134961 135008 48 0 0 99 85 29 0.703 1.50 3.07 Intr + 136860 137009 150 0 0 100 99 119 0.955 13.48 3.08 Intr + 138652 138786 135 1 0 97 84 140 0.999 14.18 3.09 Intr + 139229 139307 79 2 1 86 80 58 0.999 4.35 3.10 Intr + 140045 140180 136 1 1 96 82 261 0.806 26.44 3.11 Intr + 140427 140588 162 1 0 73 41 425 0.997 36.25 3.12 Intr + 141580 141691 112 2 1 109 72 145 0.985 14.44 3.13 Intr + 142060 142111 52 1 1 76 99 8 0.969 -0.39 3.14 Intr + 144144 144316 173 2 2 89 66 224 0.997 19.04 3.15 Intr + 145200 145275 76 0 1 26 92 137 0.729 7.42 3.16 Intr + 146018 146179 162 1 0 97 75 394 0.989 39.17 3.17 Intr + 147736 147867 132 0 0 64 73 181 0.991 15.04 3.18 Intr + 150043 150110 68 0 2 68 94 62 0.994 2.50 3.19 Intr + 152412 152539 128 0 2 114 80 152 0.999 17.32 3.20 Intr + 153637 153728 92 2 2 94 105 91 0.983 11.11 3.21 Intr + 158371 158419 49 0 1 121 111 80 0.998 11.85 3.22 Intr + 160576 160697 122 2 2 18 114 227 0.921 18.51 3.23 Intr + 162389 162438 50 1 2 103 131 48 0.977 8.08 3.24 Term + 162912 163017 106 0 1 120 38 47 0.905 0.78 3.25 PlyA + 164851 164856 6 1.05 4.05 PlyA - 164973 164968 6 -0.45 4.04 Term - 165746 165594 153 2 0 109 48 114 0.966 7.42 4.03 Intr - 166049 165926 124 1 1 99 84 21 0.628 3.39 4.02 Intr - 170429 169762 668 2 2 107 44 1411 0.102 129.13 4.01 Init - 173449 173318 132 1 0 74 92 210 0.990 18.14 4.00 Prom - 175589 175550 40 -5.76 5.04 PlyA - 175985 175980 6 1.05 5.03 Term - 177106 176999 108 1 0 52 37 94 0.099 -0.79 5.02 Intr - 177746 177604 143 0 2 98 93 44 0.033 5.97 5.01 Init - 213271 213082 190 1 1 95 58 144 0.382 9.31 5.00 Prom - 232550 232511 40 -4.56 6.12 PlyA - 232860 232855 6 1.05 6.11 Term - 238844 238736 109 1 1 115 44 81 0.732 4.38 6.10 Intr - 243097 242960 138 0 0 146 47 94 0.810 10.68 6.09 Intr - 244502 244384 119 2 2 73 75 67 0.900 3.16 6.08 Intr - 245682 245613 70 2 1 96 82 48 0.791 4.18 6.07 Intr - 247892 247807 86 2 2 91 77 72 0.821 5.02 6.06 Intr - 249165 248979 187 2 1 68 65 213 0.600 16.69 6.05 Intr - 249407 249313 95 2 2 94 49 139 0.582 9.36 6.04 Intr - 250384 250239 146 2 2 52 50 273 0.960 19.90 6.03 Intr - 252659 252575 85 2 1 63 94 60 0.960 3.49 6.02 Intr - 253015 252832 184 0 1 104 59 202 0.996 18.69 6.01 Intr - 253329 253217 113 0 2 83 94 254 0.999 24.68 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 170429 169758 672 2 0 107 55 1437 0.898 136.06 S.002 Init + 174255 174399 145 2 1 54 70 114 0.867 6.48 S.003 Term + 174456 174586 131 1 2 92 43 107 0.917 4.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:24991324_25245093|GENSCAN_predicted_peptide_1|181_aa MFEIPETEKLKYLKVAGCGEEDTDCLFIPLDIVIVYNTDNTMMIGSEDLEVPAMSTSSLY LWNPGFFHGSGVELKNLPCSKLPRGCLVQAEGKPSVNHLLQFPSELPRRVTLARLICYFI CCHITNLQLSYLQSDQQEEEMEATPWTFTSFRVWVGLGFAVRASITQDGDGRGSSLKLQI I >gi568815596f:24991324_25245093|GENSCAN_predicted_CDS_1|546_bp atgtttgagatacctgaaactgaaaagctgaagtatttgaaagtggctggctgtggggag gaagacactgattgtctttttatcccattggacatcgtgatagtctacaatactgataac accatgatgataggatctgaagacctggaagtaccagccatgtccaccagttccctctac ctgtggaacccaggcttcttccatgggtctggagtggagctcaagaatctgccttgcagc aagctccccaggggctgcttggtgcaagctgaagggaagccctctgtgaatcacctcctt cagttccccagcgagcttcctagaagagtgacattggccagactcatctgctacttcatc tgttgccacatcacaaacctgcagcttagctacttgcaatctgatcagcaggaagaagag atggaagcaacgccatggaccttcacgtccttcagagtgtgggtgggcttgggctttgca gtgagagcaagtatcacccaggatggagatgggagaggtagcagcctgaagctgcagatc atttga >gi568815596f:24991324_25245093|GENSCAN_predicted_peptide_2|170_aa MKAQVSADGRGKGTFESGLKGGVKIVFSPEEAKAVSSEMIGKKLFTKQMGEKGGICNQAL KRKYPRGEDYFAIGMERSFQGPILIGSSHGGVNIADVAAETPDAIKEPVDIVEGIKKEQA LRLEQKMGFPPNIVDSAAENMVKLYSLFLKYDATMIRNKSNGEDSDGAVL >gi568815596f:24991324_25245093|GENSCAN_predicted_CDS_2|513_bp atgaaggcgcaggtttcagctgatggtcgaggaaaaggaacatttgaaagtggcctcaaa ggaggagtgaagatagttttctctccagaagaagcaaaagctgtttcctcagaaatgatt gggaaaaagttgtttaccaagcaaatgggagaaaagggtggaatatgcaatcaagcactg aagcgaaaataccctaggggagaagactactttgcaatagggatggagaggtcatttcaa ggtcctatattaataggaagttcacatggtggtgtcaacattgcagatgttgctgctgag actcctgatgcaattaaagaacctgttgatattgtagaaggcatcaaaaaggaacaagct ctccggcttgaacagaagatgggatttccacctaatattgtggattccgcagcagaaaac atggtcaagctttacagcctttttctgaaatatgatgcaaccatgataagaaataaatcc aatggtgaagattcagatggagctgtgctgtga >gi568815596f:24991324_25245093|GENSCAN_predicted_peptide_3|878_aa MGPVWWRLLEASVLGKAALALAPTFPGPDQEALLTVFPILIPGVCGCCGALRPRYKRLVD NIFPEDPEDGLVKTNMEKLTFYALSAPEKLDRIGAYLSERLIRDVGRHRYGYVCIAMEAL DQLLMACHCQSINLFVESFLKMVAKLLESEKPNLQILGTNSFVKFANIEEDTPSYHRSYD FFVSRFSEMCHSSHDDLEIKTKLNQYFHPSLLSICDYRIRMSGIKGLQGVVRKTVNDELQ ANIWDPQHMDKIVPSLLFNLQHVEEAESRSPSPLQAPEKEKESPAELAERCLRELLGRAA FGNIKNAIKPVLIHLDNHSLWEPKVFAIRCFKIIMYSIQPQHSHLVIQQLLGHLDANSRS AATVRAGIVEVLSEAAVIAATGSVGPTVLEMFNTLLRQLRLSIDYALTGSYDGAVSLGTK IIKEHEERMFQEAVIKTVGSFASTLPTYQRSEVILFIMSKVPRPSLHQAVDTGRTGENRN RLTQIMLLKSLLQVSTGFQCNNMMSALPSNFLDRLLSTALMEDAEIRLFVLEILISFIDR HGNRHKFSTISTLSDISVLKLKVDKCSRQDTVFMKKHSQQLYRHIYLSCKEETNVQKHYE ALYGLLALISIELANEEVVVDLIRLVLAVQDVAQVNEENLPVYNRCALYALGAAYLNLIS QLTTVPAFCQHIHEVIETRKKEAPYMLPEDVFVERPRLSQNLDGVVIELLFRQSKISEVL GGSGYNSDRLCLPYIPQLTDEDRLSKRRSIGETISLQVEVESRNSPEKEERVPAEEITYE TLKKAIALCVEVDSVAVEEQERERRRQVVEKFQKAPFEEIAAHCGARASLLQSKLNQIFE ITIRPPPSPSGTITAAYGQPQNHSIPVYEMKFPDLCVY >gi568815596f:24991324_25245093|GENSCAN_predicted_CDS_3|2637_bp atgggccctgtgtggtggcggctcctggaggcctctgtgctggggaaggctgccctggcc ctggctccaacctttcctgggcccgaccaggaggctttgctcacagtttttcccattctt attccaggtgtgtgtggctgctgtggtgccctacgccccaggtacaaaaggctggttgac aacatcttccctgaggatcccgaggatggtctggtgaagaccaacatggagaagctgacc ttctatgccctctcagctccagaaaaacttgatcgtattggcgcctacctctctgagagg ctcatccgtgacgtgggtcgccatcgatatgggtacgtgtgcattgctatggaggctttg gaccagctgctcatggcctgccactgccagagcatcaacctcttcgtggagagcttcctc aagatggtggccaagctgctggagtcagagaaacccaacctgcagatcctcggcaccaac tcgtttgtgaagtttgccaacatcgaggaggacaccccgtcctatcaccggagctatgac ttctttgtgtcccgattcagtgaaatgtgccactcgagccatgatgacttagaaatcaag accaagctcaatcagtacttccaccccagcctcctgagtatctgtgactacagaattcga atgtcaggcatcaaaggcctgcaaggggtggtgaggaagacggtgaatgatgaactgcag gccaatatctgggacccacagcacatggataagatcgttccatcactgcttttcaatcta cagcatgtagaggaggcagagagccggtctccctcacccctccaagcacctgagaaggag aaagagagccccgcggagctggctgagaggtgtcttcgggagctgctgggccgggctgcc tttggcaacatcaaaaacgccatcaagcctgttctcatccatctggataaccattctctt tgggaacccaaggtgtttgccatccgttgctttaaaatcatcatgtactcaattcagccg cagcactcacacctggtcatccagcagctcctgggccacctggacgccaacagccgcagc gctgcgacggtgcgcgcgggcatcgtggaagtcttgtcggaagccgcggtcatcgctgcc accggctctgtggggcccacagtactggagatgttcaacacgctgctgaggcagctgcgg ctcagcatcgactacgcgctgaccgggagctacgacggggcggtcagcctcggcaccaag atcatcaaggagcacgaggagcgcatgttccaggaggccgtcatcaagaccgtgggctcc tttgccagcacgctgcccacctaccagcgctccgaggtgatcctcttcatcatgagcaag gtcccgcggccatccctgcaccaggcggtggacacaggcaggacgggggagaataggaac cgtctgacccagattatgctgctaaaatccctcctgcaggtatccacaggtttccagtgc aacaacatgatgtcagccctgcctagcaacttcctggaccgccttctctccaccgccctc atggaggatgcagaaattcgactctttgttctagagattctcatcagtttcattgatcgt catggcaaccgccacaagttctctaccatcagtaccctcagtgacatctctgtcctgaag ctgaaagtggacaagtgctctcgacaggacaccgtcttcatgaagaagcactcccagcag ctctacagacacatctacctgagctgcaaggaggaaacaaacgtgcagaaacactacgag gcgctctatggcttgctggccctcatcagcatcgagctggctaacgaggaggtggtggtg gacctcatccgtctggtgctggctgttcaggacgtggcccaagtcaatgaggagaacttg cctgtctacaaccgctgtgccctctatgctctgggcgcagcctacctgaacctcatcagt cagctcacaacagtgcctgccttctgccagcacatccatgaggtgatagagaccaggaag aaagaggctccatacatgctccccgaggatgtgtttgtggagaggcccaggctgtctcag aatcttgatggggtggtcattgagctcctcttccgccagagcaagatcagtgaagtcctg ggaggcagtggctacaactcggaccggctctgcctgccctacattcctcagctgacagat gaggatcgtttatccaagaggaggagcattggagagaccatctccctgcaggtggaggta gaatcgaggaacagtccggagaaggaggagcgagtgcctgccgaggagatcacctatgag acactgaagaaagccattgctctctgtgtcgaagtggacagcgtagcagtggaggagcag gagcgtgagcggcggcggcaggtggtggagaagttccagaaggcacccttcgaggagatt gctgcacactgcggggcccgggcatcgctgctccagagcaaactcaatcagatctttgaa atcaccatccggcccccaccaagcccatcaggaaccatcactgcagcctacggtcagccg cagaaccactccatccccgtctatgaaatgaagtttcccgatctgtgtgtatactga >gi568815596f:24991324_25245093|GENSCAN_predicted_peptide_4|358_aa MPRSCCSRSGALLLALLLQASMEVRGWCLESSQCQDLTTESNLLECIRACKPDLSAETPM FPGNGDEQPLTENPRKYVMGHFRWDRFGRRNSSSSGSSGAGQKREDVSAGEDCGPLPEGG PEPRSDGAKPGPREGKRSYSMEHFRWGKPVGKKRRPVKVYPNGAEDESAEAFPLEFKREL TGQRLREGDGPDGPADDGAGAQADLEHSLLVAAEKKDEGPYRMEHFRWGSPPKDKRYGGF MTSEKSQTPLVTLFKNAIIKNAYKKGEACAGVRRLPTITLTFTFPILAYLARIQVLLHVS RQPNTLLSDGDEGQKHLGISGSNAGSGKMLQGGDCQVPEPGLTGSAQDEAATGNPNVK >gi568815596f:24991324_25245093|GENSCAN_predicted_CDS_4|1077_bp atgccgagatcgtgctgcagccgctcgggggccctgttgctggccttgctgcttcaggcc tccatggaagtgcgtggctggtgcctggagagcagccagtgtcaggacctcaccacggaa agcaacctgctggagtgcatccgggcctgcaagcccgacctctcggccgagactcccatg ttcccgggaaatggcgacgagcagcctctgaccgagaacccccggaagtacgtcatgggc cacttccgctgggaccgattcggccgccgcaacagcagcagcagcggcagcagcggcgca gggcagaagcgcgaggacgtctcagcgggcgaagactgcggcccgctgcctgagggcggc cccgagccccgcagcgatggtgccaagccgggcccgcgcgagggcaagcgctcctactcc atggagcacttccgctggggcaagccggtgggcaagaagcggcgcccagtgaaggtgtac cctaacggcgccgaggacgagtcggccgaggccttccccctggagttcaagagggagctg actggccagcgactccgggagggagatggccccgacggccctgccgatgacggcgcaggg gcccaggccgacctggagcacagcctgctggtggcggccgagaagaaggacgagggcccc tacaggatggagcacttccgctggggcagcccgcccaaggacaagcgctacggcggtttc atgacctccgagaagagccagacgcccctggtgacgctgttcaaaaacgccatcatcaag aacgcctacaagaagggcgaggcctgtgctggggtgaggagattacctactatcacactc accttcacctttccgattctggcctatttggccagaatccaggttctactccatgtcagc agacaacccaacactctactgagtgatggggatgaggggcagaaacatctggggatctca ggttctaatgctggctctggaaagatgctccagggaggcgactgccaagtcccagaacca gggctcacaggcagtgctcaggatgaagctgccacggggaaccctaatgttaagtag >gi568815596f:24991324_25245093|GENSCAN_predicted_peptide_5|146_aa MRARGFLENRAASAHPAGALELQEAISRRHRIRPNAVARPLRPSEPGGEALGAAPQGSRA IAALEQREYDSPRLPHHSFQTMGKSEASPCADGDIYRQMRTRQMPAPARTQSAGSGSEGG PGRLWLSAVPSSRQTFCADCGMRSRQ >gi568815596f:24991324_25245093|GENSCAN_predicted_CDS_5|441_bp atgagggcgcgcggctttcttgaaaacagagcagcctctgctcaccctgcaggcgctctg gagctgcaggaggccatttctagaaggcaccggattcggcctaatgcggtggcgaggccg ctgcgtccctcggagcccggtggggaggccctgggtgccgcgccccaaggctcgcgggcg attgccgccctggaacagagagaatatgattccccacgacttccacatcacagtttccaa acaatggggaaatcggaggcctccccgtgtgcagacggtgatatttaccgccaaatgcga accaggcagatgccagccccagcacgcacgcagtcggccggctccggctccgaaggcgga cctgggcgcctctggctctccgcggtcccgagttctcgacaaactttctgcgccgactgc ggcatgagaagccgccagtag >gi568815596f:24991324_25245093|GENSCAN_predicted_peptide_6|443_aa NCFLECAYQYDDDGYQSYCTICCGGREVLMCGNNNCCRCFCVECVDLLVGPGAAQAAIKE DPWNCYMCGHKGTYGLLRRREDWPSRLQMFFANNHDQEFDPPKVYPPVPAEKRKPIRVLS LFDGIATGLLVLKDLGIQVDRYIASEVCEDSITVGMVRHQGKIMYVGDVRSVTQKHIQEW GPFDLVIGGSPCNDLSIVNPARKGLYGRQPQLMAFSSDLSEGTGRLFFEFYRLLHDARPK EGDDRPFFWLFENVVAMGVSDKRDISRFLESNPVMIDAKEVSAAHRARYFWGNLPGMNRP LASTVNDKLELQECLEHGRIAKFSKVRTITTRSNSIKQGKDQHFPVFMNEKEDILWCTEM ERVFGFPVHYTDVSNMSRLARQRLLGRSWSVPVIRHLFAPLKEYFACVPQRLLAIEHPIG KQTFQESTRDLLAFETGLGVIDW >gi568815596f:24991324_25245093|GENSCAN_predicted_CDS_6|1332_bp aactgctttctggagtgtgcgtaccagtacgacgacgacggctaccagtcctactgcacc atctgctgtgggggccgtgaggtgctcatgtgcggaaacaacaactgctgcaggtgcttt tgcgtggagtgtgtggacctcttggtggggccgggggctgcccaggcagccattaaggaa gacccctggaactgctacatgtgcgggcacaagggtacctacgggctgctgcggcggcga gaggactggccctcccggctccagatgttcttcgctaataaccacgaccaggaatttgac cctccaaaggtttacccacctgtcccagctgagaagaggaagcccatccgggtgctgtct ctctttgatggaatcgctacagggctcctggtgctgaaggacttgggcattcaggtggac cgctacattgcctcggaggtgtgtgaggactccatcacggtgggcatggtgcggcaccag gggaagatcatgtacgtcggggacgtccgcagcgtcacacagaagcatatccaggagtgg ggcccattcgatctggtgattgggggcagtccctgcaatgacctctccatcgtcaaccct gctcgcaagggcctctacggtagacagccccagctgatggctttctcttccgacctctca gagggcactggccggctcttctttgagttctaccgcctcctgcatgatgcgcggcccaag gagggagatgatcgccccttcttctggctctttgagaatgtggtggccatgggcgttagt gacaagagggacatctcgcgatttctcgagtccaaccctgtgatgattgatgccaaagaa gtgtcagctgcacacagggcccgctacttctggggtaaccttcccggtatgaacaggccg ttggcatccactgtgaatgataagctggagctgcaggagtgtctggagcatggcaggata gccaagttcagcaaagtgaggaccattactacgaggtcaaactccataaagcagggcaaa gaccagcattttcctgtcttcatgaatgagaaagaggacatcttatggtgcactgaaatg gaaagggtatttggtttcccagtccactatactgacgtctccaacatgagccgcttggcg aggcagagactgctgggccggtcatggagcgtgccagtcatccgccacctcttcgctccg ctgaaggagtattttgcgtgtgtgccccagagacttcttgctattgaacatcctattgga aagcaaactttccaagagagcacccgggacctgttagcttttgaaacaggcctgggtgtg attgactggtag