GENSCAN 1.0	Date run: 6-Nov-116	Time: 10:01:27

Sequence gi568815594f:123301592_123502548 : 200957 bp : 40.32% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3159   3258  100  2  1   44   58   143 0.519   7.40
 1.02 Intr +   6895   7094  200  2  2   21   53   157 0.276   3.65
 1.03 Term +  12297  12473  177  2  0   64   54   157 0.989   6.60
 1.04 PlyA +  13504  13509    6                               1.05

 2.02 PlyA -  13787  13782    6                               1.05
 2.01 Sngl -  31502  30774  729  2  0   59   49   232 0.978  12.37
 2.00 Prom -  32594  32555   40                              -4.85

 3.04 PlyA -  32930  32925    6                               1.05
 3.03 Term -  34149  33818  332  1  2   69   54   249 0.847  13.43
 3.02 Intr -  35604  35495  110  2  2   31   60    78 0.430  -1.69
 3.01 Init -  38434  38295  140  1  2  111   64    89 0.493   8.54
 3.00 Prom -  44565  44526   40                              -5.35

 4.05 PlyA -  44662  44657    6                               1.05
 4.04 Term -  47007  46675  333  0  0   93   41   137 0.834   3.13
 4.03 Intr -  48413  48234  180  2  0  -49   99   168 0.888   3.34
 4.02 Intr -  48763  48623  141  1  0   33   81    81 0.352   1.53
 4.01 Init -  50553  50341  213  0  0   85   43   128 0.689   6.89
 4.00 Prom -  60519  60480   40                              -5.25

 5.00 Prom +  60576  60615   40                              -3.65
 5.01 Init +  61589  61647   59  1  2   41  100    52 0.126   2.53
 5.02 Intr +  76299  76421  123  0  0   93   83    55 0.045   4.28
 5.03 Intr +  95569  95702  134  1  2   27  -22   179 0.444   0.17
 5.04 Intr +  96020  96132  113  0  2   97   26   137 0.835   7.58
 5.05 Intr +  96443  96639  197  1  2   83   30   199 0.830  10.89
 5.06 Intr +  97175  97339  165  2  0   13   15   177 0.262   1.05
 5.07 Intr +  99946 100889  944  1  2   92   58   501 0.306  37.36
 5.08 Term + 101090 101229  140  0  2   82   38   117 0.598   3.24
 5.09 PlyA + 102610 102615    6                               1.05

 6.00 Prom + 128129 128168   40                              -5.45
 6.01 Init + 129569 129683  115  1  1   64   65    63 0.412   2.03
 6.02 Intr + 132689 132844  156  0  0   78  113   176 0.883  18.16
 6.03 Term + 133053 133168  116  1  2   99   43    36 0.045  -2.05
 6.04 PlyA + 133179 133184    6                              -3.64

 7.05 PlyA - 133250 133245    6                               1.05
 7.04 Term - 134172 133655  518  1  2   94   37   294 0.125  18.39
 7.03 Intr - 134450 134335  116  1  2   43   85    81 0.023   2.47
 7.02 Intr - 142386 142293   94  1  1  108   45    53 0.012   1.10
 7.01 Init - 149411 149390   22  2  1   73   96    25 0.679   1.75
 7.00 Prom - 153617 153578   40                              -6.35

 8.00 Prom + 157332 157371   40                              -5.75
 8.01 Sngl + 159142 159336  195  0  0  104   47   291 0.992  21.51
 8.02 PlyA + 159579 159584    6                               1.05

 9.03 PlyA - 160411 160406    6                               1.05
 9.02 Term - 166463 166337  127  1  1   86   38    62 0.169  -2.23
 9.01 Init - 179878 179499  380  1  2   93   77   193 0.736  15.12
 9.00 Prom - 180530 180491   40                              -8.35

10.07 PlyA - 181286 181281    6                              -1.75
10.06 Term - 181633 181413  221  2  2   29   44   211 0.350   7.02
10.05 Intr - 193423 193315  109  1  1   33  111    38 0.007  -0.46
10.04 Intr - 194530 194353  178  0  1   54   97    71 0.034   3.60
10.03 Intr - 195303 195197  107  0  2   -1   60   137 0.027   0.09
10.02 Intr - 197761 197675   87  1  0   68   88    65 0.106   3.75
10.01 Init - 199556 199521   36  2  0  112  100    29 0.583   6.56

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 195150 195346  197  1  2  138   40   176 0.878  14.39

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:123301592_123502548|GENSCAN_predicted_peptide_1|158_aa
MQNSGGSSQAAVTERRDSISANTWIQCAAAEKGPAALDSHRCGNPIVNCACEGSRLCTPY
ESLMPDDLRWNGFILKPPALLRPLSMERLSSTKPVPGAKKIVAVCREAALLALEEDIQAN
LIMKRHFTQALSTVTPRIPESLRRFYEDYQEKSGLHTL
>gi568815594f:123301592_123502548|GENSCAN_predicted_CDS_1|477_bp
atgcagaacagcgggggcagcagccaggctgccgtgaccgagcgcagagacagcatctct
gccaatacgtggattcagtgtgcagctgcagaaaaaggaccggcggcattagattctcat
aggtgtgggaaccctattgtgaactgtgcatgtgagggatccaggttgtgcactccttat
gagagtctaatgcctgatgatctgagatggaacggattcatcctgaaaccacccgcactc
ctccgccccctgtccatggaaagattgtcttccacgaaaccagtccctggtgctaaaaag
attgtagctgtctgcagagaggcagctcttctggctctggaagaagacattcaagccaat
ctcatcatgaaaagacatttcactcaggccttgagcactgtgacacctagaattcctgag
tcattgagacgtttttatgaagattatcaagagaagagtgggctgcatacactctga
>gi568815594f:123301592_123502548|GENSCAN_predicted_peptide_2|242_aa
MNIDAKILNKILANQIQQHIKKLIHHDQVGFIPGMQGWFNIYKSINVIYHINRTKDKNHI
IISIDTEKALDKTQHPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKPEAFPLKT
SRRQGCPLSPLLFNIVLEVLDRAIRQEKEIKGIRIGREEVRLSLFADDMIVYLETPIVSA
QKLLKLISNFSKVSGYKINVQKSQALLYINNSQTESQIMSELPFTIATKRIKYLGIQLTR
AT
>gi568815594f:123301592_123502548|GENSCAN_predicted_CDS_2|729_bp
atgaacattgatgcaaagatcctcaataaaatactggcaaaccaaatccagcagcacatc
aaaaagcttattcaccatgatcaagttggcttcatccctgggatgcaaggctggttcaac
atatacaaatcaataaatgtaatttatcacataaacagaaccaaagacaaaaaccacatc
attatctcaattgatacagaaaaggcccttgataaaactcaacacccctttatgctaaaa
actctcaataaactaggtattgatgggacgtatctcaaaataataagagctatttatgac
aaacccacagccaatatcatactgaatgggcaaaagccagaagcattccctttgaaaacc
agcagaagacaaggatgccctctctcaccactcctatttaacatagtattagaagttctg
gacagggcaatcaggcaagagaaagaaataaagggtattcgaataggaagagaggaagtc
agattgtctctgtttgcagatgacatgattgtatatttagaaacccccattgtctcagcc
caaaaactccttaagctgataagcaacttcagcaaagtctcagggtacaaaatcaatgtg
caaaaatcacaagcattactatacatcaataatagtcaaacagagagccaaataatgagt
gaactcccattcacaattgctacaaagagaataaaatacctaggaatacaacttacaagg
gcaacttga
>gi568815594f:123301592_123502548|GENSCAN_predicted_peptide_3|193_aa
MATSQALERDLLGPVEGSLGSTSSPSSGFSVKHRLLFVPFDAFLKAIMGVLLAGIKTTLG
DLARWLNRNSSSRQLPVRPMQNAGSQLFASKGTNWKKNEFDKFIEGGFKRWVITNSSELK
DHVLTQCKETKNLDKRLKEFLTSITSLEKNINDLMELENTAQELREAYTSINSRINQAKE
SISEIEDELNEIK
>gi568815594f:123301592_123502548|GENSCAN_predicted_CDS_3|582_bp
atggccacctcacaggctctggagagggacttgctgggcccagtggaaggcagcttaggc
tccacatcctctccgtcttccggcttctctgtgaagcacaggttattgtttgtacccttt
gatgcttttctaaaggctataatgggtgttctattagcaggcataaaaacaacattaggg
gatctggcaagatggctgaatagaaacagctccagtcggcagctcccagtgagaccaatg
cagaatgcaggatcacaactctttgccagcaagggaacaaactggaagaagaacgagttt
gacaaattcatagaaggaggcttcaaaaggtgggtaataacaaactcctctgagctaaag
gatcatgttctaacccaatgcaaggaaactaagaaccttgataaaagattaaaggaattt
ctaactagcataaccagtttagagaagaacataaatgatctgatggagctggaaaacaca
gcacaagaacttcgtgaagcgtacacaagtatcaatagccgaatcaatcaagcaaaagaa
agtatatcagagatagaagatgaacttaatgaaataaagtga
>gi568815594f:123301592_123502548|GENSCAN_predicted_peptide_4|288_aa
MGQTHGFCQVLFLREHGRLLSMGTGGAGAGSVTVRGSPQPRPATSLGYLEISHGGSIYAA
EMAARHSPALRKCRLVEQRNTHALLFCNQALTEAFRCGAYKIRLIDSLTSPCSVTTLPGG
YRSSSLKKQLGLVLGEKQNKTVTVKVPEKRDEADGGSSFNSMVRIKLVHTQRCFENYLEQ
SSKCSESGSHLGRLYSASSGDKYVWTRQGSYYSIWKVYGSSEHHYPMKFPCSDFLLMKQP
YDDNFWLPMQLSHEESVLISHYDRMAAHDVSILTREQSSHSSLYIKLV
>gi568815594f:123301592_123502548|GENSCAN_predicted_CDS_4|867_bp
atggggcagacgcatgggttctgccaggtgctgtttctcagggaacacgggaggctgttg
tctatggggacgggtggggctggagcaggctctgtgacagtcagaggctctccccaaccc
cggccagcgacctcccttgggtaccttgaaatcagccacggcgggagtatttacgccgcg
gagatggcagcacgacacagccctgctttgcggaaatgtcgtcttgtagaacaaagaaat
acgcacgccctgctgttctgcaatcaggctctgacagaagctttcaggtgtggtgcttac
aagatcagactcattgactcactcacttctccctgctccgtcactacactcccaggtggg
tatagaagttcaagtttgaagaagcagctaggtctagtgctaggagagaaacagaataag
actgtgacagtaaaggtaccagagaaacgggatgaagcagatgggggaagcagtttcaat
tccatggtgcgaataaaattggtgcacactcagagatgttttgagaactatttggaacag
tcttcaaaatgtagtgaaagtgggtctcacttaggaagactatacagtgcatcttcagga
gacaagtatgtgtggacccgccagggtagttattattctatttggaaagtctatggctct
agtgaacaccactatccaatgaaattcccttgctcagatttcttgctcatgaaacagcct
tatgatgataatttctggcttcctatgcagctttcccatgaagagagtgtactgattagc
cactatgaccgtatggctgcccatgacgtgtctattcttactagagagcaatcatcacat
tctagtttatacatcaaacttgtgtga
>gi568815594f:123301592_123502548|GENSCAN_predicted_peptide_5|624_aa
MPKPRKTSESRENGAKVAKNPMILIADAGVSVRTLCTLLTMFPHCAINSGLLYNISIRPV
SRRLKSETIEFVKHRAVNAFMRRGASTRCVDQMRVCLGIVGDRIPSISAVEPQARRTGCE
RCPGEADLPGPECPGTLCPHLPNEAAIPSFVPCEERLEPEFSEHTVPSAGPSRTRLRHSP
TCFSETWEAFLKGFVKTGSWKGAAEDLPRTGSPPSPITRSGGAVAQVVLSESPSRPRGIS
GGSGDVPGGPLWRSVPQAGGRQRLISDACQVSTDCQNSRSLHMDPQNQHGSGSSLVVIQQ
PSLDSRQRLDYEREIQPTAILSLDQIKAIRGSNEYTEGPSVVKRPAPRTAPRQEKHERTH
EIIPINVNNNYEHRHTSHLGHAVLPSNARGPILSRSTSTGSAASSGSNSSASSEQGLLGR
SPPTRPVPGHRSERAIRTQPKQLIVDDLKGSLKEDLTQHKFICEQCGKCKCGECTAPRTL
PSCLACNRQCLCSAESMVEYGTCMCLVKGIFYHCSNDDEGDSYSDNPCSCSQSHCCSRYL
CMGAMSLFLPCLLCYPPAKGCLKLCRRCYDWIHRPGCRCLTHGFFSFLMDDLQQEWTGKL
HLAPTFNKSLCHPLEGIESQWAFV
>gi568815594f:123301592_123502548|GENSCAN_predicted_CDS_5|1875_bp
atgccaaaaccaagaaagacatctgaaagcagagagaatggagcaaaagttgctaagaac
cctatgattcttattgcagatgctggtgtctccgttaggaccctttgcaccctgctaaca
atgtttccacactgtgctataaactctggtttattgtacaacatctccattagaccagta
agtagacgtttaaagtctgaaacgatcgaatttgtgaaacacagagccgttaatgccttt
atgcgaagaggggctagcactcggtgtgtggaccaaatgcgtgtttgtttaggcattgtt
ggcgaccgaatcccgagcatttcggccgtggaaccccaggctcggaggactgggtgtgag
cgctgcccgggagaggctgacctgccgggaccggagtgcccggggacgctgtgcccccac
ttgcccaacgaagccgccatcccttccttcgttccttgcgaggagcgactagaacccgag
ttctctgagcacaccgtgccgtcagctggcccgtcacgcactcgactccgccactcccct
acttgtttttctgagacttgggaagccttcctgaaaggatttgtaaaaactggttcttgg
aaaggggctgcagaagacctcccgaggacagggtctcctccatccccgattacccggagc
ggcggcgcggtggcccaggttgtcctctcggagagcccttcacgcccaagggggatttca
ggcggctctggggatgtccccggaggccctctttggaggagtgtgcctcaagcaggggga
cgacaaaggctgatttcagatgcatgccaggtttccactgattgccagaactcgagatca
ctacacatggatccccaaaatcaacatggcagtggcagttcgttagttgtgatccagcag
ccttctttggatagccgtcagagattagactatgagagagagattcagcctactgctatt
ttgtccttagaccagatcaaggccataagaggcagcaatgaatacacagaagggccttcg
gtggtgaaaagacctgctcctcggacagcaccaagacaagaaaagcatgaaaggactcat
gaaatcataccaattaatgtgaataataactacgagcacagacacacaagccacctggga
catgcagtactcccaagtaatgccaggggccccattttgagcagatcaaccagcactgga
agtgcagccagctctgggagcaacagcagtgcctcttctgaacagggactgttaggaagg
tcaccaccaaccagaccagtccctggtcataggtctgaaagggcaatccggacccagccc
aagcaactgattgtggatgacttgaagggttccttgaaagaggacctgacacagcacaag
ttcatttgtgaacagtgtgggaagtgcaagtgtggagaatgcactgctcccaggacccta
ccatcctgtttggcctgtaaccggcagtgcctttgctctgctgagagcatggtggaatat
ggaacctgcatgtgcttagtcaagggcatcttctaccactgctccaatgacgacgaaggg
gattcctattcagataatccttgctcctgttcacaatcacactgctgctctagatacctg
tgtatgggagccatgtctttatttttaccttgcttactctgttatcctcctgctaaagga
tgcctgaagctgtgcaggaggtgttatgactggatccatcgcccagggtgcagatgtcta
actcatggatttttctctttcctcatggatgatcttcagcaagagtggactgggaagctg
cacctggctcccactttcaacaagagcctctgccatccacttgagggtattgagagccag
tgggcttttgtgtag
>gi568815594f:123301592_123502548|GENSCAN_predicted_peptide_6|128_aa
MPCCCCCFFHFILKLHYPVAPHLLREPVQKTVGSHQQQATPGMAQPASTLGAGVWMRGMW
QRPKTQRCQCVRNWQVLGLTDLKNEAVDPHGEFVVLLASGVKLQTFTVSVTAHKSSANPK
SEQQQNLL
>gi568815594f:123301592_123502548|GENSCAN_predicted_CDS_6|387_bp
atgccctgctgctgctgctgcttctttcatttcatcctcaaactgcactacccagtagca
ccccacctcttaagagagcccgtgcagaaaacagttggatctcatcaacagcaggctact
ccaggtatggcacaaccagcttccacattgggcgctggtgtctggatgagggggatgtgg
cagcgcccaaaaactcagagatgccagtgtgtccgaaattggcaggttcttggtctcact
gacttgaagaatgaagccgtggaccctcacggtgagtttgtggtcttgctggcctcagga
gtgaagctgcaaaccttcacggtgagtgttacagctcataaaagcagtgcaaacccaaaa
agtgagcaacagcaaaatttgttgtaa
>gi568815594f:123301592_123502548|GENSCAN_predicted_peptide_7|249_aa
MAVGPRAGATSAAPLPSPITNLLASRVFSLVLIEAPLNGQWDQMSVTLPGGSDLQTGLLE
WDSLSKSCNTPCGSTIAGRCGGRGVGGNRGCTQHSRASASSRWAWARQAGTRSGWQALPT
PGSEGLSTWASSCRGGTRSPSTAGWPAPHSNSRRASAACLRCRARDLQPAMPKPPSLPCL
VGSHTTQASPTGADPCSTMPGPIDHPRAEECRCAAQDWQAAPPTALVGDPLGEASWAPES
GWDLENFYA
>gi568815594f:123301592_123502548|GENSCAN_predicted_CDS_7|750_bp
atggctgtaggacccagagcaggagctacttctgctgcaccactcccttcccccatcacc
aatcttctggcttccagagtcttttcacttgttcttattgaagcacctttgaatgggcag
tgggaccaaatgagtgtgacacttcctgggggctcagatcttcagactggactccttgag
tgggactccctgagcaaaagctgtaacaccccttgtggctccaccattgctgggaggtgt
ggagggagaggcgtgggtggcaaccggggctgcacgcagcactcgcgggccagcgcgagt
tccaggtgggcgtgggctcggcaggctggcactcgtagtggctggcaggcgctgcccacc
ccaggcagtgaggggcttagcacctgggccagcagctgcagagggggcaccaggtccccc
agcactgccggctggcctgcgccacactcgaattctcgccgggcctcagctgcctgcctg
cgatgcagggctcgggacctgcagcctgccatgcccaagcccccctcccttccctgcctg
gtgggctcccacacaacccaagcctccccgacgggcgctgacccctgctccaccatgccc
ggtcccatcgaccacccaagggctgaggagtgcaggtgtgcagcccaggactggcaggcg
gctccgcccacggccctggtcggggatcctctaggcgaagccagctgggctcctgagtcg
ggttgggacttggagaacttttatgcctag
>gi568815594f:123301592_123502548|GENSCAN_predicted_peptide_8|64_aa
MDVFLRRVTPPQEEPQAGPSGGVPEEGIVIIGDDSSMRVIAPEDLPVEQHVEVENSDIDD
LVPV
>gi568815594f:123301592_123502548|GENSCAN_predicted_CDS_8|195_bp
atggatgtatttctgagaagagtgacacctcctcaagaagagcctcaggcaggtccttca
ggtggtgttccagaagaaggcattgttatcataggagatgacagctccatgcgtgttatt
gcccctgaagaccttccagtggaacagcatgtggaggtggaaaatagtgatattgatgac
cttgttcctgtgtag
>gi568815594f:123301592_123502548|GENSCAN_predicted_peptide_9|168_aa
MDIRSRLLQAACQEHQGVINTQQCSDENLAFLESTLASSPRELLYDNRRLLCSSPSCRRL
CCGLFPMYMIQSSLPSTSVAAKAPPLMSYAQVIHFLPSPLCSSRAHFSPNRQHENKERNL
TFSERGRVVEHTTSGLSKMKLTHLHLTLAAAKYLSSSLKSSTHFLQLL
>gi568815594f:123301592_123502548|GENSCAN_predicted_CDS_9|507_bp
atggatatcagaagtaggcttctccaagcagcatgtcaagaacaccagggtgttataaac
acacaacagtgcagcgatgagaacctggcttttctggagtccacactggcctcttctcca
agggagctgctgtatgacaacagacggctgctgtgttcctcaccttcttgtcgtaggctc
tgctgtggtttattccccatgtatatgattcagagctccttaccaagcacctctgtggca
gccaaggctccaccactgatgagttatgcccaagttattcactttcttccatctccatta
tgctcaagcagagcccatttcagccccaacaggcaacatgagaacaaggaaaggaatcta
acatttagtgagagaggcagagttgtagaacatactacctcgggtttaagcaagatgaag
ctcactcaccttcacttgacccttgctgctgctaaatatttgtcttcatctctcaaatcc
tctacacacttcctacaactactctaa
>gi568815594f:123301592_123502548|GENSCAN_predicted_peptide_10|245_aa
MTTQNSKVEVNKFLICKQQKINIEPLIEQPDHEGTRQSSGKDIRSSGKDSGPSGSSGQLL
ASSPVSGFPRKRTRKRSDNSPTVAITSSQWDICVAGLGRDAYIQHSEACVSYLQHTSGYN
ATTVHFQFASRSRVTQSHREAITTLQISEYQCQVETALERLGVPLSSVYVLPVWKLLHNR
LAESLHGRRLGVDVECEAYSLEAFVHKCTGKVAQQTETARTSQRFSRGAAACRQAFENYS
KCLEL
>gi568815594f:123301592_123502548|GENSCAN_predicted_CDS_10|738_bp
atgaccacacagaactcgaaggtggaagtgaacaagtttctaatctgcaagcagcagaaa
atcaatattgagcctttaatagagcaacctgaccatgaaggaactaggcagtcatcaggt
aaggatatcagatcttctggcaaggacagtggcccatctggttcttctgggcagctgcta
gcatcaagtcctgtttcaggtttccccaggaagcggactcgaaaacggagtgacaatagt
cctactgtggccattacaagctcccagtgggatatctgtgttgcaggattgggtagagat
gcatacattcagcactcagaagcctgtgtgagctaccttcagcacacctctggctataat
gcaaccactgtacattttcagtttgcatcacgttccagggtgactcagtctcacagggaa
gccataacaactcttcaaatttctgagtatcaatgtcaagttgaaacagctcttgaaagg
ttgggtgttcccctttcctcggtgtatgttcttccagtgtggaaacttctacacaatcgt
cttgcagagtcacttcacgggagaagacttggggtagatgtggagtgtgaagcctattct
ctggaggcatttgtacacaaatgcacaggcaaggttgcacagcagactgagacagcgaga
acttctcagaggttctcaagaggtgcagcagcttgcagacaagcttttgaaaattactcc
aagtgtttggagctgtag