GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:32:31 Sequence gi568815590f:54358139_54559992 : 201854 bp : 41.19% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1679 1684 6 1 0 85 97 11 0.424 2.00 1.02 Term + 6475 6699 225 0 0 104 43 216 0.961 14.50 1.03 PlyA + 7988 7993 6 1.05 2.00 Prom + 12993 13032 40 -4.75 2.01 Init + 16700 16754 55 1 1 27 98 35 0.398 -0.10 2.02 Intr + 17653 17713 61 2 1 124 103 0 0.561 1.97 2.03 Intr + 21075 21154 80 0 2 97 70 198 0.978 17.18 2.04 Intr + 22587 22750 164 1 2 80 40 100 0.427 3.17 2.05 Term + 24888 25043 156 2 0 66 49 128 0.357 3.65 2.06 PlyA + 25236 25241 6 1.05 3.02 PlyA - 26184 26179 6 1.05 3.01 Sngl - 29473 29018 456 1 0 51 49 296 0.803 17.93 3.00 Prom - 36048 36009 40 -8.25 4.03 PlyA - 36108 36103 6 1.05 4.02 Term - 38719 38585 135 1 0 49 37 105 0.285 -1.46 4.01 Init - 39531 39364 168 0 0 70 89 118 0.567 9.68 4.00 Prom - 57191 57152 40 -6.55 5.04 PlyA - 57968 57963 6 1.05 5.03 Term - 61092 60998 95 1 2 107 32 66 0.415 -0.09 5.02 Intr - 66534 66390 145 0 1 -7 72 187 0.752 6.53 5.01 Init - 67648 67508 141 1 0 32 111 92 0.955 6.08 5.00 Prom - 85078 85039 40 -4.85 6.00 Prom + 91819 91858 40 -3.75 6.01 Init + 95661 95682 22 2 1 47 49 52 0.163 -2.82 6.02 Intr + 95730 95816 87 1 0 87 54 132 0.376 8.72 6.03 Intr + 96265 96448 184 2 1 110 76 114 0.660 10.32 6.04 Term + 96454 96601 148 1 1 1 53 190 0.451 3.19 6.05 PlyA + 97170 97175 6 1.05 7.00 Prom + 99448 99487 40 -9.15 7.01 Init + 100001 100307 307 1 1 89 92 454 0.830 43.40 7.02 Term + 100920 101857 938 1 2 93 48 1069 0.999 94.68 7.03 PlyA + 102215 102220 6 1.05 8.04 PlyA - 102309 102304 6 1.05 8.03 Term - 109294 108650 645 1 0 14 54 371 0.827 19.43 8.02 Intr - 111799 111662 138 1 0 -32 115 175 0.659 7.94 8.01 Init - 112288 111833 456 1 0 81 65 223 0.886 13.59 8.00 Prom - 116196 116157 40 -8.35 9.05 PlyA - 116415 116410 6 1.05 9.04 Term - 117178 116985 194 2 2 92 43 134 0.784 5.90 9.03 Intr - 119280 119068 213 1 0 68 32 149 0.431 5.06 9.02 Intr - 119534 119385 150 0 0 39 18 154 0.280 2.71 9.01 Init - 120216 120156 61 0 1 92 94 24 0.836 4.86 9.00 Prom - 127926 127887 40 -4.35 10.10 PlyA - 128365 128360 6 1.05 10.09 Term - 132362 132207 156 2 0 44 49 164 0.872 5.05 10.08 Intr - 138610 138301 310 0 1 25 59 158 0.067 1.99 10.07 Intr - 148564 148491 74 1 2 76 63 49 0.001 -1.51 10.06 Intr - 151470 151290 181 2 1 5 66 139 0.050 2.35 10.05 Intr - 151760 151706 55 0 1 110 52 73 0.088 3.02 10.04 Intr - 165164 164649 516 0 0 71 78 296 0.335 18.90 10.03 Intr - 167160 166983 178 0 1 64 28 126 0.745 2.77 10.02 Intr - 184661 184582 80 0 2 55 89 61 0.016 1.25 10.01 Init - 193320 193209 112 0 1 82 75 112 0.652 9.73 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 121691 121425 267 2 0 60 47 239 0.974 12.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:54358139_54559992|GENSCAN_predicted_peptide_1|76_aa MASCPYTNPFKASGYLQVSAKTPGMADNSLHFGPYANCGCFSAAKSWLPPPSTRGRVAME LLQLKLRAHPAYVSGT >gi568815590f:54358139_54559992|GENSCAN_predicted_CDS_1|231_bp atggcctcatgcccttacacaaatcctttcaaagcctctggttacctgcaggtatctgcc aaaacccctgggatggcagacaacagccttcactttgggccctatgcaaactgtggctgc ttctctgcagccaagtcctggctccctcctccttcaaccaggggcagagttgccatggag ctcctgcaacttaagttgagagcccatccggcatatgtgtccggcacctaa >gi568815590f:54358139_54559992|GENSCAN_predicted_peptide_2|171_aa MGAEEDREPEELFVSGYKGPGIHYSTFCLYEFDHSRVIIIWTALKPGKQQTLLDREISDG KQHQEITSEGDTLTIVCFIDEDTAKDLRASHSSRLSELEVECVSTLRSTPTPFCFLLSLN KVEHSQQPAPRSQRFNPTVTRNALKLSASRSGSFWQAAVPRDVDFDAVWTP >gi568815590f:54358139_54559992|GENSCAN_predicted_CDS_2|516_bp atgggggctgaggaagacagggaaccagaagagctttttgtttctggctataaaggccct ggcatccactattctactttctgtctctatgagtttgaccactctagagtcatcataatc tggactgcactgaaacctggaaaacaacagacgctgttggatcgggaaatatctgacggc aaacagcaccaggagattaccagtgagggagatactctcaccatcgtctgtttcatagat gaggacactgccaaggatttgcgtgcatcacacagctcacggctgtcagagctggaagtg gagtgcgtttccaccttgagaagcacaccaacacctttttgttttcttctgtctttaaac aaagttgaacactctcagcaacctgctcccagaagtcagaggtttaatccaacagttacc agaaatgctcttaaactttctgcttccaggtcaggaagtttttggcaggccgctgttccc agggacgtggactttgatgcagtgtggactccttaa >gi568815590f:54358139_54559992|GENSCAN_predicted_peptide_3|151_aa MNDKKVKQTYCSYEENFSGLDRRSNQLQHSLNPNPHLKSKALALFNSLNTKRDEEAAEEK LKASRVCFMKFKNRSHLCNIKVQSEAAHTDGEAAAGYPEDLGKIIDEGSYTKQQIFSVDK KAFYWKKMPSRIFIARQETSMPDFKGQAGSH >gi568815590f:54358139_54559992|GENSCAN_predicted_CDS_3|456_bp atgaatgataagaaagtgaaacagacttattgctcatatgaagaaaattttagtggcctg gatagaagatcaaaccagctacaacattccctgaatccaaaccctcatctcaagagcaaa gccctagctctctttaattccttgaatactaagagagatgaggaagctgcagaggaaaag ttgaaagctagcagagtttgcttcatgaaatttaagaacagaagccatctctgtaacata aaagtgcaaagtgaagcagcacatactgatggagaagctgcagcaggttatccagaagat ctaggaaagatcattgatgaaggtagctacaccaaacaacagattttcagtgtagacaaa aaagccttttattggaagaagatgccatctaggattttcatagctagacaggagacatca atgcctgacttcaaaggacaggctggctctcattaa >gi568815590f:54358139_54559992|GENSCAN_predicted_peptide_4|100_aa MVQTHILHREKGKPWPTGKQGHWVKELRGHRVGCVKNKSKCEDEEKECSSKVSQTVEISQ ELLWVSTAGDFRANTPRGSSPSFDKDGHNGLRVLRQLFVM >gi568815590f:54358139_54559992|GENSCAN_predicted_CDS_4|303_bp atggttcagactcacattctgcaccgtgagaaagggaagccatggcccactggaaagcaa ggtcactgggtcaaggagctaagaggtcacagagtcggatgtgtcaagaataaaagcaaa tgtgaggatgaagagaaagaatgcagttccaaagtttcacaaacagtggaaatctcacaa gagttgctctgggtttccacagctggagacttccgagctaacactccacggggcagttca ccttcttttgacaaagatggccataatggcctcagagtcttgaggcagctttttgtcatg taa >gi568815590f:54358139_54559992|GENSCAN_predicted_peptide_5|126_aa MSESSGIFELIRIIDKEIHSSSFSVDSLQPCFIAQDNTSQELYSINLNSGIPKRPAGINY HTLALVVTEQGDLRDGGVQDSTHIHRNVVDAASKEGQVPHVQMATSSSRLGFSHFANPAE GTLADQ >gi568815590f:54358139_54559992|GENSCAN_predicted_CDS_5|381_bp atgtcagagagcagtggcatatttgagctgataagaattatcgataaggaaattcactcc agcagcttctctgtggactccctgcagccttgcttcattgctcaagacaatacaagtcag gaattgtacagtatcaacttgaactcaggaatccccaagcgtccagcaggcatcaactat cacactctggccctggtggtgacagagcagggagacctaagagatggcggggtccaggat tccactcacatccacaggaatgtggtagatgctgccagcaaggagggccaagtccctcat gtccagatggccaccagcagctccaggctcggattctcccattttgcaaacccagcagaa ggaactcttgctgaccagtag >gi568815590f:54358139_54559992|GENSCAN_predicted_peptide_6|146_aa MHAPVIGVLTGRGRRSSGAPQGLDRTRTAASAGLRPDLLSSANPDLPRCPRVLPDRLECP SSSPCFPSIENTKACRKSPSLCLVTQGPTDKRGSDSSSPTVRHLPELISAPADQRPQLAT GTASWCASRFLAQPLEPRVRPKDPGT >gi568815590f:54358139_54559992|GENSCAN_predicted_CDS_6|441_bp atgcatgctccggtcatcggggtcctcacggggagaggccggcgcagcagtggagcaccg caggggctggatcggacccgcactgcagcatctgcagggctccgacctgacttgctctcc agcgccaacccggatctgcctaggtgcccacgggtccttcccgatcgacttgagtgtcct tcctccagcccctgctttcccagcatcgaaaacacaaaagcctgccggaaatcacccagc ctttgcctggtcactcaggggcccacggataagcgtggctctgattcctccagccccacg gttcgccacctacccgagttaatttctgcgcccgctgaccaaaggccccaactggcaacg ggcacagcttcttggtgcgctagtcgcttccttgcccagcctttagagcctcgggtccgg cccaaggacccggggacgtga >gi568815590f:54358139_54559992|GENSCAN_predicted_peptide_7|414_aa MSSPDAGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKGEAPANSGAPAGAAG RAKGESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKRPFVEE AERLRVQHMQDHPNYKYRPRRRKQVKRLKRVEGGFLHGLAEPQAAALGPEGGRVAMDGLG LQFPEQGFPAGPPLLPPHMGGHYRDCQSLGAPPLDGYPLPTPDTSPLDGVDPDPAFFAAP MPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPRLGPEPAGPSIPGLLAPPSALHVYYGAM GSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPSPPPEALPCRDGTDPSQPAELLGEVDR TEFEQYLHFVCKPEMGLPYQGHDSGVNLPDSHGAISSVVSDASSAVYYCNYPDV >gi568815590f:54358139_54559992|GENSCAN_predicted_CDS_7|1245_bp atgagcagcccggatgcgggatacgccagtgacgaccagagccagacccagagcgcgctg cccgcggtgatggccgggctgggcccctgcccctgggccgagtcgctgagccccatcggg gacatgaaggtgaagggcgaggcgccggcgaacagcggagcaccggccggggccgcgggc cgagccaagggcgagtcccgtatccggcggccgatgaacgctttcatggtgtgggctaag gacgagcgcaagcggctggcgcagcagaatccagacctgcacaacgccgagttgagcaag atgctgggcaagtcgtggaaggcgctgacgctggcggagaagcggcccttcgtggaggag gcagagcggctgcgcgtgcagcacatgcaggaccaccccaactacaagtaccggccgcgg cggcgcaagcaggtgaagcggctgaagcgggtggagggcggcttcctgcacggcctggct gagccgcaggcggccgcgctgggccccgagggcggccgcgtggccatggacggcctgggc ctccagttccccgagcagggcttccccgccggcccgccgctgctgcctccgcacatgggc ggccactaccgcgactgccagagtctgggcgcgcctccgctcgacggctacccgttgccc acgcccgacacgtccccgctggacggcgtggaccccgacccggctttcttcgccgccccg atgcccggggactgcccggcggccggcacctacagctacgcgcaggtctcggactacgct ggccccccggagcctcccgccggtcccatgcacccccgactcggcccagagcccgcgggt ccctcgattccgggcctcctggcgccacccagcgcccttcacgtgtactacggcgcgatg ggctcgcccggggcgggcggcgggcgcggcttccagatgcagccgcaacaccagcaccag caccagcaccagcaccaccccccgggccccggacagccgtcgccccctccggaggcactg ccctgccgggacggcacggaccccagtcagcccgccgagctcctcggggaggtggaccgc acggaatttgaacagtatctgcacttcgtgtgcaagcctgagatgggcctcccctaccag gggcatgactccggtgtgaatctccccgacagccacggggccatttcctcggtggtgtcc gacgccagctccgcggtatattactgcaactatcctgacgtgtga >gi568815590f:54358139_54559992|GENSCAN_predicted_peptide_8|412_aa MRGLRACSRDLRLAAGAGQAWGSAGRGRRGPAPRGQTGTQGVGSGGSGAGVRPEPCAGQR PAGGKALRGEGQAQAAQGYAAANGWGTVLVSGLVPQDPRAAPPPGGRQLLLLRPPRWVPS CFPNSQSADISQEDRKPAKRRPTHFYHPDEHSSSDPISLRLSGQISPPGQNQLLRSQPYD MRVPEVPISHPRRPALAQPLALDLRPTWKAPDARLALGSARGPGRRLALHLQVPEGARQL GPRTGAQPSRHRRQRAPGLALVVTGVSRIRPAHGAPGLPRAPKPSAGRGGKEGGERGTEM LRRLRGSPRPNRHPRSGPVPAHGVASAAAGPATLRAPPGRGASSETMWPTENLSSRRRAP PQRRRTRAPSIARSLPGSGRLQGLIFPSTGKNVQRHKSPPLPSEESLSQRSR >gi568815590f:54358139_54559992|GENSCAN_predicted_CDS_8|1239_bp atgagggggctgcgggcttgcagtagagacctgcgcttagctgctggtgcggggcaggcc tggggcagcgcggggcgaggaaggcggggtccggctccacgcggtcaaacggggacgcag ggcgtgggcagtggtggcagtggagcgggagtacgaccagagccatgtgcggggcagcga ccggcaggagggaaagccctgcgcggggaagggcaggcccaggccgcccagggctacgcg gccgcgaacggttgggggacggtcctggtctctggcctggtcccgcaagatccccgcgct gcgccgcctcccggaggaaggcagctcctcctcctgcgcccacctcgctgggtacccagt tgtttcccaaactcacaaagcgcagacatctcccaagaggacaggaagcctgcaaagagg cgccccacacacttttatcacccagacgaacacagtagcagcgaccccatcagcctacga ttatcggggcagatctcacctcctggacagaatcagcttcttaggtctcagccttacgac atgagggtcccggaagtgccaataagccatccacgcagaccagcacttgcccagcctctg gcgctcgaccttcgccccacctggaaggcgccggatgcgagactcgcccttggctcggcc cgcggccccggccggcgcctcgcccttcaccttcaagtccccgagggagctcggcaactc ggcccaaggacaggggcccagcctagccgtcaccgccggcagcgcgctccgggtctggct ctggtagtcactggcgtatcccgcatccggcctgctcacggcgctccaggcctgccccgc gcgccgaaaccctccgccggacgcggaggaaaagaaggtggtgagcgaggcacggagatg ctccgacggctgcgcgggtctccgaggccgaaccgccacccgcgttccgggccagtccca gcccacggcgtggccagcgctgccgccggtccagcgacactgcgggcgcccccgggccgc ggggcctcttctgagaccatgtggcccacagagaatctaagctcccgccgccgggctcca cctcaacggcggagaactcgggcgccgagcatcgcccgatccctacccggcagcggacgg ctacagggtttgattttcccctccacaggaaagaacgtgcagaggcacaaatcccctcct cttccctcagaggaaagcctgtcccagcggtcccgctga >gi568815590f:54358139_54559992|GENSCAN_predicted_peptide_9|205_aa MQSDQALVILPVHHRENCLPVNYDCSGGWSLGGSGCIARYRIALVGQKLLPHQLPPSSRG NIRSEELRYTGRKQTFRVVGAGPRSQTQLGGSQNEVSSSITCAAPTLPEYAACTNSRQSP SDTPAPLLQLLRLLSREPRMRALFRSHFLHMEIGTLEPQSHQHSLALLSAFTPAAESTAS DESYSLISVYMGASAHKSLKRDADL >gi568815590f:54358139_54559992|GENSCAN_predicted_CDS_9|618_bp atgcaaagtgatcaggccctagtcatcttgccagtgcaccacagagaaaactgccttcca gtgaattatgactgctcaggagggtggtcactcggaggctcaggctgcattgctcggtac agaattgcgctggtggggcagaagcttcttcctcatcagctgccccccagctcccgtgga aatatcaggtcagaggaactcaggtacacagggaggaaacagaccttccgagtggttggg gctggccccaggtcacagacccagctagggggctctcaaaatgaggtgtccagctccatc acctgtgcagcgcccaccctgcctgagtatgctgcttgcacgaactctcgccagagcccc tcagacacccctgcgcccctgctccagctgctcaggctgctgtccagagaacccagaatg cgagctctcttcagatctcacttcctccatatggaaattggaactctcgaaccccagtct caccaacactctctggccctcttgtctgccttcacccctgctgctgaaagcacagcttcg gatgaatcctatagtctcatctctgtttacatgggagcttctgctcacaagagcctgaag agggatgctgatctatga >gi568815590f:54358139_54559992|GENSCAN_predicted_peptide_10|553_aa MEAMLGRQGSFLIILKDRYRCPVTQACFSHTGISPPSGTWVELEAMILSKVIKEQKTKCR ILSLCSLATSPLAPSHIIRFLCIALEGKDLALERRKPAPARVPTKCPLRGSCHLFGFLSL DGAVFLDDVQWMNKWRLYYQVLNFGMIVSSALMIWKGLMVITGSESPIVLLSGSMEPAFH RGYLLFLTNRVEDPIRVGEIAVLRIEGRKIPIVHRVLKIHEKQNGHIKFLTKGDNNAVDD RGLYKQDQHWLEKKDVVGRARGFVPYIGIGTSLMNDYPKHKYEVLFLLGLFVLVHHGTAF HRRKGRQFLMLPSQKSNTEEGKEKEKKNGIRALTSVRSNQSVPFPREDITRRSHSGAISK PSPDTQSAFTVDFKQRQGGQASGHQQQNFLFEVPVLERSKIGIQETYLNVIKAIYDKPTA NVILKREKLKAFSLRTGIRQGRPLSPLLSNIVPEVPARAMRQKKKIKGIQIGKEEVKLLL FTDDMIVYLENPKDSSKKLPELHLCFSAFLIKGLLREYLNAAAKDVLKSVGNNEDKAFLV LSCYTETEQSYDV >gi568815590f:54358139_54559992|GENSCAN_predicted_CDS_10|1662_bp atggaggcaatgctgggccggcaaggcagcttcctcatcatcttaaaggaccgataccga tgtcctgtcactcaggcctgcttctcccacaccgggatctcacctccttcagggacatgg gtggagctggaagccatgatccttagcaaagtaattaaggaacaaaaaactaaatgccgt atcttatcactttgctctttggccacaagcccactggccccaagccacatcatcaggttc ctctgcattgccttggaagggaaggatttggcattggagcgcaggaagcctgctccagcc agagttcctacaaagtgccctctgcgtggcagctgccatctgttcggctttctctcctta gacggagcagtctttttggatgatgtgcagtggatgaacaagtggcggctctattatcaa gtcctaaattttggaatgattgtctcatcagcactcatgatctggaaggggttaatggta ataactggaagtgaaagtccaattgtgttgctcagtggcagcatggaacctgcatttcat agaggatatcttctctttctaacaaatcgagttgaagatcccatacgagtgggagaaatc gctgttctaaggatagaaggaagaaagattcctatagttcaccgagtcttgaagattcat gaaaagcaaaatgggcatatcaagtttttgaccaaaggagataataatgcggttgatgac agaggcctctataaacaagatcaacattggctagagaaaaaagatgtcgtggggagagcc aggggatttgttccttatattggaattgggacgagcctcatgaatgactatcctaaacat aagtatgaagtgctctttttgctgggtttatttgtgctggtccatcatgggacagccttt catcgtcgcaaaggaaggcagttcctgatgctcccctctcaaaaatccaacacagaagag ggaaaagaaaaagaaaaaaagaatggcattagagcccttacaagtgtccgaagcaaccag agtgttccttttccacgcgaggacataacaagaaggtcccactcaggagccataagcaag ccctcaccagacactcaatctgccttcactgtggacttcaagcagaggcagggcggccaa gcttctggtcaccaacagcagaactttctatttgaagtcccggttttggaaagaagcaaa atcggcatacaagagacatacctcaatgtaataaaagcaatctatgacaaacccacagcc aacgttatactgaaacgggaaaagttgaaagcattctccctgagaactggcataagacaa ggacgcccactctcaccacttctatctaacatagtaccggaagtcccagccagagcaatg agacaaaagaaaaaaatcaagggcatccaaattggtaaagaggaagtcaaacttttgttg tttactgatgatatgatcgtatacctagaaaaccctaaagactcatccaaaaagctccca gaactgcatctctgtttctccgccttcctaatcaaaggcttacttcgtgagtatctgaat gcagctgcaaaagatgtcctaaaatccgtgggaaataatgaagacaaggctttccttgtt ctatcctgctacacagaaacagagcagtcttatgatgtatag