GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:57:36

Sequence gi568815596r:171417273_171617716 : 200444 bp : 40.74% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2985   3128  144  2  0   79  103    73 0.745   8.08
 1.02 Intr +   6722   7196  475  1  1   42   64   303 0.139  15.01
 1.03 Term +  14639  14799  161  0  2   61   52   127 0.625   3.52
 1.04 PlyA +  15149  15154    6                               1.05

 2.00 Prom +  16538  16577   40                             -10.94
 2.01 Init +  17306  17431  126  1  0   82   60   160 0.771  10.82
 2.02 Intr +  17811  17914  104  2  2   75   93    76 0.880   4.85
 2.03 Intr +  26251  26341   91  1  1   95   83   -17 0.023  -2.42
 2.04 Intr +  31409  31545  137  1  2  112   68    45 0.039   3.35
 2.05 Intr +  32607  32685   79  0  1   80   74    43 0.049   0.73
 2.06 Intr +  40699  40803  105  0  0   95   80    99 0.901   9.29
 2.07 Intr +  41100  41205  106  2  1   87   80    32 0.859   1.17
 2.08 Intr +  51616  51758  143  2  2  111   59    65 0.822   5.05
 2.09 Intr +  56594  56703  110  1  2   83   89    43 0.814   2.06
 2.10 Intr +  59588  59678   91  2  1   92  106    77 0.997   8.98
 2.11 Intr +  62766  62921  156  2  0   68   89   122 0.998   9.59
 2.12 Term +  74150  74566  417  1  0   92   38   366 0.999  26.29
 2.13 PlyA +  74926  74931    6                               1.05

 3.00 Prom +  75535  75574   40                              -8.85
 3.01 Init +  78475  78759  285  0  0   64   55   183 0.152   9.72
 3.02 Intr +  96870  96953   84  2  0  117   99   -16 0.151   1.50
 3.03 Term +  99449  99574  126  1  0   -5   54   205 0.239   5.10
 3.04 PlyA +  99670  99675    6                              -4.33

 4.02 PlyA -  99987  99982    6                               1.05
 4.01 Sngl - 100444  99998  447  1  0   95   39   813 0.859  73.07
 4.00 Prom - 103108 103069   40                              -9.25

 5.00 Prom + 103874 103913   40                              -9.05
 5.01 Init + 105280 105487  208  0  1   90   33   384 0.259  30.13
 5.02 Intr + 105780 106040  261  1  0   45   38   153 0.159   2.54
 5.03 Intr + 106887 106942   56  1  2   15   41   124 0.054  -1.82
 5.04 Intr + 112626 112879  254  2  2   49   51   216 0.085   9.31
 5.05 Intr + 124329 124521  193  0  1   73   82    96 0.161   6.07
 5.06 Intr + 130019 130054   36  1  0   75  100    33 0.117   0.74
 5.07 Intr + 136074 136228  155  2  2   83   95    45 0.336   2.65
 5.08 Term + 137252 137555  304  2  1   62   38   176 0.650   3.66
 5.09 PlyA + 138665 138670    6                               1.05

 6.06 PlyA - 139784 139779    6                               1.05
 6.05 Term - 140339 140154  186  2  0   57   38   104 0.111  -1.19
 6.04 Intr - 141613 141525   89  2  2   77   72    72 0.257   3.27
 6.03 Intr - 143898 143725  174  1  0   53   81   102 0.382   4.99
 6.02 Intr - 144245 143954  292  2  1   69   61   230 0.166  14.18
 6.01 Init - 145233 145141   93  0  0   79   67    40 0.028   1.50
 6.00 Prom - 148507 148468   40                              -7.95

 7.00 Prom + 150284 150323   40                              -5.45
 7.01 Init + 154790 154904  115  1  1   41   76    73 0.336   1.82
 7.02 Intr + 155480 155682  203  0  2   22  100   121 0.357   4.78
 7.03 Intr + 156110 156242  133  1  1   60   50    48 0.206  -2.50
 7.04 Intr + 160548 160690  143  1  2    1   98   168 0.364   8.25
 7.05 Intr + 169013 169139  127  1  1   36   25   132 0.474   1.03
 7.06 Intr + 169809 169927  119  1  2   62   54    64 0.546  -0.34
 7.07 Term + 169970 170302  333  1  0  -42   38   355 0.391  11.13
 7.08 PlyA + 170906 170911    6                               1.05

 8.00 Prom + 172259 172298   40                              -7.35
 8.01 Init + 173535 173624   90  2  0   36   56   112 0.160   3.34
 8.02 Intr + 182599 182716  118  0  1   74   75    88 0.747   5.22
 8.03 Intr + 190248 190354  107  1  2   93   82    70 0.114   5.91
 8.04 Term + 193739 193771   33  1  0   81   43    56 0.088  -2.99
 8.05 PlyA + 195542 195547    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  30815  30654  162  2  0  110   38    96 0.869   3.75
S.002 Term +  63702  63842  141  2  0   78   42   132 0.885   4.55
S.003 Init + 112632 112879  248  2  2   53   51   208 0.839  10.71

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:171417273_171617716|GENSCAN_predicted_peptide_1|259_aa
MVLALELVAFDSWYDALPARLFFCLFSREWHGYAERQEEEKEDEADMVPQDLVPCTPAAP
AVAKRGQGTAQAMASEGTSSKPWQLPRVIGPAGAQKSRTEFEEPLPRFQKMYRNAWMSKQ
KFAAWVEPSWRTSAMAVRKGNVGLEPPHRVPTGTLPSGDVRRGPPSSRLQNCRSTDSLHH
VPGKASDTLHQPMKAARTGAVPCKATVWTPEPTVGVTCDMPGPAASGARSQLLCRHLEQP
AGYHTTLHSPAQTPSPARG
>gi568815596r:171417273_171617716|GENSCAN_predicted_CDS_1|780_bp
atggtgctggcactagaacttgtagcttttgactcctggtatgatgctcttcccgctaga
ctattcttttgtcttttcagtagagagtggcatggttatgcagagagacaagaagaagaa
aaagaagatgaggcagacatggtgcctcaggacttggtgccctgcaccccagctgctcca
gctgtggctaaaaggggccaaggtacagcacaggccatggcttcagaggggacaagctcc
aagccttggcagcttccacgtgttattgggcctgcgggtgcacagaagtcaagaactgag
tttgaggaacctctgcctagatttcagaagatgtatagaaatgcctggatgtccaagcag
aagtttgctgcatgggtggagccctcatggagaacctctgctatggcagtgcggaaggga
aatgtgggattggagcccccacacagagtccctactgggacactgcctagtggagatgtg
agaagagggcccccatcctccagactccagaactgtagatccactgacagcttgcaccat
gtgcctggaaaagcctcagacactctacaccagcccatgaaagcagccaggacaggggct
gtaccctgcaaagccacagtctggactccagagcccactgtgggagtcacttgtgacatg
cctggtccagccgcaagcggtgcacggagccagctcctgtgccggcacttggagcaacca
gctggataccacacaactctgcactcaccagctcagacaccctctcctgccaggggctga
>gi568815596r:171417273_171617716|GENSCAN_predicted_peptide_2|554_aa
MGPTRKPNVCSRLSRRALGCFSRDAGVVQRTNLGILRALVCQESTKFKNVWTTHSRSPIA
YERGRIYFDNYRRCVSSVASEPRKLYEMPKCSKSEKIEDALLWECPVGDILPNSSDYKSS
LIALTAHNWLLRISATTGKILEKIYLAPYCKFRYLSWDTPQEVIAVKSAQNRGSAVARQI
FGNVTDATLSHGILIVMYSSGLVRLYSFQTIAEQFMQQKLDLGCACRWGGTTGTVGEAPF
GIPCNIKITDMPPLLFEVSSLENAFQIGGHPWHYIVTPNKKKQKGVFHICALKDNSLAKN
GIQEMDCCSLESDWIYFHPDASGRIIHVGPNQVKVLKLTEIENNSSQHQISEDFVILANR
ENHKTFKIVDYEDELDLLSVVAVTQIDAEGKAHLDFHCNEYGTLLKSIPLVESWDVGITR
VRNTTDAAGIVLKELKRQSSLGVFHLLVAVDGINALWGRTALKREDKCPIAPEELALVHN
LRKMMKNDWHGGAIVLTLNQTGSLFKPRKAYLLQELLVKEGFDALDPFIPILVSNYNPKE
FESRIQYCLENNWL
>gi568815596r:171417273_171617716|GENSCAN_predicted_CDS_2|1665_bp
atgggcccgacccggaagcccaacgtgtgcagccggctgagtcgccgggcgctgggctgc
ttctcgcgcgacgcaggcgtggtgcagaggaccaacctgggcatcctgcgggcgctggtg
tgccaggaaagtactaaatttaagaatgtctggacaactcattccaggtcacctatagcc
tatgagagaggaagaatatattttgacaattatcggcgctgtgtcagcagtgttgcatct
gagccaagaaaactttatgaaatgccaaaatgttccaaatcagaaaaaatagaggatgct
ttattatgggaatgcccagtgggagatatacttcccaattcatcagattataagtcctca
ctcatagcactgactgctcataattggctacttcgtatatcagcaactacgggaaaaatc
cttgagaaaatatatcttgcaccttattgcaaattcagatacttgagctgggacactcct
caagaagtcattgcagttaagtcagctcagaacagaggctcagcagtggcccggcagatt
tttgggaacgttacagatgctaccttgtctcatggaatactgattgtgatgtacagctca
ggactggtcagactctatagcttccaaaccatcgctgaacagttcatgcaacagaaactt
gacttagggtgtgcatgcagatggggtgggactactggaactgtaggagaggctcctttt
ggcattccttgtaatattaaaatcacagacatgccaccactgctctttgaggtgtcatcc
ctggagaatgcttttcagattggaggccatccttggcactacatcgtcacacctaataag
aagaaacagaaaggagttttccatatttgtgccctaaaagacaattccctggcaaaaaat
gggatccaagaaatggattgttgttctctagaatctgactggatctatttccatcctgat
gcttctggtagaataatacatgttggtccaaatcaagtcaaagttttgaagctaactgaa
atagaaaataatagttctcagcatcagatctctgaagattttgtcattttggccaacagg
gagaaccataaaactttcaaaattgtggactatgaagatgagttagatttgctttctgtg
gtagctgttactcaaatagatgctgaaggaaaagctcacctggatttccactgtaatgaa
tatggaactttacttaaaagcattccactagtggagtcatgggatgtgggcataacacgg
gtgaggaacaccacagatgcagctggaattgtgctgaaagagctaaagaggcaaagttct
ttgggtgtttttcacctgctggtggccgtggatggaatcaatgctctttggggaaggact
gctctgaaaagagaagataaatgcccgattgccccagaggaattagcacttgttcacaac
ctgaggaaaatgatgaaaaatgattggcatggaggtgccattgtgttgactttgaaccag
actgggtctctctttaagccccggaaagcctatctgctccaggagttgctggtaaaggaa
ggatttgatgccctggatccctttattcccatcctggtttccaactataacccaaaggaa
tttgaaagtcgtattcagtattgtttggaaaacaattggctttaa
>gi568815596r:171417273_171617716|GENSCAN_predicted_peptide_3|164_aa
MCKAAGNAPLGTGLFQDITEDRKPGDHSSVELLGLTISITKEGTWFEMVSDSIIPKVGPV
SQERGLDLFKEPGQAQWLTPVIPAMWESEVAGSLEGTVNIKNLLPSWHFVSVCCTISAQN
NFQRCNSLKGTRQRVVKNQGQAQQEANATHRRVEASGDGVTGPL
>gi568815596r:171417273_171617716|GENSCAN_predicted_CDS_3|495_bp
atgtgcaaagcagctggaaatgccccacttgggactggactgtttcaggatataactgag
gacaggaaacctggggatcacagttcggtggagctccttggcctcacgatcagcatcaca
aaggaaggaacctggtttgaaatggtttctgacagcataattcctaaagttgggcctgta
tctcaggaaagaggtctagatctcttcaaagaacctggacaggcacagtggctcacgcct
gtaatcccagcaatgtgggagtccgaggtggctggatcacttgaggggactgtgaacata
aaaaacctccttccttcatggcattttgtgtctgtgtgttgtactatatctgctcaaaac
aatttccagaggtgcaacagccttaagggaactcggcaaagagtggtgaagaaccaggga
caagcacagcaggaagccaatgccactcatagacgcgtagaagcaagtggtgatggtgtt
actggacccctgtga
>gi568815596r:171417273_171617716|GENSCAN_predicted_peptide_4|148_aa
MAEVEQKKKRTFRKFTYRGVDLDQLLDMSYEQLEQPMQLYSARQRRRLNRGLRRKQHSLL
KRLRKAKKEAPPMEKQEVVKTHLRDMIILPEMVGSMVGVYNGKTFNQVEIKPEMIGHYLG
EFSITYKPVKHGRPGIGATHSSRFIPLK
>gi568815596r:171417273_171617716|GENSCAN_predicted_CDS_4|447_bp
atggcagaagtagagcagaagaagaagcggaccttccgcaagttcacctaccgcggcgtg
gacctggaccagctgctggacatgtcctacgagcagctggagcagccgatgcagctgtac
agtgcgcgccagcggcggcggctgaaccggggcctgcggcggaagcagcactccctgctg
aagcgcctgcgcaaggccaagaaggaggcgccgcccatggagaagcaggaagtggtgaag
acgcacctgcgggacatgatcatcctacccgagatggtgggcagcatggtgggcgtctac
aacggcaagaccttcaaccaggtggagatcaagcccgagatgatcggccactacctgggc
gagttctccatcacctacaagcccgtaaagcacggccggcccggcatcggggccacccac
tcctcccgcttcatccctctcaagtaa
>gi568815596r:171417273_171617716|GENSCAN_predicted_peptide_5|488_aa
MEGYWRFLALLGSALLVGFLSVIFALVWVLHYREGLGWDGSALEFNWHPVLMVTGFVFIQ
GIGTGTSWGGRTGGASKTQKLEPGRDAGAPRSRAAAAHAPRKVAPVDRACAIGGAETLSP
QPFAELASRLQMRRGRQRGKDCRSENLPVFVTSSALAFKGFPGVRSASGCEIGIRVSMNV
TLLGNRIFADVNYGSQDEVTLDLDWALNPMIGALIRERRRRFETKTHREEGYVKTGRDWS
DVATSQGKWRIVGSPRGRRGLPWTWKCSKLLMKSIHAGLNAVAAILAIISVVAVFENHNV
NNIANMYSLHSWVGLIAVICYLLQEPDCNGLKREQKLLSGFSVFLLPWAPLSLRAFLMPI
HVYSGIVIFGTVIATALMGLTEKLIFSLRDPAYSTFPPEGVFVNTLGLLILVFGALIFWI
VTRPQWKRPKEPNSTILHPNGGTEQGARGSMPAYSGNNMDKSDSELNSEVAARKRNLALD
EAGQRSTM
>gi568815596r:171417273_171617716|GENSCAN_predicted_CDS_5|1467_bp
atggagggctactggcgcttcctggcgctgctggggtcggcactgctcgtcggcttcctg
tcggtgatcttcgccctcgtctgggtcctccactaccgagaggggcttggctgggatggg
agcgcactagagtttaactggcacccagtgctcatggtcaccggcttcgtcttcatccag
ggcatcggtactggcacctcctgggggggacgcactggaggggcctccaagacccagaaa
cttgagcccgggcgcgatgccggagcgccgcgcagtcgggccgctgcagcccacgcgccc
cggaaagtcgcccctgtggaccgagcctgcgcgatcgggggcgcggaaacactttcgcct
cagccttttgctgaacttgcttccaggctgcagatgaggaggggaaggcagcggggaaaa
gattgccggagcgagaatttgcccgtatttgtaacatcctcggccctggcattcaaaggc
ttccctggtgtgagatcggcatcagggtgtgagatcggcatcagggtatccatgaatgtg
accttacttggaaataggatctttgcagatgttaactatggatctcaagatgaggtcacc
ctggatttagactgggctctaaatcctatgattggtgcccttataagagaaaggagaagg
agatttgagacaaagacacacagagaagaaggctatgtgaagacaggcagagattggagt
gatgtggccacaagccaaggaaaatggagaattgttggctcccctagaggccgaagagga
ctgccgtggacctggaaatgcagcaagctcctgatgaaatccatccatgcagggttaaat
gcagttgctgccattcttgcaattatctctgtggtggccgtgtttgagaaccacaatgtt
aacaatatagccaatatgtacagtctgcacagctgggttggactgatagctgtcatatgc
tatttgttacaggagccagactgcaatggattgaagagggagcagaagcttctttcaggt
ttttcagtctttctgcttccatgggctccgctttctctccgagcatttctcatgcccata
catgtttattctggaattgtcatctttggaacagtgattgcaacagcacttatgggattg
acagagaaactgattttttccctgagagatcctgcatacagtacattcccgccagaaggt
gttttcgtaaatacgcttggccttctgatcctggtgttcggggccctcattttttggata
gtcaccagaccgcaatggaaacgtcctaaggagccaaattctaccattcttcatccaaat
ggaggcactgaacagggagcaagaggttccatgccagcctactctggcaacaacatggac
aaatcagattcagagttaaacagtgaagtagcagcaaggaaaagaaacttagctctggat
gaggctgggcagagatctaccatgtaa
>gi568815596r:171417273_171617716|GENSCAN_predicted_peptide_6|277_aa
MCSWTAPVSSILTHPNWPVLRMAGLTGCASVAMKKKIEHNCQQVIAQTFTTRGDLLEIPL
TDPGLNLYTDGSSFVEKGLRKAGYAVVSDNGILESNPLTPGTSAQLAELIALTWAPELEE
GKRVKRKKAIWREREFLTSEGTPIKHQEAIRRLLLAVQKPKEVAVLHCRGHRKGTAKWIL
KPKEPQGVKWPEVTKFVTSPVAPIDNITIVEPKIGQHSLHETTQVKERKCFRGQVPFDTS
STGSRTLLMTVIQKVYSFEFLNKRFERKGFNVDSDEC
>gi568815596r:171417273_171617716|GENSCAN_predicted_CDS_6|834_bp
atgtgctcctggaccgctcccgtatcgagcatcctcacccacccaaattggcctgtcctt
agaatggctggccttacgggttgtgcctctgtagcaatgaagaaaaaaatagaacataac
tgtcaacaagtaattgctcaaaccttcaccactcgaggggaccttctagagattcccttg
actgatcctggcctcaacttgtatactgatggaagttcctttgtagaaaaaggacttcga
aaagcggggtatgcagtggtcagtgataatggaatacttgaaagtaatcccctcactcca
ggaactagcgctcagctggcagaactaatagccctcacttgggcaccagaattagaagaa
ggaaaaagggtaaaaaggaaaaaggcaatatggagagaaagggaattcctaacttccgag
ggaacacctatcaaacatcaggaagccattaggagattattattagctgtacagaaacct
aaagaggtggcagtcttacactgtcggggtcatcggaagggaactgccaagtggatattg
aagccaaaagagccacaaggagtcaagtggccagaggtcacaaaatttgtgacttcccca
gttgctcctatagataacatcactattgtagaacctaagattggtcagcacagcttacat
gaaaccacgcaagtaaaggaaagaaagtgctttagagggcaggttccatttgacacaagc
agcacaggaagccgtacacttttgatgacagtaattcaaaaagtatattcatttgagttt
ctcaataaacgctttgagagaaaaggctttaatgttgattcagatgagtgttaa
>gi568815596r:171417273_171617716|GENSCAN_predicted_peptide_7|390_aa
MLKSQETKEGGQQIQCVSKGYFIGELTDRSVVLGSSKTVSSHLYKLSIGSSATLNAHYLC
KQKLQLSELRGDVIQWLEELVKMHGLDTYLHHVLPGGLADVIKDLKEHICIGKSVMKQMG
MMNTKFREVETVKRRKGIQLRGHKGFTCLDENVKSTIKTEVLPESGPNPDLKRGLSDLVQ
ERIWGESIEPSESKFIKKPLLEELTGSQQDRSKDRNQPHPIPIGNLLMGDKGSLAQVREN
VIRQNDEHGKRGEAPDICSLGLLENMGFFLWPRICESMRKECPTNVNHGKTGRVYSVPQR
AVGIVINKQAKGKILAKKINVHIEHVKHSKIPDSFLKHMKENDQKKKEAKEKATRVQLKC
QPAPPRETHFVRTSGKKPELLEPIPYEFMA
>gi568815596r:171417273_171617716|GENSCAN_predicted_CDS_7|1173_bp
atgctgaagagccaagaaaccaaagaaggtggccaacaaatccagtgtgtcagtaaaggg
tattttattggggaacttacagacagaagcgtggtcttgggcagcagcaagacagttagt
tctcacctatataagctgagtattggatccagtgccacattaaatgcccattatttgtgt
aagcagaagttgcagctttctgagcttagaggcgatgtcatccagtggttggaggagttg
gtcaaaatgcatgggttggacacctacctccaccatgtattgcctgggggcttggcagat
gtaattaaggacctcaaggaacacatatgtatcggcaaaagcgtaatgaaacaaatggga
atgatgaacaccaaatttagagaagtggaaactgtaaagagaaggaagggaatacagctg
agggggcacaaaggcttcacctgtttggatgaaaatgtgaaaagcacaatcaaaacagaa
gtgttaccggaaagcggtcccaatccagacctcaagagagggttgtcggacctcgtgcaa
gaaagaatttggggtgagtccatagagccgagtgaaagcaagtttattaagaagccactc
ttggaagaactaactggatctcaacaggacagatccaaagaccgaaaccagccccacccc
atccccataggaaatcttcttatgggggataaagggtcacttgctcaagtcagagaaaat
gtaattcgccaaaatgatgaacacgggaaaagaggagaggcaccagatatatgttctcta
ggccttttagaaaacatggggtttttcctttggccacgtatatgtgaatctatgagaaag
gaatgccccacaaatgttaaccatggcaaaactgggagagtctacagtgttccccagcgt
gctgttggcattgttataaacaaacaagctaagggcaagattcttgccaagaaaattaat
gtgcatattgagcatgttaaacattctaagatccctgatagtttcctgaaacacatgaag
gaaaatgatcagaaaaagaaggaagccaaagagaaagctacccgggttcagctgaagtgc
cagcctgctccacccagagaaacacactttgtgagaaccagcgggaagaagcctgagctg
ctggaacctattccctatgaattcatggcttaa
>gi568815596r:171417273_171617716|GENSCAN_predicted_peptide_8|115_aa
MIAAPAAHSSLVRAPEDPAKPCPDSSPTETRTWTNTSGNGNAYPKERLLQFKAKVLLYWI
AAATQSKNKGREVKSFPVVEATPVFALDGGKLKLLPENDGESGPNAQFELTQLPP
>gi568815596r:171417273_171617716|GENSCAN_predicted_CDS_8|348_bp
atgatagcagcacctgctgctcatagcagccttgtgagagccccagaagacccagccaag
ccatgtccagattcttcacccacagaaactaggacctggactaacacatcaggcaatggc
aatgcctaccctaaagaacgacttttacagttcaaggccaaagtccttctgtattggata
gcagcagcaacccaaagcaagaacaaagggagggaggttaagagcttccctgttgtagag
gctactccagtttttgcccttgatgggggtaagctaaagctgcttcctgagaacgatggt
gagagtggcccaaatgcccagtttgagctgactcagctgccgccatga