GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:23:36

Sequence gi568815596r:232668194_232871362 : 203169 bp : 40.74% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4116   4250  135  2  0  111  100   210 0.992  24.34
 1.02 Term +  13392  13526  135  2  0  116   37   207 0.842  15.44
 1.03 PlyA +  14559  14564    6                               1.05

 2.00 Prom +  17456  17495   40                              -5.75
 2.01 Init +  18102  18147   46  2  1   94   58    18 0.754   0.31
 2.02 Intr +  18454  18528   75  2  0  139  110    41 0.911   9.97
 2.03 Intr +  22122  22188   67  1  1   96  106   -29 0.399  -3.26
 2.04 Intr +  24139  24236   98  1  2   99   89    86 0.463   8.53
 2.05 Intr +  25012  25070   59  2  2   60   62    27 0.396  -5.02
 2.06 Intr +  28709  28894  186  1  0   87   80   106 0.641   8.66
 2.07 Intr +  29200  29386  187  0  1   81   71    45 0.521   0.44
 2.08 Term +  29719  29831  113  2  2   79   43   114 0.841   3.74
 2.09 PlyA +  32621  32626    6                               1.05

 3.03 PlyA -  32748  32743    6                               1.05
 3.02 Term -  62026  61652  375  1  0   98   49   474 0.927  38.25
 3.01 Init -  64589  64587    3  2  0   90   54     0 0.155  -3.15
 3.00 Prom -  65563  65524   40                              -7.75

 4.00 Prom +  65932  65971   40                              -8.65
 4.01 Init +  67005  67045   41  2  2   51  101    68 0.464   4.11
 4.02 Intr +  79422  79551  130  0  1  118  116   107 0.981  16.28
 4.03 Intr +  80794  80889   96  0  0   87   72   100 0.983   7.59
 4.04 Intr +  88030  88141  112  0  1  100   83    98 0.920   9.53
 4.05 Intr +  92287  92398  112  2  1  116   88    38 0.913   5.12
 4.06 Intr +  93203  93243   41  2  2  109   82    69 0.633   5.35
 4.07 Term +  96199  96323  125  2  2   83   37    38 0.209  -4.23
 4.08 PlyA +  97028  97033    6                              -0.45

 5.03 PlyA -  98033  98028    6                               1.05
 5.02 Term - 100620  99998  623  1  2   54   47   522 0.999  38.29
 5.01 Init - 103169 102710  460  2  1   71   91   302 0.983  24.76
 5.00 Prom - 107429 107390   40                              -6.05

 6.00 Prom + 107725 107764   40                              -4.95
 6.01 Init + 116946 117013   68  2  2   65  110    11 0.483   1.60
 6.02 Intr + 117380 117450   71  2  2   73   85     0 0.299  -3.89
 6.03 Intr + 118957 119136  180  2  0  102   99   131 0.985  14.52
 6.04 Intr + 122505 122722  218  1  2   93   69   291 0.731  25.00
 6.05 Intr + 122815 122977  163  0  1   98   87   238 0.994  23.33
 6.06 Intr + 123065 123253  189  0  0   49   80   225 0.961  16.44
 6.07 Intr + 126555 126751  197  1  2   84   98   244 0.991  23.21
 6.08 Intr + 127869 128060  192  2  0   78   28    92 0.645   0.97
 6.09 Intr + 130752 130845   94  2  1   56  103    74 0.939   4.32
 6.10 Intr + 138298 138464  167  2  2   67  115    63 0.942   5.66
 6.11 Intr + 141527 141618   92  1  2   83   61   136 0.999   8.27
 6.12 Intr + 143051 143158  108  2  0   39   73   140 0.983   6.08
 6.13 Intr + 144198 144298  101  0  2   97   98    58 0.998   6.53
 6.14 Intr + 147444 147544  101  1  2  100   47   159 0.537  11.91
 6.15 Intr + 148678 148839  162  0  0   32   91   320 0.535  25.95
 6.16 Intr + 151634 151792  159  1  0  136   60   251 0.998  26.36
 6.17 Intr + 156296 156521  226  1  1  -17   32   263 0.284   6.84
 6.18 Intr + 163434 163591  158  1  2   45   26    63 0.449  -5.39
 6.19 Intr + 164664 164900  237  2  0   47   98   560 0.988  49.79
 6.20 Intr + 171656 171778  123  1  0   78   83   180 0.998  16.36
 6.21 Intr + 175853 176062  210  1  0   43   78   402 0.997  32.89
 6.22 Intr + 176176 176381  206  0  2  108   91    90 0.977   8.38
 6.23 Intr + 177539 177693  155  2  2   50   95   200 0.980  15.69
 6.24 Intr + 179155 179378  224  2  2  116   75   386 0.974  37.02
 6.25 Intr + 182069 182216  148  1  1   95   71    79 0.966   5.79
 6.26 Term + 188600 188667   68  0  2   84   53   157 0.954   8.92
 6.27 PlyA + 190397 190402    6                               1.05

 7.00 Prom + 199230 199269   40                              -6.55
 7.01 Init + 202149 202221   73  2  1   90  110   178 0.820  19.49

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:232668194_232871362|GENSCAN_predicted_peptide_1|89_aa FLLIFHKAAAGELQEDSGLMALAKLSEIDVALEGVKGAKNFFEAKVQALSSASKFEAELK AEQDERKREEEERRLRQAAFQKLKANFNT >gi568815596r:232668194_232871362|GENSCAN_predicted_CDS_1|270_bp ttcctgctcattttccacaaggccgcggcaggggagctgcaggaggacagtgggctgatg gcgctggcaaagctttctgagatcgatgtggccctggagggtgtcaaaggtgccaagaac ttctttgaagccaaggtccaagccttgtcatcggccagtaagtttgaagcagagttgaaa gctgagcaagatgagcggaagcgggaggaggaggagaggcggctccgccaggcagccttc cagaaactcaaggccaacttcaatacatag >gi568815596r:232668194_232871362|GENSCAN_predicted_peptide_2|276_aa MVTAQTFIGSFLGLYRGLLLTVEDSAQRLSSGSFLDASRPVGRNKAMCICPFSHCYKELP EVGLEYSGIIIAHDIIKLLDSKDPSTSDSQVAETTGSFRDCQTVFASNKTVSGSKDGNCE EKCRTKSCGKVLIWKTSPQVMQRLLNCKRALGVQDSESHGAWSGPNPGHQHHSTQGQVTS RSVSLPCPALLRAPAPPPRPGPPVSPVRVRAPAPPARAGPAAWTPTGRSGRLGPSGDGTE HLDGGDSFRGESTGSGPGLRRGAATMIGTLGPRLGN >gi568815596r:232668194_232871362|GENSCAN_predicted_CDS_2|831_bp atggtcacagctcagaccttcataggttccttcctgggtctctaccgtggattacttctc actgttgaagactcagctcagagattatcttcaggaagcttccttgatgcttccaggcca gttgggaggaataaagccatgtgtatttgtccgttctcacattgctataaagaactacct gaggttgggctggagtacagtggcataatcatagctcacgacatcatcaaactcctggac tcaaaagatccttccacctcagactcccaagtagctgaaactacaggcagcttcagggac tgccaaactgtttttgcttcaaacaagactgtttcaggaagcaaggatggcaactgcgaa gaaaaatgccggactaaaagctgtggaaaagttctaatttggaaaaccagcccgcaagtg atgcagaggcttttgaactgcaaaagagctctgggcgtacaggacagtgagtcacacggt gcctggtccgggcccaaccctggacaccaacaccactcaactcaagggcaggtaaccagc cgttccgtgtcgctcccctgcccggctctgctccgcgccccggccccgccgccgcgtccc gggccgcctgtgtctccggtccgcgttcgcgcccctgctcctcctgcccgagctggcccg gccgcatggacacccacaggccgctccggccggcttgggccgtcaggcgacgggacagag catctggacggaggagactccttccgtggcgagagtactgggtctggaccaggcctgagg cgaggtgcagccaccatgatcggaaccttgggaccgaggctgggaaactag >gi568815596r:232668194_232871362|GENSCAN_predicted_peptide_3|125_aa 
MLSGTADAMDFRDLKSPAGLQVLNNYLADKSYIEGYVPSQADVAVFEAASGPPPADLCYA LRWYNHIKSYEKEKASLPGVKKALVKYGPVDMEDTIGRGAKDSKDDDDIDLFGADGRRKV KKRRG >gi568815596r:232668194_232871362|GENSCAN_predicted_CDS_3|378_bp atgctctcaggtacagctgatgccatggatttcagagacctgaaaagccctgccggcctc caggtgctcaacaattacctggcggacaagagctacatcgaggggtatgtgccatcacaa gcagatgtggcagtatttgaagcagcatctggcccaccgcctgctgacttgtgttatgcc cttcgttggtataatcacatcaagtcttacgaaaaagaaaaggccagccttccaggagtg aagaaggctttggtcaagtatggtcctgtggatatggaagacactataggaagaggagct aaagatagtaaagatgatgatgacattgatctctttggagctgatgggaggagaaaagtg aagaaacgaagaggctaa >gi568815596r:232668194_232871362|GENSCAN_predicted_peptide_4|218_aa MAAETQTLNFGPEWLRALSSGGSITSPPLSPALPKYKLADYRYGREEMLALFLKDNKIPS DLLDKEFLPILQEEPLPPLALVPFTEEEQRNFSMSVNSAAVLRLTGRGGGGTVVGAPRGR SSSRGRGRGRGECGFYQRSFDEVEGVFGRGGGREMHRSQSWEERGDRRFEKPGRKDVGKP KHCVHFWDHPLKGSTHLSRRAGAFEALRWELCEGNGAV >gi568815596r:232668194_232871362|GENSCAN_predicted_CDS_4|657_bp atggcagcggaaacgcagacactgaactttgggcctgaatggctccgagctctgtccagt ggtgggagtattacatcccctcctctttctccagcattgccgaagtataaattagcagat tatcgttacggcagagaagaaatgttagcacttttccttaaagacaacaagataccttca gaccttctggataaagaatttctgcctatcctccaggaggaaccccttccaccattggct ctggtaccctttacagaagaagaacagagaaacttttccatgtctgtaaatagtgctgct gtcctgcgattgacaggacgaggaggaggaggaacagtggtgggggctcctagaggtcga agttcttcaagagggcgaggcagaggcagaggtgaatgtggtttctaccaaagaagtttt gatgaagtagagggtgtttttggtcgaggaggtggcagagaaatgcatagatcgcagagc tgggaggaaaggggtgacagacgttttgaaaaaccaggacgaaaagatgtaggcaagcct aagcattgtgttcacttctgggaccaccccttgaaagggtccacacatttgagtaggagg gctggggcatttgaagccttgcgatgggaactgtgtgaaggaaatggagctgtttag >gi568815596r:232668194_232871362|GENSCAN_predicted_peptide_5|360_aa MDSSNCKVIAPLLSQRYRRMVTKDGHSTLQMDGAQRGLAYLRDAWGILMDMRWRWMMLVF SASFVVHWLVFAVLWYVLAEMNGDLELDHDAPPENHTICVKYITSFTAAFSFSLETQLTI GYGTMFPSGDCPSAIALLAIQMLLGLMLEAFITGAFVAKIARPKNRAFSIRFTDTAVVAH MDGKPNLIFQVANTRPSPLTSVRVSAVLYQERENGKLYQTSVDFHLDGISSDECPFFIFP LTYYHSITPSSPLATLLQHENPSHFELVVFLSAMQEGTGEICQRRTSYLPSEIMLHHCFA 
SLLTRGSKGEYQIKMENFDKTVPEFPTPLVSKSPNRTDLDIHINGQSIDNFQISETGLTE >gi568815596r:232668194_232871362|GENSCAN_predicted_CDS_5|1083_bp atggacagcagtaattgcaaagttattgctcctctcctaagtcaaagataccggaggatg gtcaccaaggatggccacagcacacttcaaatggatggcgctcaaagaggtcttgcatat cttcgagatgcttggggaatcctaatggacatgcgctggcgttggatgatgttggtcttt tctgcttcttttgttgtccactggcttgtctttgcagtgctctggtatgttctggctgag atgaatggtgatctggaactagatcatgatgccccacctgaaaaccacactatctgtgtc aagtatatcaccagtttcacagctgcattctccttctccctggagacacaactcacaatt ggttatggtaccatgttccccagtggtgactgtccaagtgcaatcgccttacttgccata caaatgctcctaggcctcatgctagaggcttttatcacaggtgcttttgtggcgaagatt gcccggccaaaaaatcgagctttttcaattcgctttactgacacagcagtagtagctcac atggatggcaaacctaatcttatcttccaagtggccaacacccgacctagccctctaacc agtgtccgggtctcagctgtactctatcaggaaagagaaaatggcaaactctaccagacc agtgtggatttccaccttgatggcatcagttctgacgaatgtccattcttcatctttcca ctaacgtactatcactccattacaccatcaagtcctctggctactctgctccagcatgaa aatccttctcactttgaattagttgtattcctttcagcaatgcaggagggcactggagaa atatgccaaaggaggacatcctacctaccgtctgaaatcatgttacatcactgttttgca tctctgttgacccgaggttccaaaggtgaatatcaaatcaagatggagaattttgacaag actgtccctgaatttccaactcctctggtttctaaaagcccaaacaggactgacctggat atccacatcaatggacaaagcattgacaattttcagatctctgaaacaggactgacagaa taa >gi568815596r:232668194_232871362|GENSCAN_predicted_peptide_6|1338_aa MQNGLINQGKPEYSSDIPDTSCSCILLVQLIIGGRGFHRDQQSGIVGRPNFEEGGPTSVG RKHEFIRSESENWRIFREEQNGEDEDGGWRLAGSRRDGERWRPHSPDGPRSAGWREHMER RRRFEFDFRDRDDERGYRRVRSGSGSIDDDRDSLPEWCLEDAEEEMGTFDSSGAFLSLKK VQKEPIPEEQEMDFRPVDEGEECSDSEGSHNEEAKEPDKTNKKEGEKTDRVGVEASEETP QTSSSSARPGTPSDHQSQEASQFERKDEPKTEQTEKAEEETRMENSLPAKVPSRGDEMVA DVQQPLSQIPSDTASPLLILPPPVPNPSPTLRPVETPVVGAPGMGSVSTEPDDEEGLKHL EQQAEKMVAYLQDSALDDERLASKLQEHRAKGVSIPLMHEAMQKWYYKDPQGEIQGKLFL FPLNTESLAITIARDWRKSVSMNFLSVSMNFTIPSTSCPFNNQEMAEWFQAGYFTMSLLV KRACDESFQPLGDIMKMWGRVPFSPGPAPPPHMGELDQERLTRQQELTALYQMQHLQYQQ FLIQQQYAQVLAQQQKAALSSQQQQQLALLLQQFQTLKMRISDQNIIPSVTRSVSVPDTG SIWELQPTASQPTVWEGGSVWDLPLDTTTPGPALEQLQQLEKAKAAKLEQERREAEMRAK 
REEEERKRQEELRRQQEEILRRQQEEERKRREEEELARRKQEEALRRQREQEIALRRQRE EEERQQQEEALRRLEERRREEEERRKQEELLRKQPKPNPEQGLKTLFNSVKAERGDEAEE EKFEAISGWFMKFKERSHFHKIKAQGEAASADVEAAASYPDALLKIIDEGGLPISLLYTC FSDSVNAIKYLRPLHTPPLPFPGVLTFEFRCFTSWLRVLDITEEEAAKWAREEEEAQRRL EENRLRMEEEAARLRHEEEERKRKELEVQRQKELMRQRQQQQEALRRLQQQQQQQQLAQM KLPSSSTWGQQSNTTACQSQATLSLAEIQKLEEERERQLREEQRRQQRELMKALQQQQQQ QQQKLSGWGNVSKPSGTTKSLLEIQQEEARQMQKQQQQQQQHQQPNRARNNTHSNLHTSI GNSVWGSINTGPPNQWASDLVSSIWSNADTKNSNMGFWDDAVKEVGPRNSTNKNKNNASL SKSVGVSNRQNKKVEEEEKLLKLFQGVNKAQDGFTQWCEQMLHALNTANNLDVPTFVSFL KEVESPYEVHDYIRAYLGDTSEAKEFAKQFLERRAKQKANQQRQQQQLPQQQQQQPPQQP PQQPQQQDSVWGMNHSTLHSVFQTNQSNNQQSNFEAVQSGKKKKKQKMVRADPSLLGFSV NASSERLNMGEIETLDDY >gi568815596r:232668194_232871362|GENSCAN_predicted_CDS_6|4017_bp atgcaaaatggactaataaaccaaggaaagccagaatatagttctgacattccagacacg tcctgcagctgcattctgttagtccagcttataattgggggtagaggattccacagagat cagcagtcagggattgttgggagaccaaattttgaggaaggtggaccaacatcagtaggg agaaagcatgaatttatacgctcagaaagtgaaaattggcgcatctttagagaggaacaa aatggagaagatgaagatggaggttggcgactagctggatcaaggagggatggagagagg tggcgacctcacagtcctgatggccctcgttctgcaggctggcgggaacacatggaacga cgtcggaggtttgagtttgattttcgagatagagatgatgaacggggttaccgaagggtt cgctctggcagtgggagcatagatgatgacagggatagcttgcccgaatggtgcttagag gatgctgaagaagaaatgggtacatttgactcatctggagcattcctttctctaaaaaaa gtacagaaagagcctattccagaagagcaggagatggacttccggcctgtggacgaaggg gaggagtgctctgactctgagggtagccataatgaagaggccaaagaacccgataagaca aataagaaagaaggagagaaaacagatagagtaggagttgaagctagtgaggaaactccc cagacctcatcatcatctgctagaccaggtactccttcagaccatcagtctcaggaagca tcacagtttgagaggaaagatgaaccaaaaactgagcaaacggaaaaagctgaagaggag actcggatggaaaatagtctaccagccaaagtgcccagcagaggggatgaaatggttgct gatgtccagcagcccctgtcgcagattccttcagatacagcctctcctcttctcatactt ccacctcctgttcccaatcctagtcctactctccggccagttgaaacaccagttgtaggt gctcctggtatgggcagtgtttccacagaacctgatgatgaagaaggtctcaaacatttg gagcagcaagctgagaaaatggtggcttatctccaagacagtgcactagatgatgaaaga ttggcatcaaaactgcaagagcacagagctaaaggagtgtcgattccattgatgcatgaa 
gcaatgcagaagtggtattacaaagatcctcagggagaaattcaaggcaagttgttcctt tttcctttaaatactgaatccctggcaattaccattgccagggactggaggaaatctgtt tctatgaattttctatctgtttctatgaattttactattccgagcacctcatgtcccttc aataatcaggagatggcagaatggtttcaggcgggctattttactatgtctttattggtg aagagagcgtgtgatgaaagcttccaacctcttggcgatatcatgaaaatgtggggaagg gttcccttttctccaggtccagctccccctcctcatatgggagagctggaccaggaacga ctgaccaggcagcaagaactcacagccttataccagatgcagcacctgcagtaccagcag tttttaatacaacaacaatatgcacaggttttggcccaacagcagaaagcagcactgtct tcccagcagcagcagcagttggcacttcttcttcaacagtttcagaccttgaagatgaga atatctgatcagaacatcattccctcagtaactaggtctgtgtccgtgccagatactggc tctatctgggagcttcagccaacagcttcacagcctacagtttgggaaggtggtagtgta tgggatcttcctctggacaccacgacaccaggccctgccctggaacagcttcagcagcta gagaaggccaaagctgcaaagctagagcaagagagaagagaggcagaaatgagggcaaaa cgggaagaggaagagcgaaagaggcaggaagaactccgaagacaacaggaggaaattctt cggcgacagcaggaagaagaaaggaaaaggcgagaggaagaagaacttgcccgaaggaaa caggaagaggctctgcgtcgccagcgggagcaagaaattgcattaaggcgacagcgagaa gaggaagaaagacagcagcaagaagaagctcttagaagactggaagagaggagaagagaa gaggaagaaaggcggaagcaggaagaattgttacgcaaacagccaaagcctaatccagaa caaggcctcaaaactctcttcaattctgtgaaagccgagagaggtgatgaagctgaagaa gaaaagtttgaagctatcagtggttggttcatgaagtttaaggaaagaagccatttccat aaaataaaagcacaaggtgaagcagctagtgctgatgtagaagctgcagcaagttatcca gatgccctacttaagatcattgatgaaggaggtctgcccatcagcctgttgtacacttgc ttcagtgattcagtgaatgccattaagtatttacggcctttgcatactcctcctctgcca tttcctggtgtgttgacttttgagttcagatgtttcacgtcatggttaagagttttggat atcacagaagaggaggctgcaaaatgggcccgggaagaagaagaagcccagcgtcgatta gaggagaaccggctgcggatggaagaggaggcagccagactccggcatgaggaagaagaa cggaagagaaaggagctggaggtccagcggcagaaggagttaatgcgccagaggcagcag cagcaagaggctctccggaggttgcagcagcagcagcagcaacaacagctggcgcagatg aagcttccttcttcttcaacgtggggccagcagtccaatacaacagcatgtcagtcccag gccacgctgtcgttggctgaaatccaaaaactagaggaagaacgagaacggcagcttcga gaagagcaaaggcgccagcagagggagttgatgaaagctcttcagcagcagcagcaacag caacagcagaaactctcaggttgggggaatgtcagcaaaccttcaggtaccacgaaatct 
cttctggagatccagcaggaagaggccaggcaaatgcaaaagcagcagcagcagcagcag caacaccagcaaccaaacagagctcgtaacaatacgcattccaacctgcacaccagcatt gggaattctgtttggggctctataaatactggtcctcctaaccagtgggcatctgaccta gtcagtagtatttggagtaatgctgacactaaaaactccaacatgggattctgggatgat gcagtgaaagaggtgggacctaggaattcaacaaataaaaataaaaacaacgccagtctc agtaaatctgtaggtgtgtctaaccggcagaataagaaagtagaagaagaagaaaagttg ctgaagctctttcagggagtaaataaagcccaagatggatttacgcagtggtgtgaacag atgcttcatgcccttaatacggcaaataacttggatgttcccacatttgtttctttcctg aaagaagtagaatctccttatgaggtccatgattatatcagggcctatttaggagatact tctgaggccaaggagtttgccaagcagttccttgagcgccgtgccaaacagaaagccaac cagcagcgtcagcagcagcagctgccacagcagcagcagcagcagccgccacagcagccg ccacagcagccacaacagcaggactctgtgtgggggatgaaccacagtacactccattca gtatttcagaccaatcaaagcaacaaccaacaatccaattttgaggctgtgcagagtggc aagaagaagaaaaagcagaagatggtccgagcagatcccagtttattaggattttcagtc aatgcatcatcggagcgactcaacatgggtgaaatcgagacgttggatgactactga >gi568815596r:232668194_232871362|GENSCAN_predicted_peptide_7|25_aa MASCLALRMALLLVSGVLAPAVLTX >gi568815596r:232668194_232871362|GENSCAN_predicted_CDS_7|75_bp atggcatcctgtctggccctgcgcatggcgctgctgctggtctccggggttctggcccct gcggtgctcacagnn