GENSCAN 1.0    Date run: 4-Nov-116    Time: 17:34:03

Sequence gi568815575r:133117045_133318259 : 201215 bp : 42.14% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1153   1315  163  0  1   83   89    40 0.055   2.23
 1.02 Intr +   1641   1758  118  1  1   44   87    32 0.044  -2.70
 1.03 Intr +  17385  17472   88  0  1   52   86    89 0.162   4.15
 1.04 Intr +  21316  21409   94  0  1   59  116    49 0.159   3.42
 1.05 Intr +  32502  32554   53  1  2   74   72    38 0.001  -1.49
 1.06 Intr +  43395  43474   80  2  2   65  100    44 0.076   0.73
 1.07 Term +  43526  43886  361  2  1   11   49   339 0.114  15.32
 1.08 PlyA +  44014  44019    6                              -0.45

 2.00 Prom +  44050  44089   40                              -9.55
 2.01 Sngl +  44725  45564  840  0  0   33   42   312 0.725  16.59
 2.02 PlyA +  45572  45577    6                               1.05

 3.08 PlyA -  47702  47697    6                               1.05
 3.07 Term -  50733  50659   75  0  0   83   42    39 0.111  -4.34
 3.06 Intr -  52012  51940   73  0  1   57   94    30 0.089  -1.01
 3.05 Intr -  54997  54872  126  0  0   77   91   106 0.415   8.67
 3.04 Intr -  55174  55099   76  2  1   66   98    50 0.153   1.45
 3.03 Intr -  68499  68422   78  1  0  145   96    71 0.661  12.20
 3.02 Intr -  74217  74083  135  1  0   38   92   132 0.872   8.22
 3.01 Init -  82932  82758  175  0  1   63   20   130 0.532   3.26
 3.00 Prom -  82986  82947   40                              -1.35

 4.02 PlyA -  83257  83252    6                               1.05
 4.01 Sngl - 101215  99998 1218  1  0   79   45  1357 0.893 126.05
 4.00 Prom - 105871 105832   40                              -6.15

 5.00 Prom + 108778 108817   40                              -3.55
 5.01 Init + 115890 115959   70  2  1   81  106    -6 0.438   1.76
 5.02 Intr + 116444 116469   26  0  2  146   45   -15 0.368  -3.17
 5.03 Intr + 117488 117681  194  1  2  126   27    79 0.087   2.97
 5.04 Intr + 118278 118360   83  0  2  107   75    44 0.102   3.36
 5.05 Intr + 131337 131401   65  1  2   76  105    71 0.399   5.12
 5.06 Term + 131884 132093  210  0  0   36   42   136 0.320   0.11
 5.07 PlyA + 132652 132657    6                               1.05

 6.05 PlyA - 132783 132778    6                               1.05
 6.04 Term - 137508 137169  340  2  1   63   49   312 0.505  17.82
 6.03 Intr - 148488 148354  135  2  0   31   97   146 0.518   8.56
 6.02 Intr - 151932 151811  122  0  2   -3   68   118 0.257  -0.93
 6.01 Init - 152292 152260   33  0  0   71  115    12 0.382   2.13
 6.00 Prom - 156000 155961   40                              -5.65

 7.00 Prom + 159602 159641   40                              -5.75
 7.01 Sngl + 162927 163355  429  2  0   79   48   294 0.988  20.53
 7.02 PlyA + 163620 163625    6                              -0.45

 8.00 Prom + 164367 164406   40                              -6.15
 8.01 Init + 165325 165488  164  0  2   70   77    59 0.150   2.26
 8.02 Intr + 168157 168214   58  1  1  129   81    55 0.583   6.87
 8.03 Intr + 176251 176395  145  0  1   87   70    85 0.389   5.53
 8.04 Term + 176876 176973   98  0  2   88   39    37 0.138  -4.05
 8.05 PlyA + 179152 179157    6                               1.05

 9.07 PlyA - 181132 181127    6                               1.05
 9.06 Term - 186297 185823  475  2  1   61   31   516 0.515  36.57
 9.05 Intr - 187817 187681  137  2  2   66   92   188 0.999  15.45
 9.04 Intr - 188874 188728  147  0  0   87   93   187 0.951  18.61
 9.03 Intr - 189110 188980  131  0  2   40   90   144 0.171   9.29
 9.02 Intr - 196147 195960  188  0  2  106   35    62 0.098   1.11
 9.01 Intr - 196566 196422  145  1  1   90   38    72 0.075   0.82

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  29020  28955   66  1  0  110   39    99 0.947   4.16

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:133117045_133318259|GENSCAN_predicted_peptide_1|318_aa
CLLCKTALDGLVFLLCSKHDKICQQKKMEELHEEGGFSYWCAVGLVLFLSVSDAAPWMVV
VSCLSSDCRLSLPRNTVKFSIIQWFASTPFPQPCPKGSGHTLKHHSVGYSDDVMLIGPDK
EEVQLTEATWMAVINYKENLSEYSPFPSSVPCPKGHLGEGIFYQWPFKDIVQDHNSSPAK
EQNWTENEFDKLTEVGFRSKEAKNLEKRLEELLTRISSSEKNMNDLMELKNTGRKLREAY
TSFNSQINQAEERISVIEDQLDEIKTEDKIREKRIKRNEQSLQEIRDYVKRPNLSLIGVH
ERDEENGTKLENTLQDIT

>gi568815575r:133117045_133318259|GENSCAN_predicted_CDS_1|957_bp
tgtctgctctgcaaaactgcactggatggtttagtattcctcctttgtagtaaacatgat
aagatttgtcagcagaagaagatggaggaattgcatgaggaaggaggcttctcttactgg
tgcgctgtgggtttggttttgtttttgtcagtcagtgatgcagcaccctggatggttgtt
gtttcctgtttgtccagtgactgcagactctcactgcccagaaacacagtgaaattctcc
atcatccagtggtttgcatccacaccttttcctcagccttgtccaaagggatctggacat
accttgaaacatcactcagtgggctatagtgatgacgtcatgctaattggaccagataag
gaagaagtacagttaactgaagccacatggatggctgtgatcaactacaaagaaaaccta
tctgagtactcccccttccccagttcagtcccctgtcccaaaggccatctgggagaggga
atattttaccagtggccttttaaagatatcgtgcaggatcacaactcttcaccagcaaag
gaacaaaactggacggagaatgagtttgacaaattgacagaagtaggcttcagaagcaag
gaagctaaaaacctagaaaaaaggctagaggaattgctaactagaataagcagttcagag
aagaacatgaatgacctgatggagctgaaaaacacaggacgaaaacttcgtgaagcatac
acaagtttcaatagccaaatcaatcaagcagaagaaaggatatcagtgattgaagatcaa
cttgatgaaataaagacagaagacaagattagagaaaaaagaataaaaaggaatgaacaa
agcctccaagaaatacgggactatgtaaaaagaccaaacctaagtttgattggtgtacat
gaaagagacgaggagaatggaaccaaattggaaaacactcttcaggatattacctag

>gi568815575r:133117045_133318259|GENSCAN_predicted_peptide_2|279_aa
MLKGSMQQEELTIINVYAPNTGAPIFKKQVLRDLQRDLRSHTITVGDFNTPLSILDRSTR
QKINKDILDLNSALDQMDLIDIHRTLHLKSTEYTFFSAPHRTYSKTDHIIGSKILLSKCK
RTEIITNSLSDHSAIKLELRIKKLTQNSTTTWKLNNLLLNDYWVNNKIKAEIKMFFETNE
NEDTTNQNLWDTFKAVCRGKFTALNAHKRKQETSKMDTLTSQLKELEKQEQTNSKSSRRQ
EITKIRAELKEIETRKTLQKKLMNPGAGFLKRSTKEIDH

>gi568815575r:133117045_133318259|GENSCAN_predicted_CDS_2|840_bp
atgctaaagggatcaatgcagcaagaagagctaactatcataaatgtatatgcacccaat
acaggagcacccatattcaaaaagcaagttcttagagacctacaaagagacttacgctcc
cacacaattacagtgggagactttaacaccccactgtcaatattagacagatcaacaaga
cagaaaattaacaaggatattctggacttgaactcagctctggaccaaatggacctaata
gacatccacagaactctccacctcaaatcaacagaatatacattcttctcagcacctcat
cgcacttattctaaaactgaccacataattggaagtaaaatactcctcagcaaatgcaaa
agaacagaaatcataacaaacagtctctcagatcacagtgcaatcaaattagaactcagg
attaagaaactcactcaaaacagcacaactacatggaaactgaacaacctgctcctgaat
gactactgggtaaataacaaaattaaggcagaaataaagatgttctttgaaaccaatgag
aacgaagacacaacgaaccagaatctgtgggacacatttaaagcagtgtgtagagggaaa
tttacggcactaaatgcccacaagagaaagcaggaaacatctaaaatggacaccctaaca
tcacaattaaaagaactagagaagcaagagcaaacaaattcaaaatctagcagaagacaa
gaaataactaagatcagagcagaactgaaagagatagagacacgaaaaacccttcaaaaa
aaattaatgaatccaggagctggttttttgaaaagatcaacaaaagagatagaccactag

>gi568815575r:133117045_133318259|GENSCAN_predicted_peptide_3|245_aa
MDKDFMTKTPKAITTKAKINKLDLIKLKSSCTAKEAVNTVNRQPTEWEKIFANYAADKGF
RFLEPWVSAGAPEAFAGVVLCGKIGLRPSVGPVSRSTCYAVCRGLFLSSAKDGVLATQPQ
KFRPADNLKESTGEFTEVASLFWVFLQMATEWQSSSLSAAMDPLETKTGLFVSGLPVAKA
THLSEPLQELYEGIITRSNGLHRCRSSADGEEDLTKLGLCQADIQSETPVDLYSLSLANH
SVSTA

>gi568815575r:133117045_133318259|GENSCAN_predicted_CDS_3|738_bp
atggacaaagatttcatgacaaagacaccaaaagcgatcacaacaaaagcaaaaattaat
aagttagatctaattaaacttaagagctcctgcacagcaaaagaagctgtcaacacagta
aacagacaacctacagaatgggagaaaatatttgcaaactatgcagctgacaaaggtttc
aggttcctggagccctgggtctcagcaggagccccagaggcctttgcaggagttgttctt
tgtggcaagattggtttaaggccctcggtgggccccgtgtctaggagcacctgctacgct
gtttgcagaggacttttccttagttcagctaaagatggggtccttgctacacagccacaa
aaatttaggcctgcagacaatttgaaggaaagcactggagagtttacggaagtggcatct
ctcttctgggtatttctccaaatggccactgagtggcagtccagctctctgagtgctgcc
atggacccactggagacaaagacagggttatttgtcagtggcctgccagtagcaaaggcc
acacatctttcagagcccctccaagaactgtatgagggtattataaccaggtccaatggg
ttgcacaggtgcagatccagtgctgatggagaggaggatttaaccaagctgggactttgt
caggctgatatacagagtgagacacctgttgacctatactcgctgagcctagccaatcat
tcagtgtctacagcttaa

>gi568815575r:133117045_133318259|GENSCAN_predicted_peptide_4|405_aa
MAKYVSLTEANEELKVLMDENQTSRPVAVHTSTVNPLGKQLLPKTFGQSSVNIDQQVVIG
MPQRPAASNIPVVGSPNPPSTHFASQNQHSYSSPPWAGQHNRKGEKNGMGLCRLSMKVWE
TVQRKGTTSCQEVVGELVAKFRAASNHASPNESAYDVKNIKRRTYDALNVLMAMNIISRE
KKKIKWIGLTTNSAQNCQNLRVERQKRLERIKQKQSELQQLILQQIAFKNLVLRNQYVEE
QVSQRPLPNSVIHVPFIIISSSKKTVINCSISDDKSEYLFKFNSSFEIHDDTEVLMWMGM
TFGLESGSCSAEDLKMARNLVPKALEPYVTEMAQGTFGGVFTTAGSRSNGTWLSASDLTN
IAIGMLATSSGGSQYSGSRVETPAVEEEEEEDNNDDDLSENDEDD

>gi568815575r:133117045_133318259|GENSCAN_predicted_CDS_4|1218_bp
atggcaaaatatgtcagtctcactgaagctaacgaagaactcaaggtcttaatggacgag
aaccagaccagccgccccgtggccgttcacacctccaccgtgaacccgctcgggaagcag
ctcttgccgaaaacctttggacagtccagtgtcaacattgaccagcaagtggtaattggt
atgcctcagagaccagcagcatcaaacatccctgtggtaggaagcccaaacccacccagc
actcactttgcctctcagaaccagcattcctactcctcacctccttgggccgggcagcac
aacaggaaaggagagaagaatggcatgggcctgtgccgtctttccatgaaggtctgggag
acggtgcagaggaaagggaccacttcctgccaggaagtggtgggcgagctggtcgccaag
ttcagagctgccagcaaccacgcctcaccaaacgagtcagcttatgacgtgaaaaacata
aaacggcgcacctacgatgccttaaacgtgctgatggccatgaatatcatctccagggag
aaaaagaagatcaagtggattggtctgaccaccaactcggctcagaactgtcagaactta
cgggtggaaagacagaagagacttgaaagaataaagcagaaacagtctgaacttcaacaa
cttattctacagcaaattgctttcaagaacctggtgctgagaaaccagtatgtggaggag
caggtcagccagcggccgctgcccaactcagtcatccacgtgcccttcatcatcatcagc
agtagcaagaagaccgtcatcaactgcagcatctccgacgacaaatcagaatatctgttt
aagtttaacagctcctttgaaatccacgatgacacagaagtgctgatgtggatgggcatg
acttttgggctagagtccgggagctgctctgccgaagaccttaaaatggccagaaatttg
gtcccaaaggctctggagccgtacgtgacagaaatggctcagggaacttttggaggtgtg
ttcacgacggcaggttccaggtctaatggcacgtggctttctgccagtgacctgaccaac
attgcgattgggatgctggccacaagctccggtggatctcagtacagtggctccagggtg
gagaccccagcagtcgaggaggaagaggaggaggacaacaacgatgacgacctcagtgag
aatgacgaggatgactga

>gi568815575r:133117045_133318259|GENSCAN_predicted_peptide_5|215_aa
MAGPVILELFVSFTEFSIVIPKESLSSIHTSYVMGSLAATRWGIWENIRVGCNVVLKTPG
SVSMGLDVVERPQICTWGFALVRLPMFCKFQCKEITNCLAAAIPTMFSWGSAKTEKLDKK
QCCTGLSTARRLEEALKWPCSNNQYLVFCMSDRRLHLDLPTAPVAPPRSDQSKRTASTPY
YFISDPNSQHCPYLSSLSSKLSEKLLTSEPSMKLI

>gi568815575r:133117045_133318259|GENSCAN_predicted_CDS_5|648_bp
atggctgggcctgttatcctagaattatttgtatctttcacagaatttagcatagttata
cctaaagaaagtctcagctcaatacacacttcctatgtaatgggctctcttgctgctacc
aggtggggaatatgggaaaacattagagttgggtgcaatgtagtgttgaaaacaccggga
tcagttagcatggggctggatgttgtagaaaggccccagatttgtacttgggggtttgca
cttgttaggttgccaatgttctgcaaattccagtgcaaagaaataacaaattgcttagca
gcagcaatacccacaatgttctcctggggttcagcaaaaacagaaaagctggacaagaaa
caatgctgtacaggactttcaactgccagaaggctagaagaggcccttaagtggccttgc
tctaataatcagtacctggttttttgcatgtctgaccgaaggctccacctggacctgcca
actgctcctgtggccccacccagaagcgatcagtccaagaggacagcttcaactccctat
tatttcatctctgacccaaacagtcagcactgtccataccttagctccctgtccagcaaa
ctatctgaaaaactcctaacatctgaaccttccatgaaactgatttga

>gi568815575r:133117045_133318259|GENSCAN_predicted_peptide_6|209_aa
MLIEVSYPSLKGSVQECAVGSLCTKAWGGHSLVSGPEVDAVSIVESQGKSELGKEPDCQE
CSGIPLRLEPRSRKCAECHLLKHTGLCTEKLFALQQSVSVNDYVTFGNVDLLRGPMVILH
KPQRSNGPKPLGFGGGNTVHYLQDSTDSVVGFVKYAIKKEIKNTRLNMKEEEEKIKALEF
YFQVAKCSCPDFPLTSGRSLETGTEWDSE

>gi568815575r:133117045_133318259|GENSCAN_predicted_CDS_6|630_bp
atgcttatagaggtttcctatccaagcctgaagggttctgttcaagaatgtgctgtggga
tccctctgcaccaaggcttggggtggccacagcctggtgagtggcccagaggtggatgct
gttagcatcgttgagagtcaagggaagtcagaattaggaaaggagccggactgtcaagaa
tgcagtggaatcccgctgaggctggagcctcggtcgagaaaatgtgcggaatgccatttg
ttaaaacacactggcctttgcacagaaaagctctttgcgcttcaacaaagcgtgagtgtc
aatgattatgtgacttttggaaatgtagatcttcttcgggggcccatggtgattttgcat
aaaccacagaggtcaaatgggcccaagcctcttgggtttggaggaggaaacacagtgcat
tatttacaagattccaccgattcagtagttgggtttgtcaaatatgcaataaagaaggaa
attaaaaacacaagattaaatatgaaggaagaagaagaaaagatcaaagccttggagttt
tacttccaagttgcaaagtgcagctgtccggacttccccctgacctccggtcgctccttg
gagacagggactgaatgggactctgaatag

>gi568815575r:133117045_133318259|GENSCAN_predicted_peptide_7|142_aa
MRKNQCKKAENSKNQKASSPPKDHNSSPAREKNWTENEFGKLREVGSRRWVITNFSELKE
HALTQCKKVKNLEKRLEELLTRISSLEKNINDLMELKNKHEIFVKHTQVSIAKSIKQKKG
YQRSKINFMKYSGKTRLEKKEE

>gi568815575r:133117045_133318259|GENSCAN_predicted_CDS_7|429_bp
atgaggaaaaaccagtgcaaaaaggctgaaaattccaaaaaccagaaagcctcttctcct
ccaaaggatcataactcctcaccagcaagggaaaaaaactggacagagaatgagtttggc
aaattgagagaagtaggctccagaaggtgggtaataacaaatttctctgagctaaaggag
catgctctaacccaatgcaagaaagttaagaaccttgaaaaaaggttagaggaattgcta
actagaataagcagtttagagaagaacataaatgacctaatggagctgaaaaacaagcat
gagatctttgtgaagcatacacaagtatcaatagccaaatcaatcaagcagaagaaagga
tatcagagatcaaagatcaacttcatgaaatacagtgggaagacaagattagaaaaaaaa
gaagaatga

>gi568815575r:133117045_133318259|GENSCAN_predicted_peptide_8|154_aa
MDKFLDTYTLPSLKEEEVESLNRPITSSEIEAVINSLPTKKSPGPDGFTAKFYQRTEMAV
EVHWQVRSKSLMEKLGGQSRIEWPKTSGISGMTLAVGVRVGHLEAMFEDVAKICACDWPE
FKDHTLSHLPALTLTIQTSQCFTECGRQPVASKE

>gi568815575r:133117045_133318259|GENSCAN_predicted_CDS_8|465_bp
atggataaattcctggacacatataccctcccaagtctaaaagaggaagaagttgaatcc
ctgaatagaccaataacaagttctgaaattgaggcagtaattaatagcctaccaaccaaa
aaaagtccaggaccagacggattcacggccaaattctaccagaggaccgaaatggcagtg
gaagtccactggcaagttaggtctaagtccttaatggaaaagctaggaggccagagcaga
atagagtggcctaagacatctgggatctctggaatgactctggcagtaggggtgagggtg
gggcacctagaagcaatgtttgaagatgtggcaaagatatgtgcatgtgactggccagaa
ttcaaagaccacactttgtcccacctccctgcccttactcttacaatccaaaccagccag
tgctttactgagtgtggaaggcaacctgtagcatccaaggaatag

>gi568815575r:133117045_133318259|GENSCAN_predicted_peptide_9|407_aa
XHLPPPVLFAPLQKERGTCREVLTICDFKGMNFQLSSKSGYPEKAIPPGALLISDIQSNT
PEQVKCIAGLPCIPASLPCGGICLQKSVLAELVGTYTEECTNPLILTPKEEDAMLMVAER
LEGPFNIESVMDPIDVKISDAIMNMQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAF
SARFRPHHPEERPTTAAGTSLDRLVTDVKEKLKQAKKFWSSLPSNVCNDERMAAGNGNED
DCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTSKPDILILRQIMALRVMTSKMKNAYNG
NDVDFFDISKHLWFESTNCSCMSSDENKWNEISIPFFFKLGDESSGEGSGSGCEYQQCPS
EFDYNATDHAGKSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR

>gi568815575r:133117045_133318259|GENSCAN_predicted_CDS_9|1224_bp
ngccacctcccccctccagtcctgtttgctcctttacaaaaggagagggggacatgtagg
gaagtcttaacaatctgtgactttaaaggaatgaactttcagctgtccagcaaatctggc
tatccagaaaaggccattcctccaggggcactgctgataagtgatatccaaagtaacact
ccagagcaggtgaagtgcattgccgggctgccctgcatcccagcatctcttccctgtggt
ggtatctgcctccaaaagtctgtgttagcagaactggtgggtacttacacagaagaatgc
actaatcctctgatactgactccaaaagaggaggatgctatgctgatggtggcagagagg
ctagagggtcctttcaacattgaatcggtcatggatcccatcgatgtgaagatttctgat
gctattatgaacatgcaggataatagtgttcaagtgtctcagaaggttttccagggatgt
ggaccccccaagcccctcccagctggacgaatttctcgttccatctctgaaagtgccttc
agtgctcgcttcagaccacatcaccccgaggaacgcccaaccacagcagctggcactagt
ttggaccgactggttactgatgtcaaggagaaactgaaacaggccaagaaattctggtcc
tcccttccgagcaacgtttgcaacgatgagaggatggctgcaggaaacggcaatgaggat
gactgttggaatgggaaaggcaaaagcaggtacctgtttgcagtgacaggaaatggatta
gccaaccagggcaacaacccagaggtccaggttgacaccagcaaaccagacatactgatc
cttcgtcaaatcatggctcttcgagtgatgaccagcaagatgaagaatgcatacaatggg
aacgacgtggacttctttgatatcagtaagcatttgtggtttgaaagtacgaattgctct
tgtatgtcttctgatgagaacaagtggaatgaaatttctattccatttttttttaaacta
ggtgatgaaagtagtggagaaggaagtggaagtggctgtgagtatcagcagtgcccttca
gagtttgactacaatgccactgaccatgctgggaagagtgccaatgagaaagccgacagt
gctggtgtccgtcctggggcacaggcctacctcctcactgtcttctgcatcttgttcctg
gttatgcagagagagtggagataa