GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:22:54

Sequence gi568815583r:84555273_84758152 : 202880 bp : 45.14% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   1475   1514   40                              -2.46
 1.01 Init +   7153   7277  125  0  2   58   67   128 0.281   7.34
 1.02 Intr +  15156  15462  307  0  1   40  105   117 0.012   5.05
 1.03 Intr +  22676  22891  216  1  0   84  101   125 0.103  12.10
 1.04 Intr +  34946  35069  124  1  1   55   37    64 0.003  -1.84
 1.05 Intr +  45542  45726  185  0  2   86   52   116 0.078   7.31
 1.06 Term +  48710  49273  564  1  0   33   43   393 0.003  23.69
 1.07 PlyA +  51588  51593    6                               1.05

 2.00 Prom +  55526  55565   40                              -3.76
 2.01 Sngl +  65374  66768 1395  0  0   47   45  1510 0.597 138.66
 2.02 PlyA +  68423  68428    6                               1.05

 3.00 Prom +  73217  73256   40                              -7.16
 3.01 Init +  76628  77026  399  1  0   76   83   296 0.976  24.57
 3.02 Intr +  77325  77451  127  2  1  -52   91    83 0.936  -5.15
 3.03 Intr +  78552  79067  516  1  0  107   -4   675 0.545  53.13
 3.04 Intr +  82075  82148   74  2  2  135  109    14 0.813   7.33
 3.05 Intr +  83124  83205   82  2  1   63   86    54 0.867   1.91
 3.06 Term +  85883  86001  119  0  2   78   42    94 0.506   2.50
 3.07 PlyA +  86586  86591    6                               1.05

 4.10 PlyA -  87829  87824    6                               1.05
 4.09 Term -  90564  90110  455  1  2   99   44   149 0.768   6.92
 4.08 Intr -  91076  90912  165  0  0   85   86   139 0.946  13.33
 4.07 Intr -  93353  93149  205  2  1  101   84    13 0.677   0.97
 4.06 Intr -  97530  97442   89  1  2  108   88    67 0.989   8.39
 4.05 Intr -  98427  98357   71  2  2  100   19    95 0.449   2.63
 4.04 Intr - 100613 100444  170  2  2   20   25    77 0.250  -6.66
 4.03 Intr - 102076 101904  173  2  2   90   99    52 0.767   6.16
 4.02 Intr - 102270 102157  114  1  0  101    5    71 0.376   0.52
 4.01 Init - 102880 102724  157  1  1  103  131   289 0.999  32.77
 4.00 Prom - 105128 105089   40                              -2.36

 5.09 PlyA - 106961 106956    6                               1.05
 5.08 Term - 111135 111130    6  0  0  102   44     0 0.197  -5.13
 5.07 Intr - 115510 115453   58  0  1   79  116    60 0.872   6.79
 5.06 Intr - 125560 125441  120  0  0   67   98    61 0.924   4.61
 5.05 Intr - 132502 132353  150  0  0  113   53    74 0.745   5.68
 5.04 Intr - 136372 136263  110  1  2  111   92    15 0.441   3.28
 5.03 Intr - 160977 160753  225  0  0   51   99   186 0.795  14.18
 5.02 Intr - 165899 165836   64  1  1   83   99    59 0.159   5.22
 5.01 Init - 176011 175968   44  1  2   95   77    -2 0.189  -0.71
 5.00 Prom - 176700 176661   40                              -6.76

 6.00 Prom + 177000 177039   40                              -6.26
 6.01 Init + 177797 177799    3  1  0  113   22     0 0.316  -4.10
 6.02 Term + 178671 178847  177  2  0   72   43   425 0.801  34.09
 6.03 PlyA + 180513 180518    6                               1.05

 7.00 Prom + 183992 184031   40                              -4.16
 7.01 Init + 193315 193392   78  0  0   81  115   133 0.961  14.36
 7.02 Term + 193665 193751   87  2  0   77   43    58 0.322  -2.14
 7.03 PlyA + 194620 194625    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  14367  14233  135  1  0   33  110   129 0.925  10.14

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:84555273_84758152|GENSCAN_predicted_peptide_1|506_aa
MRLQGDYDACFGHPSPALSIAPSDGSQLPCDEVPYGEAHMASLPQSRRSGPGQSREKGAR
SPGPGNRGLSWAGSPLSRDSGLFLSALALPCEAAAGRLTVMLQWSLSCRGGSWFLWKIED
NNFSLALNPDTDILLSPLAGGARKLQHIMSDEICVQVTDLYLAENNNGATGGQPNTQNSR
SLLESTYPQKAKQLMSDEKCFKVRVAHVVKDLAGDNTQPSSSLEDSAFNVPSSKQRAALT
RLQTCWYLDLGLPSHQNCTAQAVSPKPYSGDRGQVRLRADYISHNALGPASLLPPGWVPS
PPTERQNFQGRSRQPMSAAVPQEEDRQEEEVTTMILEDDSWVQEAVLQEDGPESEPFPQS
AGKGGPQEEVTRGPQGALGRLRELCRRWLRPEVHTKEQMLTMLPKEIQAWLQEHRPESSE
EAAALVEDLTQTLQDSGETQNLIGRGREHPSKVEECGVSEEEKVVSKAEWGASAIPLLCL
QAVSVFISLLVSSLCAKSAPEVLGGP

>gi568815583r:84555273_84758152|GENSCAN_predicted_CDS_1|1521_bp
atgaggctacaaggagactacgatgcctgctttggtcacccttctcctgctctttccatt
gctccctctgatggaagccagttgccatgtgatgaggtgccctatggagaggcccacatg
gcaagcctcccgcagtcccggcgatcggggccaggccagtcgcgggagaaaggtgcgcgc
tcacccggcccggggaaccggggcctctcctgggcaggttcccctttgtcccgggactcc
gggctcttcctctccgccctcgccctgccgtgtgaagccgccgctgggcgcctcaccgtg
atgttgcagtggagcctgagctgccgcggcggctcctggttcttgtggaaaatagaggac
aacaacttcagcttggccttgaaccctgacacggacattttactctcacctctggcggga
ggggcgcggaagctgcagcatatcatgagtgatgagatctgtgtacaggtgactgacctt
tacctggcagaaaataataatggggccaccggaggccagccgaacacacagaactcaaga
agcctcctggagtcaacgtatccacagaaagccaagcagctaatgtcagatgagaaatgc
tttaaggtgagagttgctcatgtagtcaaagaccttgctggtgacaatacacagccttcc
tcatccctggaggactcagcattcaacgtgccttcttcgaagcagagagcagccctcact
agactacaaacctgctggtaccttgatcttggacttcccagccaccagaactgcactgcc
caggctgtcagccccaaaccctactccggggaccgcggtcaggttcgtctccgggcggac
tacatctcccacaatgccttgggcccagcctccctcctgccgcccggctgggtgccgtct
ccaccaacagaaaggcagaatttccagggccgttctcggcagccaatgagcgcggcggtg
cctcaagaggaagatagacaggaggaggaggtcaccaccatgatcctggaggatgactcc
tgggtgcaagaagctgtgctgcaggaggatggccctgagtctgagccctttccccagagt
gctggcaagggcggcccccaggaggaggtgaccaggggaccacagggtgcactcggccgc
ctccgagagctctgccggcgctggctgagaccagaggtacacaccaaggagcagatgtta
accatgctgccaaaggaaattcaggcttggctgcaagagcatcggcctgaaagcagtgag
gaggcagcggccctggtggaagacttgacccagacccttcaggacagtggtgagacgcag
aacctcatagggagagggcgggagcacccttccaaggtagaggagtgtggtgtttcggag
gaggagaaggtggtgtccaaggcagagtggggggctagcgccatccctctgctctgtctg
caggcagtcagcgtgttcatcagccttttagtgtcctcactgtgtgcaaagtcagctcca
gaagtgctaggagggccttag

>gi568815583r:84555273_84758152|GENSCAN_predicted_peptide_2|464_aa
MFENESRKIFSEMPEGESAQHSDGESDFERDAGIQRLQGHSPGEDHGEVVSQDREVGQLI
GLQGTYLGEKPYECPQCGKTFSRKSHLITHERTHTGEKYYKCDECGKSFSDGSNFSRHQT
THTGEKPYKCRDCGKSFSRSANLITHQRIHTGEKPFQCAECGKSFSRSPNLIAHQRTHTG
EKPYSCPECGKSFGNRSSLNTHQGIHTGEKPYECKECGESFSYNSNLIRHQRIHTGEKPY
KCTDCGQRFSQSSALITHRRTHTGEKPYQCSECGKSFSRSSNLATHRRTHMVEKPYKCGV
CGKSFSQSSSLIAHQGMHTGEKPYECLTCGESFSWSSNLLKHQRIHTGEKPYKCSECGKC
FSQRSQLVVHQRTHTGEKPYKCLMCGKSFSRGSILVMHQRAHLGDKPYRCPECGKGFSWN
SVLIIHQRIHTGEKPYKCPECGKGFSNSSNFITHQRTHMKEKLY

>gi568815583r:84555273_84758152|GENSCAN_predicted_CDS_2|1395_bp
atgtttgagaatgaatcacgtaagatattctcggaaatgcctgaaggtgaaagtgctcag
cactccgatggggaaagtgactttgagagagatgctggcatccagaggctccagggacac
agcccaggtgaggaccacggggaggtggtttctcaggacagggaagttggccagctcata
ggcctgcagggcacctacctaggggagaagccctacgaatgtccccagtgtgggaagacc
ttcagccggaaatcccacctcatcacacacgagaggacccacacaggagagaaatactac
aaatgtgatgaatgtggaaaaagctttagtgatggttcaaattttagtagacaccaaacc
actcacaccggggagaagccctacaaatgcagagactgtgggaagagctttagccggagt
gccaacctcataacccaccagaggatccacacgggggaaaagcccttccagtgtgccgag
tgtggcaagagcttcagcaggagtcccaacctcattgcacatcagcgcacccacacagga
gagaaaccctactcgtgccccgagtgtggaaagagctttggcaaccgatccagccttaac
acgcatcaggggatccacactggagaaaagccctacgaatgtaaagaatgcggcgaaagc
tttagttacaactccaatctaatcagacaccagagaatccacacaggagagaaaccctac
aaatgtaccgactgtgggcagaggttcagccagagttcagccctcatcacccaccggaga
acccacacaggagagaaaccctaccagtgcagcgagtgtgggaaaagcttcagccgcagc
tctaacctggccacacaccggagaacccacatggtggagaagccctataagtgtggggtg
tgtgggaagagcttcagccagagctccagtctgattgcacaccagggcatgcacacaggg
gagaaaccctacgagtgcctgacatgtggggagagcttcagctggagctccaacctcctc
aagcaccagaggatccacacgggagagaaaccctacaaatgcagcgagtgtgggaaatgc
ttcagccagcgctcccagctcgtagtgcaccagcggacccacacgggcgagaagccctac
aaatgcctcatgtgcggcaagagcttcagccggggctccattctggtcatgcaccagaga
gcccatttgggagacaagccctacaggtgccctgagtgtgggaaaggctttagctggaac
tcagtcctcattatacatcagcgaatccacactggggagaagccctacaaatgccccgag
tgtggcaaaggcttcagcaacagctctaactttatcacacatcagagaactcacatgaaa
gagaaactttattga

>gi568815583r:84555273_84758152|GENSCAN_predicted_peptide_3|438_aa
MAVAVDQQIQTPSVQDLQIVKLEEDSHWEQEISLQGNYPGPETSCQSFWHFRYQEASRPR
EALLQLQKLCCQWLRPEKCTKEQILELLVLEQFPTVLLQEIQIWVRQQHPESGEEAVALV
EDLQKEPGRQRLENSSHSVAASQRQAASVVFVDIIEPIAKHLNYPMSFCLGFGKAASGEA
VRPHPSCSAAVAMANDSCGPGEPSSSERDRQYCELCGKMENLLRCSRSSFCCKERQRQDW
KKHKLVCQGSEGALGHGGGPHQDSGPAPPAAAPPSRDRALEARKAARRRDSASGDAAKAK
AKSAADPAAAASPPRASPGRTKAMAACYPVNGTGYVRHVDNPNGDGRPLPDVALGIPAEK
SRGGQEIVSSSQEAWRDPEPGLKNHLEITQQNSENEVTLGLPVPQPDGVTMLQKSEELWN
EDLPDFKEIQKTSSAGGK

>gi568815583r:84555273_84758152|GENSCAN_predicted_CDS_3|1317_bp
atggctgtagctgtggaccaacaaatccagactccttcagtacaagatctccaaatagtt
aaactggaagaagattcccactgggagcaggaaatttcccttcaagggaattaccctgga
ccagagacatcctgccagagcttttggcatttccgttaccaagaagcatcacgaccccga
gaggccctcctccagctccagaagctctgttgtcagtggctaaggccagagaagtgtaca
aaagagcagatcctggagttgctggtcctagaacagttcccgactgtccttctccaggag
atccagatctgggtcagacagcagcatccggagagtggagaggaggcagtggccctggtg
gaagacttgcagaaagaacctggaagacagaggctggagaacagctctcactctgtagca
gccagccagaggcaagcggcttcggtggtctttgtcgacattattgagcctattgcaaaa
cacctgaattatcctatgtctttttgcctagggtttggcaaggcagcctcgggcgaggcc
gtccggccgcacccctcctgctcagctgcggtcgccatggccaatgacagctgcgggccc
ggcgagccgagctcgagcgagcgagaccggcagtactgcgagctgtgcgggaagatggag
aacctgctgcgctgcagccgcagctccttctgctgcaaggagcgccagcgccaggactgg
aagaagcacaagctcgtgtgccagggcagcgagggcgccctcggccacggagggggccct
caccaggactccggccccgcgccgcccgctgcagcgccgccgtccagggaccgggccctg
gaggccaggaaggcagcgaggcgccgggacagcgcctccggggacgcagccaaggcaaag
gccaagtccgcggccgaccccgcggcggccgcgtccccgcctcgcgcgtccccgggccgg
acaaaagccatggctgcttgttatccggtcaatggaacgggttatgtacgtcatgttgat
aatccaaatggagacggaagacccctgcctgatgtggctctgggaattcctgcagagaag
agcaggggtggccaggagatagtgagcagctcccaggaggcatggagggatccagaacct
ggactgaagaaccacctagaaataactcagcagaattctgaaaatgaggtcacgctggga
cttccagttccccagccagatggggtcacaatgctgcagaaaagtgaagagctttggaat
gaagatctcccggacttcaaggagattcagaaaacgtccagtgcaggtgggaaatga

>gi568815583r:84555273_84758152|GENSCAN_predicted_peptide_4|532_aa
MARRAGGARMFGSLLLFALLAAGVAPLSWDLPEPRSRASKIRVHSRGNLWATGKAEGAGF
ATTQTPLWLLGEVEAPRGGRLSVFPLQERKRHFMGKKSLEPSSPSPLGTAPHTSLRDQRL
QLSHDLLGILLLKKALGVSLSRPAPQIQNQQPLRTSDVRPRSLGWLSLKGNNVTSNALGF
EALTHADQLENLRSSSAYVWGFRVRYQDFYAFDLSGATRVLEWIDDKGGVFVAGYESLKK
NEILHLKLPLRLSVKENKGLFPERDFKVRHGGFSDRSIFDLKHVPHTRYGQFCDPAIHTG
WDGMAANAWGYSASSSPFPPPQHTGTYVIKAVSTIAVHEKEESLWPRVAVFSTLAPGVLH
GARLRSLQVVDLESRKTTYTSDVSDSEELSSLQVLDADTFAFCCASGRLGLVDTRQKWAP
LENRSPGPGSGGERWCAEVGSWGQGPGPSIASLGSDGRLCLLDPRDLCHPVSSVQCPVSV
PSPDPELLRVTWAPGLKNCLAISGTAEQDFVLLSDLFLPGDCWVLRKEPPAP

>gi568815583r:84555273_84758152|GENSCAN_predicted_CDS_4|1599_bp
atggcccggcgggcggggggcgctcggatgttcggcagcctcctgctcttcgccctgctc
gctgccggcgtcgccccgctcagctgggatctcccggagccccgcagccgagccagcaag
atccgagtgcactcgcgaggcaacctctgggccaccggaaaagctgagggagcaggcttt
gccaccacccagacacctttgtggctccttggtgaggtggaagcaccaagaggaggaagg
ttaagtgtcttcccgctacaagaacggaaacgtcacttcatgggcaagaagagtctggag
ccttccagcccatccccattggggacagctccccacacctccctgagggaccagcgactg
cagctgagtcatgatctgctcggaatcctcctgctaaagaaggctctgggcgtgagcctc
agccgccccgcaccccaaatccagaaccagcagcccctgaggacctcagatgtaaggcct
aggagcttgggctggctgagtctgaagggaaacaatgtcacctctaatgcccttggtttt
gaagctctgacacatgcagaccaactagagaatctcagaagcagcagtgcctacgtctgg
ggcttcagagtgaggtaccaggatttctatgcattcgacctgtcaggagccactcgagtc
cttgaatggattgatgacaaaggtggagtctttgttgctggctatgaaagcctgaaaaag
aatgaaattcttcatctgaaattacctctcagactttctgtaaaggaaaacaagggctta
ttcccagaaagagatttcaaagtgcgccatggaggattttcagacaggtctatctttgat
ctaaagcatgtgccacataccaggtatggtcaattttgtgatccagccatccacacagga
tgggatgggatggctgcaaatgcctgggggtattctgcctcctcatctccattcccaccc
cctcagcacacaggcacatatgtcattaaagctgtcagcaccattgctgtgcatgagaaa
gaggagagtctctggcctagggtggccgtcttctccacattggcacccggagtcctccat
ggggcgaggctccgaagtctgcaggtcgttgatctggagtcccggaagaccacgtacacc
tcagatgtcagtgacagtgaggagctgagtagcctgcaggtcctagatgcagacaccttt
gccttctgctgtgcttcaggccggctggggcttgttgacacccggcagaagtgggcaccg
ttggagaatcgcagccctggccctgggtctggtggagagagatggtgtgctgaagttggg
agctggggccagggccctgggcccagcattgccagccttggctcagatgggcgtctttgt
cttcttgacccccgggatctctgccatcctgtgagctcagtccagtgcccagtatccgta
cctagccctgacccagagctgctgcgagtgacttgggccccaggcctgaagaattgcttg
gccatctcaggtactgccgagcaggatttcgtcttgttatctgatctctttcttccagga
gactgttgggtgctcaggaaggagcctccagcaccatga

>gi568815583r:84555273_84758152|GENSCAN_predicted_peptide_5|258_aa
MVSKLDKQDPRTRTWVIEDLLLLPTTSYIEKVKEKLRRGAGGGDRARPCQLPPRKRKCKH
CGVLSSVSPTRAPVVPPAGHRALSPAGVLLAVPAMLSLDFLDDVRRMNKRQLYYQVLNFG
MIVSSALMIWKGLMVITGSESPIVVVLSGSMEPAFHRGDLLFLTNRVEDPIRVGEIVVFR
IEGREIPIVHRVLKIHEKQNGHIKFLTKGDNNAVDDRGLYKQGQHWLEKKDVVGRARGFV
PYIGIVTILMNDYPKFKA

>gi568815583r:84555273_84758152|GENSCAN_predicted_CDS_5|777_bp
atggtgagcaaattggacaaacaggacccaaggactaggacatgggtcatagaggacctt
ctgcttctacccaccacttcctacattgagaaggtgaaggagaaactgcgaaggggcgcg
gggggcggggaccgtgcccgaccgtgccagctcccgccccggaagcggaagtgcaagcac
tgcggggtcctgtccagtgtgagcccgacccgagctccagtagttccgcccgctggtcat
cgcgccctttcccctgccggtgtcctgctcgccgtccccgccatgctgtctctagacttt
ttggacgatgtgcggcggatgaacaagcggcagctctattatcaagtcctaaattttgga
atgattgtctcatcggcactaatgatctggaaggggttaatggtaataactggaagtgaa
agtccgattgtagtggtgctcagtggcagcatggaacctgcatttcatagaggagatctt
ctctttctaacaaatcgagttgaagatcccatacgagtgggagaaattgttgtttttagg
atagaaggaagagagattcctatagttcaccgagtcttgaagattcatgaaaagcaaaat
gggcatatcaagtttttgaccaaaggagataataatgcggttgatgaccgaggcctctat
aaacaaggacaacattggctagagaaaaaagatgttgtggggagagccaggggatttgtt
ccttatattggaattgtgacgatcctcatgaatgactatcctaaatttaaggcttag

>gi568815583r:84555273_84758152|GENSCAN_predicted_peptide_6|59_aa
MLYNCCLYLRRRKKEEEKEKEEEKGKEKEKEKKEEEKKKKKKKKKKKKKKKKKKKKKKK

>gi568815583r:84555273_84758152|GENSCAN_predicted_CDS_6|180_bp
atgctctacaactgctgcctttaccttagaagaaggaagaaggaagaagagaaggaaaag
gaggaggagaaggggaaggagaaggagaaggagaaaaaagaagaagaaaagaagaagaag
aagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaatag

>gi568815583r:84555273_84758152|GENSCAN_predicted_peptide_7|54_aa
MLWLPQPALGTRAAETLACSRRRRRQAGHAAVLRAGRDVRPPSAIAGGPQPGHG

>gi568815583r:84555273_84758152|GENSCAN_predicted_CDS_7|165_bp
atgttgtggctcccgcagccggcgctggggacgcgcgcggccgagactctggcctgcagt
cgccgccgccgccgccaggccgggcacgctgctgtcctccgcgctgggcgggacgtgcgg
ccgccctcggcgatagccggcggcccccagcccgggcacggataa