GENSCAN 1.0	Date run: 6-Nov-116	Time: 10:39:05

Sequence gi568815591f:128376766_128601913 : 225148 bp : 46.21% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3325   3515  191  2  2   91  103    45 0.529   5.60
 1.02 Intr +  13055  13277  223  1  1   59   57   216 0.580  13.30
 1.03 Term +  15136  15359  224  2  2  111   44   143 0.894   9.28
 1.04 PlyA +  15499  15504    6                              -8.39

 2.19 PlyA -  15543  15538    6                               1.05
 2.18 Term -  16263  16242   22  2  1  105   55    40 0.926   0.18
 2.17 Intr -  17596  17513   84  0  0  117   99   141 0.976  17.04
 2.16 Intr -  17834  17691  144  1  0   77  109   183 0.999  18.70
 2.15 Intr -  18268  18124  145  2  1   16   19   275 0.960  12.74
 2.14 Intr -  18509  18366  144  0  0   84   94   253 0.968  25.75
 2.13 Intr -  19930  19835   96  2  0   76   98   190 0.959  18.78
 2.12 Intr -  20257  20167   91  1  1  119  113    76 0.984  12.77
 2.11 Intr -  21848  21649  200  0  2   69   87   410 0.968  38.07
 2.10 Intr -  23417  23330   88  2  1   71   76    73 0.956   3.94
 2.09 Intr -  23774  23568  207  2  0  113   67   348 0.990  34.47
 2.08 Intr -  24126  24058   69  0  0   97   48    54 0.612   1.68
 2.07 Intr -  24351  24250  102  0  0   88   68    98 0.835   8.17
 2.06 Intr -  26989  26941   49  0  1  136   89    34 0.638   6.98
 2.05 Intr -  29432  29002  431  2  2  114   82   309 0.368  25.41
 2.04 Intr -  30056  29978   79  1  1  128   89    -4 0.738   3.25
 2.03 Intr -  32601  32524   78  2  0   87   66    40 0.513   0.37
 2.02 Intr -  33189  32991  199  1  1   49   51   206 0.607  11.41
 2.01 Init -  42749  42599  151  2  1   58   21   157 0.004   4.21
 2.00 Prom -  75079  75040   40                              -2.96

 3.00 Prom +  89310  89349   40                              -3.26
 3.01 Init +  89807  89920  114  1  0   75   84    99 0.977   8.41
 3.02 Term +  92080  92406  327  0  0  108   44    92 0.921   1.51
 3.03 PlyA +  93036  93041    6                               1.05

 4.00 Prom +  97822  97861   40                              -3.26
 4.01 Init + 100001 100110  110  1  2   84   49   121 0.953   7.49
 4.02 Intr + 100275 100408  134  0  2   39   91   102 0.604   5.89
 4.03 Intr + 102393 102748  356  1  2   57   76   332 0.829  23.91
 4.04 Intr + 103882 103931   50  0  2   97  108     3 0.587   0.68
 4.05 Intr + 111336 111396   61  0  1   91  109    -3 0.595   0.84
 4.06 Intr + 117039 117178  140  2  2   98  108   145 0.999  16.76
 4.07 Intr + 121271 121377  107  2  2  121   63    89 0.995   9.56
 4.08 Term + 124997 125151  155  0  2   57   48   184 0.987   9.38
 4.09 PlyA + 125345 125350    6                               1.05

 5.04 PlyA - 125554 125549    6                               1.05
 5.03 Term - 145639 145428  212  2  2   85   47    74 0.147   0.46
 5.02 Intr - 155203 155124   80  0  2   80   19    82 0.006  -0.31
 5.01 Init - 193868 193540  329  2  2   64   40   307 0.104  20.30

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:128376766_128601913|GENSCAN_predicted_peptide_1|212_aa
XLGFQGVKDEAAGCSGGGGEIWVSEQERLVVGDTRTGPGARKRCKRPGEESYQWLGHKVY
SHQQWPGAGCLNANVAARTLGHWLVARWQCTLSLGGYTFAQMPLPMSWNLGHHSSVVQRL
WGHIFHNLVVSSLDHMVLGVFRQMRPQALAMLYFHLCPGCTMWHDEAAEGVFRQLGVRPR
CWNSTAMMTPQQCCLAAGVAEWLAVGEAWETS

>gi568815591f:128376766_128601913|GENSCAN_predicted_CDS_1|639_bp
ngactaggattccaaggagtgaaagatgaagccgctgggtgcagcggagggggtggagaa
atctgggtcagtgaacaggagagactggtagtgggggacaccaggacagggccgggggca
aggaagagatgcaagcggccaggagaggagtcttaccagtggctgggccacaaggtgtac
tctcaccagcagtggcctggtgcaggctgcctgaatgctaatgttgctgcaaggaccctg
gggcactggctggttgcacgctggcagtgcaccctctccctgggtggctacacctttgcg
cagatgccgctgcccatgagctggaacctgggtcaccacagcagcgtggttcagcgcctg
tggggccacatcttccacaatctggtggtgagcagcctggatcacatggtgctgggtgtg
tttcggcagatgagaccacaggcgctggccatgctgtacttccacctgtgccctggctgc
accatgtggcatgatgaggctgcggagggcgttttccggcagctgggtgtgaggccccga
tgctggaattctacagccatgatgacccctcaacaatgctgcctggctgcaggagttgct
gagtggcttgcagtaggtgaagcctgggagacctcctga

>gi568815591f:128376766_128601913|GENSCAN_predicted_peptide_2|792_aa
MLNTHLLPLLLSPTLPRLLASPFTVTVPHRPEAVPWLLYQQILFLTGIPYSGAPGASRLP
PALSCGLRMEGPLTPPPLQGGGAAAVPEPGARQHPGHETAAQRYSARLLQAGYEPESCFL
LELSSVVLLAGVGVQMDRLRRARTLPNNKPLSEIKSPGTNTIFSAGDRAPHTSYPVRACG
VAAPLQAAGGVRAARLSPLCGAGGGGGFRGRGEKGGRGGRTGRRPGPPCRCRSPRPRPRP
ARPAAAAWPRRGVSSSSSSSGGSGGRPRGCLCRVAGSRGSMADYLISGGTGYVPEDGLTA
QQLFASADGLTYNDFLILPGFIDFIADEVDLTSALTRKITLKTPLISSPMDTVTEADMAI
AMALMGGIGFIHHNCTPEFQANEVRKKFEQGFITDPVVLSPSHTVGDVLEAKMRHGFSGI
PITETGTMGSKLVGIVTSRDIDFLAEKDHTTLLSEVMTPRIELVVAPAGVTLKEANEILQ
RSKKGKLPIVNDCDELVAIIARTDLKKNRDYPLASKDSQKQLLCGAAVGTREDDKYRLDL
LTQAGVDVIVLDSSQGNSVYQIAMVHYIKQKYPHLQVIGGNVVTAAQAKNLIDAGVDGLR
VGMGCGSICITQEVMACGRPQGTAVYKVAEYARRFGVPIIADGGIQTVGHVVKALALGAS
TVMMGSLLAATTEAPGEYFFSDGVRLKKYRGMGSLDAMEKSSSSQKRYFSEGDKVKIAQG
VSGSIQDKGSIQKFVPYLIAGIQHGCQDIGARSLSVLRSMMYSGELKFEKRTMSAQIEGG
VHGLHSYEKRLY

>gi568815591f:128376766_128601913|GENSCAN_predicted_CDS_2|2379_bp
atgctgaacacccatctgctcccgctcctcctttcccccactctcccacgcttgctggca
tctccattcacagtgactgtgcctcaccggccagaagctgttccgtggcttctgtaccag
cagattctcttcttaactggtattccatactccggggccccgggagcctccaggctcccg
cccgccctgagctgcggcctccgcatggaggggccactcactccaccaccgctgcaggga
ggcggagccgccgctgttccggagcccggagcccggcaacacccgggacacgagacggcg
gcgcagcggtacagcgcccgactgctgcaggccggctacgagcccgagagctgtttcctc
ctagaactatcttcagtggtcttactggcaggtgttggtgtccagatggatcgccttcgc
agggctaggaccctcccaaataacaaaccgctctctgagattaaaagccctggtaccaac
accattttctcagccggagatagggcgccccacacctcctacccggtccgggcttgcggg
gtcgcagcgccgctgcaggcagctgggggcgtgcgcgcggcgcggctgagccctttgtgc
ggcgcgggcggaggcggcggtttccgcgggaggggcgagaagggcgggaggggcgggagg
acagggcggcgccctggcccgccctgccgctgccgcagcccccgcccccgcccccggccc
gcccggcccgcggcagcagcgtggccgcggcgcggcgtcagcagtagcagcagcagcagc
ggcggcagcggcgggcggccgcgcgggtgtttatgtcgggtcgcggggtctcgcggcagc
atggcggactacctgatcagcggcggcaccggctacgtgcccgaggatgggctcaccgcg
cagcagctcttcgccagcgccgacggcctcacctacaacgacttcctgattctcccagga
ttcatagacttcatagctgatgaggtggacctgacctcagccctgacccggaagatcacg
ctgaagacgccactgatctcctcccccatggacactgtgacagaggctgacatggccatt
gccatggctctgatgggaggtattggtttcattcaccacaactgcaccccagagttccag
gccaacgaggtgcggaagaagtttgaacagggcttcatcacggaccctgtggtgctgagc
ccctcgcacactgtgggcgatgtgctggaggccaagatgcggcatggcttctctggcatc
cccatcactgagacgggcaccatgggcagcaagctggtgggcatcgtcacctcccgagac
atcgactttcttgctgagaaggaccacaccaccctcctcagtgaggtgatgacgccaagg
attgaactggtggtggctccagcaggtgtgacgttgaaagaggcaaatgagatcctgcag
cgtagcaagaaagggaagctgcctatcgtcaatgattgcgatgagctggtggccatcatc
gcccgcaccgacctgaagaagaaccgagactaccctctggcctccaaggattcccagaag
cagctgctctgtggggcagctgtgggcacccgtgaggatgacaaataccgtctggacctg
ctcacccaggcgggcgtcgacgtcatagtcttggactcgtcccaagggaattcggtgtat
cagatcgccatggtgcattacatcaaacagaagtacccccacctccaggtgattgggggg
aacgtggtgacagcagcccaggccaagaacctgattgatgctggtgtggacgggctgcgc
gtgggcatgggctgcggctccatctgcatcacccaggaagtgatggcctgtggtcggccc
cagggcactgctgtgtacaaggtggctgagtatgcccggcgctttggtgtgcccatcata
gccgatggcggcatccagaccgtgggacacgtggtcaaggccctggcccttggagcctcc
acagtgatgatgggctccctgctggccgccactacggaggcccctggcgagtacttcttc
tcagacggggtgcggctcaagaagtaccggggcatgggctcactggatgccatggagaag
agcagcagcagccagaaacgatacttcagcgagggggataaagtgaagatcgcgcagggt
gtctcgggctccatccaggacaaaggatccattcagaagttcgtgccctacctcatagca
ggcatccaacacggctgccaggatatcggggcccgcagcctgtctgtccttcggtccatg
atgtactcaggagagctcaagtttgagaagcggaccatgtcggcccagattgagggtggt
gtccatggcctgcactcttacgaaaagcggctgtactga

>gi568815591f:128376766_128601913|GENSCAN_predicted_peptide_3|146_aa
MKSMDPASGYSNNILTIDQMLKKKQTCIVADATTIKQHVKRATDTYNLGIALEHRKEMLN
LWQKIRGDLIGIDSRNEAFYDTFSTYTWSWNVCQELLSPKDLRLYDAYVNRNSSHNCRSS
SSSDTSECDTDSGRKRKQKGLKGFQQ

>gi568815591f:128376766_128601913|GENSCAN_predicted_CDS_3|441_bp
atgaagtctatggaccctgcctcaggttattcaaataacatcctcaccattgatcaaatg
ctcaagaaaaagcagacttgtatagtggccgatgcaactactattaaacaacatgtgaag
agagctactgatacctataatttgggaattgcccttgaacaccgaaaagaaatgctaaac
ctctggcagaagatccgaggggatttgattgggatagactctagaaatgaggccttttat
gacaccttttctacttatacatggtcctggaatgtttgccaagaattactttctcctaag
gacttaaggttatatgatgcctatgtgaatagaaattcctcccataactgcagatcctct
tcctcatcagataccagtgaatgtgacacagactcaggaagaaaaagaaaacagaaaggt
ttaaagggatttcaacaatga

>gi568815591f:128376766_128601913|GENSCAN_predicted_peptide_4|370_aa
MAGSYPEGAPAILADKRQQFGSRFLSDPARVFHHNACQDVKPLSCPFDFLRDNVEWSEEQ
AAAAERKVQENSIQRVCQEKQVDYEINAHKYWNDFYKIHENGFFKDRHWLFTEFPELAPS
QNQNHLKDWFLENKSEVCECRNNEDGPGLIMEEQHKCSSKSLEHKTQTPPVEENVTQKIS
DLEICADEFPGSSATYRILEVGCGVGNTVFPILQTNNDPGLFVYCCDFSSTAIELVQTNS
EYDPSRCFAFVHDLCDEEKSYPVPKGSLDIIILIFVLSAVVPDKMQKAINRLSRLLKPGG
MVLLRDYGRYDMAQLRFKKEELDTLFTTAGLEKVQNLVDRRLQVNRGKQLTMYRVWIQCK
YCKPLLSSTS

>gi568815591f:128376766_128601913|GENSCAN_predicted_CDS_4|1113_bp
atggccggctcctaccctgaaggtgcacctgcaatcctcgccgataagaggcagcagttc
ggaagccggttcctgagcgatccggcgcgcgtcttccaccacaatgcctgccaggacgtg
aagcccctaagctgcccgtttgattttctcagggacaatgtggagtggtcggaagagcaa
gccgcggcggcggagagaaaagtccaggagaacagtatccagcgggtgtgccaggagaaa
caagttgattatgagatcaatgcccacaaatactggaatgacttctacaaaatccacgaa
aatgggtttttcaaggatagacattggctttttaccgaattccctgagctggcacctagc
caaaatcaaaatcatttgaaggattggttcttggagaacaagagtgaagtatgtgaatgt
agaaacaatgaggatggacctggtttaataatggaagaacagcacaagtgttcttcgaag
agccttgaacataaaacacagacacctcctgtggaggagaatgtaactcagaaaattagt
gacctggaaatttgtgctgatgagtttcctggatcctcagccacctaccgaatactggag
gttggctgtggtgtgggaaacacagtctttccaattttacaaacgaacaatgacccagga
ctctttgtttattgctgtgatttttcttccacagctatagaactggtccagacaaattca
gaatatgatccttctcggtgttttgcctttgttcacgacctgtgtgatgaagagaagagt
tacccagtgcccaagggcagtcttgatattatcattctcatatttgttctttcagcagtt
gttccagacaagatgcagaaggctatcaacaggctgagcaggcttctgaaacctgggggg
atggtacttctgcgagattacggccgctatgacatggctcagcttcggtttaaaaaagag
gaactggacacgcttttcaccactgctggactggaaaaagttcagaatctggtggaccgc
cgactgcaggtgaaccgagggaagcaactgacaatgtaccgggtttggattcagtgcaaa
tactgcaagccccttctgtccagcaccagctaa

>gi568815591f:128376766_128601913|GENSCAN_predicted_peptide_5|206_aa
MVMVAKKDVHMPKHPELADKNVPNLHVMKAMQSLKSRGYVKEHFAWRHFYWYLTNEGIQY
LRDYLHLPPEIVPATLPRSRPETGRPWPKGLYVRSAVLPGADKKAEAGAGDPQPSGSPSR
LSAQELRSGADTELGPGFFSVNRVHHIDVLFHFMLCDVNFSQSKEKAVYTLKRTPHRVHP
NSTFSCISEPSASGPPPHSPRGALKP

>gi568815591f:128376766_128601913|GENSCAN_predicted_CDS_5|621_bp
atggtcatggtggccaagaaggatgtccacatgcctaagcacccagagctggcagacaag
aatgtgcccaaccttcatgtcatgaaggccatgcagtctctcaagtcccgaggctacgtg
aaggaacactttgcctggagacatttctactggtaccttaccaatgagggtatccagtat
ctccgtgattaccttcatctgcccccggagattgtgcctgccaccctaccccgtagccgt
ccagagactggcagaccttggcctaaaggtctgtatgtgaggagtgctgtgctacctggt
gccgacaagaaagccgaggctggggctggcgacccgcagccctcagggagcccctcccgc
ctcagcgcccaggagctgcggtccggagccgacaccgagctgggcccaggtttcttcagt
gttaaccgtgttcatcacattgatgtgctgtttcacttcatgctgtgtgatgtgaacttc
agccagagcaaggagaaagcagtatatactctgaaaaggaccccgcaccgtgttcaccca
aacagcaccttctcatgcatctcagagccttcggcctctgggccacctcctcactccccc
aggggtgctctgaagccttaa