GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:02:30 Sequence gi568815593f:175583234_175785451 : 202218 bp : 47.26% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1844 1976 133 1 1 75 45 83 0.340 3.00 1.02 Intr + 11010 11148 139 1 1 107 75 39 0.082 4.02 1.03 Term + 11385 11403 19 0 1 100 54 7 0.073 -3.71 1.04 PlyA + 12044 12049 6 1.05 2.04 PlyA - 12984 12979 6 1.05 2.03 Term - 13426 13174 253 0 1 78 49 128 0.662 3.01 2.02 Intr - 13998 13875 124 1 1 70 55 106 0.830 5.14 2.01 Init - 23118 23031 88 0 1 73 57 53 0.170 1.50 2.00 Prom - 25635 25596 40 -5.76 3.00 Prom + 30095 30134 40 -4.96 3.01 Sngl + 37289 37678 390 1 0 66 33 522 0.415 38.72 3.02 PlyA + 37965 37970 6 1.05 4.11 PlyA - 38615 38610 6 1.05 4.10 Term - 40606 40551 56 2 2 121 34 37 0.075 -0.68 4.09 Intr - 56790 56633 158 2 2 115 33 69 0.481 3.75 4.08 Intr - 60864 60766 99 2 0 84 30 123 0.840 5.33 4.07 Intr - 64672 64512 161 1 2 121 74 28 0.340 3.49 4.06 Intr - 74503 74423 81 1 0 131 51 38 0.607 4.23 4.05 Intr - 75371 74905 467 0 2 70 39 163 0.240 2.45 4.04 Intr - 75560 75446 115 2 1 78 18 118 0.159 3.82 4.03 Intr - 76472 76385 88 1 1 34 62 55 0.001 -2.53 4.02 Intr - 81302 81069 234 1 0 70 97 72 0.015 3.10 4.01 Init - 85485 85367 119 0 2 70 73 124 0.751 6.77 4.00 Prom - 92004 91965 40 -4.86 5.00 Prom + 92374 92413 40 -4.96 5.01 Init + 100001 101076 1076 1 2 80 90 1324 0.737 125.65 5.02 Intr + 104911 105022 112 1 1 67 92 9 0.265 -0.42 5.03 Intr + 112403 112742 340 1 1 90 60 223 0.961 14.75 5.04 Term + 116969 117084 116 0 2 72 47 98 0.682 2.83 5.05 PlyA + 117495 117500 6 1.05 6.00 Prom + 124019 124058 40 -3.56 6.01 Init + 127710 127769 60 2 0 71 111 48 0.899 6.65 6.02 Intr + 129180 129364 185 2 2 14 44 179 0.565 4.69 6.03 Term + 136771 137605 835 1 1 54 42 318 0.665 16.26 6.04 PlyA + 137614 137619 6 1.05 7.00 Prom + 137715 137754 40 -10.55 7.01 Sngl + 137872 138987 1116 0 0 79 47 295 0.843 21.28 7.02 PlyA + 139791 139796 6 1.05 8.00 Prom + 142555 142594 40 -4.26 8.01 Init + 149776 149933 158 0 2 45 73 151 0.406 8.68 8.02 Intr + 152640 152719 80 0 2 75 52 60 0.113 0.29 8.03 Intr + 157138 157305 168 2 0 29 62 106 0.181 2.02 8.04 Intr + 160298 160391 94 0 1 19 72 118 0.202 2.32 8.05 Intr + 163710 163833 124 0 1 121 81 3 0.250 3.49 8.06 Intr + 164999 165058 60 1 0 116 24 52 0.039 0.53 8.07 Intr + 167803 167911 109 0 1 60 55 109 0.492 4.66 8.08 Intr + 192184 192257 74 2 2 121 21 105 0.017 6.23 8.09 Term + 200732 200848 117 1 0 114 35 30 0.004 -1.16 8.10 PlyA + 201835 201840 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 165787 165868 82 0 1 87 107 74 0.845 10.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:175583234_175785451|GENSCAN_predicted_peptide_1|96_aa MDKQTRPFSAGALIIIEDRHPSSEQMNAMTTGSRICNEENKTGQCMFLLTMLVASDSPIH FKPSSVRLFPQLHTKAVLVKVTNDSTSTDPRAPSQI >gi568815593f:175583234_175785451|GENSCAN_predicted_CDS_1|291_bp atggacaaacagacaagacctttctctgctggagccctcatcataatagaagacagacac ccatcaagtgaacagatgaatgcaatgacaactggcagcaggatctgcaatgaagaaaat aagactggacaatgcatgttcctgctgaccatgcttgtggcttccgattctcccattcac tttaaacccagttcagtcaggcttttcccccaactccacaccaaagctgttcttgtcaaa gtcaccaatgactccacatcaacagatcccagagctcccagccagatttga >gi568815593f:175583234_175785451|GENSCAN_predicted_peptide_2|154_aa MQPEATRFLTSSIASVDNIIIVIPEIGVRERKAHVGIPSVPGPGGAGTAEYFTDLTIEEP ESQVLSKTLRRGNGTLALLVTAFWWLSTTYTRGSQNKTPRPAAAAAPENLLEKQILGPTP DLLTQNLRDGAQISVFHKPPGGPDAGSTQDCSFK >gi568815593f:175583234_175785451|GENSCAN_predicted_CDS_2|465_bp atgcagccagaggccacaagattcctaacctcctcaattgcttctgtagataacatcatt attgtaatacctgagattggtgttcgagaaagaaaggctcatgtgggaatccccagcgtc ccaggccccgggggagcaggcactgctgagtacttcaccgatttgacaattgaggaacct gagtcccaggtcttgtccaagacactgagaaggggaaatgggactctggctctcctggtc acagccttttggtggctttccacgacctacaccagggggtctcaaaataagacccccaga ccagctgcagcagcagcacctgagaacctgttagaaaagcaaattcttggccccacccca gacctactgacccagaacctcagagatggtgcccagatatctgtatttcacaagcccccg ggtggtcctgatgctggctccacacaggattgttcatttaagtga >gi568815593f:175583234_175785451|GENSCAN_predicted_peptide_3|129_aa MMIMVMVMVVTVMVMVRMMMMMVMVVMVRMTMVMVRMMIAMVNDDNYYCGDYNYGDDGED DDGEDDDGKDDDGDGDNVGNGDDGDGDDKDNDGGTDYGDDDDAEDEDDDDDSDCYVLNVF VLAKIHVET >gi568815593f:175583234_175785451|GENSCAN_predicted_CDS_3|390_bp atgatgatcatggtgatggtgatggtggtgacagtgatggtgatggtgagaatgatgatg atgatggtgatggtggtgatggtgaggatgacaatggtgatggtgaggatgatgatagcg atggtgaatgatgataattattactgtggtgattataattatggtgatgatggtgaggat gatgatggtgaggatgatgatggtaaggatgatgatggtgatggtgataatgttggtaat ggtgatgatggtgatggtgatgataaggataatgatggtggtactgattatggtgatgat gatgatgctgaggatgaggatgatgatgatgatagtgattgctatgttttaaatgttttt gtccttgccaaaattcatgttgaaacttaa >gi568815593f:175583234_175785451|GENSCAN_predicted_peptide_4|525_aa MAGLLLTDQPGPFLLAGPSPRLSSQSPGPDEERHPSQQLQSSLIILNALNTPGHFAALFP CARPFYIWNTQTLCVYLLHSHAPVCFPHYAGILSGVSIAVSSVHCPGLSTSRAPGSICHI LGPARYNQKHPGGGQTWFQSLNRHFLPANPRPDVPSKDSHPLSAACCGSGHRSSGSGRQP SGMKAESCSMLRQTHRREKMPPPRPGDKTGTVSAQVHFPNPPFAGKGDIKPTRVPSTAVH RGCIPSSQRPLGEGTTPAVTDKETEARRPGGGTCPRSGTARERWGRGSNPGSGPRRPPGI PSSPSGGSASPRRVPKPAWDPTPSRGPAGRRRPRYLVLLPRIVSPPLTGQPPPVQILAEW PQLEDLPRVRQPYCDHEGKTMRTEGKWTQNPDIVRLLEKPWKHLLLNVFACELMKHPMVK LLDIDNELCKEYLFLHPLEDDDDGDGGGYQLHSIYNACWLNTTGSQGTRELFDVAGFCSG ERVWRDRWTLPIMAGKPYIALQESDGRHEEIESSEFEAVTPQKQA >gi568815593f:175583234_175785451|GENSCAN_predicted_CDS_4|1578_bp atggctggactcctcctcactgaccagcctgggcccttcctcctagccggcccctcaccg cgactctcatctcagtctccaggacctgatgaagaaaggcatccttcacagcagctgcag tcaagcctaatcatcctgaatgccctgaacacccctggacattttgcagctctgtttcct tgtgcacgccctttctacatctggaatacccagacgctctgtgtctatctgctgcattcc catgcacctgtctgctttccacactatgctggaatcttatcaggtgtatccattgctgta tcttcagtgcactgcccaggactgagcacaagcagggccccaggaagcatttgtcatatt ttggggccagcacgatacaaccagaagcaccctggaggcgggcagacctggttccagtct ctgaacaggcacttcctaccggcaaatccacgtccagatgtccccagcaaggactcccac ccgctgagcgctgcctgttgtggctccggtcatcggtcatccggcagcgggcggcagccc agcgggatgaaggcagaaagttgctccatgctccgacagacacatcgccgcgagaaaatg ccaccacccagacccggagacaaaacggggactgtttcggcgcaagtgcactttccaaat ccacccttcgcgggcaaaggggatataaagccgacccgcgtcccgagcactgctgtgcac cggggctgcatcccgagctctcaacgcccccttggggaaggcacgacccccgcggtcaca gacaaggaaactgaggctcgcaggccaggcggaggcacctgcccaaggtcaggcacagcc agggagcggtggggccgaggctcgaacccgggttcggggccacgacggccgccggggatc ccgagcagcccctcgggcggctccgccagcccacgccgggtccccaagcccgcctgggac cccaccccgagccgaggccccgcggggaggcggcggccgcgttacctggtgctgctgccg cggattgtgagcccgcccttgaccgggcaaccgccacctgtgcagatattggctgagtgg ccacagctggaggacctgccccgggtgaggcagccatattgtgaccatgagggaaagacc atgagaactgaagggaaatggactcagaacccagatattgtaaggctcctggagaaaccc tggaaacatctacttctcaacgttttcgcttgtgagctaatgaaacaccctatggttaag ctacttgatattgacaatgaactatgtaaagagtacctttttctccaccctctagaggac gatgatgatggtgatggtggtggctaccagttacacagcatttacaatgcctgttggctg aacacaacaggaagccaggggacgagggagctctttgatgtagcaggcttttgcagtgga gagagggtctggagggacagatggacattacccatcatggcagggaagccctacatcgct ctccaggagagtgatggcagacatgaggaaatagaatcttctgaatttgaagctgttacc ccccaaaaacaggcttga >gi568815593f:175583234_175785451|GENSCAN_predicted_peptide_5|547_aa MAPNGTASSFCLDSTACKITITVVLAVLILITVAGNVVVCLAVGLNRRLRNLTNCFIVSL AITDLLLGLLVLPFSAIYQLSCKWSFGKVFCNIYTSLDVMLCTASILNLFMISLDRYCAV MDPLRYPVLVTPVRVAISLVLIWVISITLSFLSIHLGWNSRNETSKGNHTTSKCKVQVNE VYGLVDGLVTFYLPLLIMCITYYRIFKVARDQAKRINHISSWKAATIREHKATVTLAAVM GAFIICWFPYFTAFVYRGLRGDDAINEVLEAIVLWLGYANSALNPILYAALNRDFRTGYQ QLFCCRLANRNSHKTSLRSNASQLSRTQSREPRQQEEKPLKLQVWSGTEVTAPQGATDRK TLAIARRPAWVTHATLEPQLKFHNVIETAKSLLPRKTGCGGLPAASDIGRLLVPRTHISK RALEVLRGQQTKAEGVDLKEKGPPAFKDGPGNREEGRGVFEGAEEEPSERCPGPRPEPVR GYRASKGLKPKKLARRFRGYTQDTALNSPGFLPTRLLQAGGEKVDGEIKPESGYRLPETA GEKLDCL >gi568815593f:175583234_175785451|GENSCAN_predicted_CDS_5|1644_bp atggcacccaatggcacagcctcttccttttgcctggactctaccgcatgcaagatcacc atcaccgtggtccttgcggtcctcatcctcatcaccgttgctggcaatgtggtcgtctgt ctggccgtgggcttgaaccgccggctccgcaacctgaccaattgtttcatcgtgtccttg gctatcactgacctgctcctcggcctcctggtgctgcccttctctgccatctaccagctg tcctgcaagtggagctttggcaaggtcttctgcaatatctacaccagcctggatgtgatg ctctgcacagcctccattcttaacctcttcatgatcagcctcgaccggtactgcgctgtc atggacccactgcggtaccctgtgctggtcaccccagttcgggtcgccatctctctggtc ttaatttgggtcatctccattaccctgtcctttctgtctatccacctggggtggaacagc aggaacgagaccagcaagggcaatcataccacctctaagtgcaaagtccaggtcaatgaa gtgtacgggctggtggatgggctggtcaccttctacctcccgctactgatcatgtgcatc acctactaccgcatcttcaaggtcgcccgggatcaggccaagaggatcaatcacattagc tcctggaaggcagccaccatcagggagcacaaagccacagtgacactggccgccgtcatg ggggccttcatcatctgctggtttccctacttcaccgcgtttgtgtaccgtgggctgaga ggggatgatgccatcaatgaggtgttagaagccatcgttctgtggctgggctatgccaac tcagccctgaaccccatcctgtatgctgcgctgaacagagacttccgcaccgggtaccaa cagctcttctgctgcaggctggccaaccgcaactcccacaaaacttctctgaggtccaac gcctctcagctgtccaggacccaaagccgagaacccaggcaacaggaagagaaacccctg aagctccaggtgtggagtgggacagaagtcacggccccccagggagccacagacagaaag acactcgcaattgcacggaggcccgcctgggtaacccatgctactctcgagcctcagctc aaattccataatgtcatcgaaactgcaaagtcccttttgccgcgtaagacaggctgtggg ggcctccctgctgccagcgacatcggaaggttattagttcctcgaacccacatcagcaag agggccctagaagtgctgcgggggcagcaaactaaggcagagggtgtagacctcaaggag aaggggccaccagcatttaaggatgggccagggaacagggaagaggggagaggagtcttc gaaggggctgaggaggaaccatctgagaggtgcccaggtccaaggcctgagccagtcaga ggctacagagccagtaaaggtctcaaaccaaagaaactggcacggcgattcaggggctac acccaagacacagcgctcaacagcccagggttcctgcccacaaggctcttacaggctggc ggtgagaaggtagatggtgaaataaagccagaaagtgggtaccggctaccggagacagcc ggggagaagctggactgcctctga >gi568815593f:175583234_175785451|GENSCAN_predicted_peptide_6|359_aa MREGLADQKDLETQVVGRSLQSTRKFPVLGVAIKTTVEGEFIIRIVQMREWSSESSGGSP HVTQAGKHRWGHTFATGIPSIQRQEITKIRAELKEIETQKTLQKINESRSWFFERINKID KPLARLIKKKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLD TYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSAGPDGFTAEFYQRYKEELVPFLLK LFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDTKILNKILANRIQQH IKKLIHHDQVGFIPGMQGWFNIHKSINVIQHINRTKDKNHMIISIDAEKALTKFNNASC >gi568815593f:175583234_175785451|GENSCAN_predicted_CDS_6|1080_bp atgagggaagggctggctgatcagaaggacctggagacccaggtagttgggagaagcttg caaagcacccggaaattccccgtgcttggtgtggccatcaagactactgtcgagggagag ttcatcatccgcattgtccagatgagggaatggagctcagagagttcaggtggctctccc catgtcacacaagcaggaaagcatcgctggggacacacatttgccaccggaattccaagc atccaaaggcaagaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaa acccttcaaaaaattaatgaatccaggagctggttttttgaaaggatcaacaaaattgat aaaccgctagcaagactaataaagaaaaaaagagagaagaatcaaatagacgcaataaaa aatgataaaggggatatcaccaccgatcccacagaaatacaaactaccatcagagaatac tacaaacacctctatgcaaataaactagaaaatctagaagaaatggataaattcctcgac acatacactctcccaagactaaaccaggaagaagttgaatctctgaatagaccaataaca ggctctgaaattgtggcaataatcaatagcttaccaaccaaaaagagtgcaggaccagat ggattcacagccgaattctaccagaggtacaaggaggaactggtaccattccttctgaaa ctattccaatcaatagaaaaagagggaatcctccctaactcattttatgaggccagcatc atcctgataccaaagccgggcagagacacaaccaaaaaagagaattttagaccaatatcc ttgatgaacattgacacaaaaatcctcaataaaatactggcaaaccgaatccagcagcac atcaaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttc aatatacacaaatcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccac atgattatctcaatagatgcagaaaaggccttgacaaaattcaacaacgcttcatgctaa >gi568815593f:175583234_175785451|GENSCAN_predicted_peptide_7|371_aa MIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS KRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNMPCSWVGRINIMKMAILPKVIY RFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHITKSILSQKNKAGGITLPDFKLYYKATV TKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAI CRKLKLDPFLTPYTKINSRWIKDLNVGPKTIKTLEENLGFTIQDIGMGKDFMSKTPKAMA TKAKIDKWDLIQLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKK KTTPSKSGRRT >gi568815593f:175583234_175785451|GENSCAN_predicted_CDS_7|1116_bp atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac accaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggag aactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaatatgcca tgctcatgggtaggaagaatcaatatcatgaaaatggccatactgcccaaggtaatttac agattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcccacatcaccaagtcaatcctaagccaa aagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagag ccctcagaaataacgccgcatatctacaactatctgatctttgacaaacctgaaaaaaac aagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagccata tgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaagatgg attaaagacttaaacgttggacctaaaaccataaaaaccctagaagaaaacctaggcttt accattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggca acaaaagccaaaattgacaaatgggatctaattcaactaaagagcttctgcacagcaaaa gaaactaccatcagagtgaacaggcaacctacaaaatgggagaaaattttcgcaacctac tcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaagaaa aaaacaaccccatcaaaaagtgggcgaaggacatga >gi568815593f:175583234_175785451|GENSCAN_predicted_peptide_8|327_aa MERPEKHVIPPQRDWTEKGAQKVLKGHVPNFSGEMALELRGVGVVPVEEEGGGFSPSFKA VLNATFLRKPSLARILPFPGLMYYQTLGWLCGHHANILEGIQRRKSVVKIKASNLGHQTP EAASIPNYFWGLQAQVFELAYHTCGLYSCDGGYKDLYGEDGWCVQEGNKAQILLRVEVCA VKRLHFPSFPSSKACPHDCILANDVQMEGRWDPCDIKVADTTMEEGLELWEVSIAIRSIV QMFQNIKLMLRDMENLLKVTGKPEASLKLSRKQPMASDNDPKITTNVKKVFNGWVMPTHI GEAICFTWSTNSNANHFQKHLHRHTQK >gi568815593f:175583234_175785451|GENSCAN_predicted_CDS_8|984_bp atggagagaccagagaaacatgtgattccaccccagcgggactggacagaaaagggtgct cagaaggtgttgaaggggcacgtgcccaacttctcaggggagatggcattggagctgaga ggtgtgggagttgtccccgtggaggaggaaggtggaggcttctccccatccttcaaggct gtgctcaacgccaccttcctaaggaagccatcgctggctcggattctcccgtttccagga cttatgtactaccagacactgggatggctctgtggtcatcatgcaaacatcctagaagga atccaaaggaggaaatcagtggtgaaaatcaaggccagcaacctgggacaccagactcca gaggctgcctccatccccaactacttctggggtctccaagcacaagtattcgagttggct tatcacacgtgtggcctctacagctgtgatggtggctacaaagacctctatggggaagac ggctggtgtgtccaggaagggaacaaagcccagattttattgagggtggaagtgtgcgca gttaaaagactacatttcccatccttccctagcagcaaggcgtgcccacatgactgtatc ctggccaatgatgtacaaatggaaggaaggtgggatccatgtgacatcaaggtggctgat acaaccatggaagagggtctagagctctgggaggtgagcattgccattcgctccattgta cagatgttccaaaatataaaactgatgctcagagacatggagaacctgctcaaggtcaca ggaaagccagaggcctccctgaaactttcaaggaagcagccgatggcctctgacaatgat ccaaaaattacgaccaacgtcaaaaaagtcttcaatggatgggtgatgcccacacatatt ggggaggccatctgctttacttggtcaaccaattcaaatgctaatcacttccagaaacat cttcacagacacacccagaaataa