GENSCAN 1.0 Date run: 3-Nov-116 Time: 13:28:04 Sequence gi568815591f:80035161_80317490 : 282330 bp : 36.92% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 523 518 6 1.05 1.01 Sngl - 9609 9226 384 0 0 53 44 316 0.976 19.64 1.00 Prom - 13281 13242 40 -5.45 2.02 PlyA - 13406 13401 6 1.05 2.01 Sngl - 14311 13988 324 1 0 45 31 198 0.764 5.55 2.00 Prom - 26008 25969 40 -4.85 3.03 PlyA - 26264 26259 6 1.05 3.02 Term - 31273 31078 196 0 1 49 48 189 0.425 6.90 3.01 Init - 37011 36932 80 0 2 68 75 72 0.267 4.48 3.00 Prom - 39841 39802 40 -3.55 4.00 Prom + 41142 41181 40 -3.95 4.01 Init + 61639 62006 368 0 2 34 47 303 0.076 17.34 4.02 Term + 70490 70619 130 2 1 52 49 208 0.822 9.97 4.03 PlyA + 71128 71133 6 1.05 5.00 Prom + 71857 71896 40 -6.35 5.01 Init + 76136 76218 83 1 2 62 77 127 0.392 9.49 5.02 Intr + 87732 87801 70 0 1 85 34 41 0.009 -3.33 5.03 Intr + 99060 99152 93 2 0 111 97 51 0.781 7.54 5.04 Intr + 99287 99408 122 1 2 20 101 121 0.062 4.97 5.05 Intr + 99868 100118 251 1 2 51 99 177 0.062 11.26 5.06 Intr + 126778 126907 130 2 1 63 76 66 0.008 1.73 5.07 Intr + 148908 149030 123 0 0 51 97 50 0.034 0.98 5.08 Intr + 153930 154071 142 0 1 81 115 63 0.982 7.73 5.09 Intr + 164065 164222 158 0 2 56 92 178 0.963 12.89 5.10 Term + 168544 168676 133 1 1 71 48 37 0.001 -5.42 5.11 PlyA + 168730 168735 6 1.05 6.00 Prom + 172834 172873 40 -5.05 6.01 Init + 175810 175942 133 0 1 31 14 237 0.001 10.95 6.02 Term + 185261 185373 113 0 2 85 52 117 0.483 5.54 6.03 PlyA + 186088 186093 6 1.05 7.00 Prom + 192704 192743 40 -3.45 7.01 Sngl + 216707 216991 285 1 0 58 47 256 0.988 13.79 7.02 PlyA + 217249 217254 6 1.05 8.00 Prom + 220338 220377 40 -2.95 8.01 Init + 229268 229333 66 1 0 62 68 65 0.079 3.02 8.02 Term + 252104 252301 198 1 0 69 44 135 0.383 3.62 8.03 PlyA + 252349 252354 6 1.05 9.04 PlyA - 254386 254381 6 1.05 9.03 Term - 258353 258040 314 0 2 21 54 274 0.546 11.48 9.02 Intr - 277255 277187 69 2 0 51 106 41 0.296 0.44 9.01 Init - 277468 277384 85 1 1 54 61 82 0.484 3.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 264544 264473 72 1 0 34 38 124 0.839 -1.07 S.002 Init - 265480 265397 84 1 0 64 72 84 0.879 5.27 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:80035161_80317490|GENSCAN_predicted_peptide_1|127_aa MNGNALMSRQRCAAGAEPSWGTSVRAVQKGNVRSKPPHRTPTGALPSGAVRRGLPSSRCQ NGRSTSSLHWAPGKAIGTECQPMKAARKGAVSCKVTGSELLKAMGAHLLHLHDLDVRHEV KELILEL >gi568815591f:80035161_80317490|GENSCAN_predicted_CDS_1|384_bp atgaatggaaacgcgttgatgtctaggcagaggtgtgctgcaggggcagagccctcatgg ggaacctctgttagggcagtgcagaagggaaatgtgaggtccaagcccccacacagaacc cccactggggcactgcctagtggagctgtgagaagagggctaccatcctccagatgccag aatggtagatctaccagcagtttgcactgggcacctggaaaagctataggaactgaatgc cagcccatgaaagcagccaggaagggggctgtatcctgcaaagtgacaggatcggagctg ctgaaggctatgggagcccacctcttacatctacatgacctggatgtaagacatgaagtc aaagaactcattttggagctttaa >gi568815591f:80035161_80317490|GENSCAN_predicted_peptide_2|107_aa MLPFGLLHPPSCTHINPKPQAPQADEQTSSRAEKQKSGRTVQQRRREEKEHLNIERSLAG DGQRGDPRDGQTPGEDHLSTVSPFQLPIHPAESHLHLTKKSPTFTII >gi568815591f:80035161_80317490|GENSCAN_predicted_CDS_2|324_bp atgttgccttttggcctgctacaccccccatcctgtacccatataaaccccaaaccccag gctccacaagcggatgagcagacaagcagcagagcagagaagcagaagagtggcagaaca gtgcagcagagaaggagagaagagaaggagcatctgaacattgagaggagtttggctggg gatggtcagagaggagatccaagggatggccaaactccaggagaagatcatctttccact gtatcccctttccagctccccatccatcctgctgagagccacctccatctcacaaaaaaa tcccccacatttaccatcatttaa >gi568815591f:80035161_80317490|GENSCAN_predicted_peptide_3|91_aa MIELMTNIQSTLGKFLRPVGRLEMSDVPQNGRSTGSLHPAPGKATGIQLRPVKAALGAES CKATGAELPKFLEAHPLQQCALNVGNGVKRD >gi568815591f:80035161_80317490|GENSCAN_predicted_CDS_3|276_bp atgattgaactgatgacaaatatccaaagcaccttgggaaaatttttacggcctgtaggc cgcctagaaatgtctgatgtaccccaaaatggtagatccactggcagcttgcaccctgca cctggaaaagctacaggcattcaactccgacctgtgaaagcagccctgggggctgaatcc tgcaaagccacaggggctgagctgcccaagttcttggaagcccaccccttgcagcagtgt gccctaaatgtgggaaatggagtcaaacgagattaa >gi568815591f:80035161_80317490|GENSCAN_predicted_peptide_4|165_aa MVLVLKLSADKHDDLTNVDPGHSTLGFSKGPSHTHLEPFSPSTGLHLVDADDMDGMELHS DVKAISAPTFHHVLIGTNTDSLQGFRRQQLIFIRHHVATEWELIHFCLLSTQRYRSQQKR DFGAYVAEQDKDKEEEEEEEEEEGGKEEGYWNTEIDTHTGEMEGM >gi568815591f:80035161_80317490|GENSCAN_predicted_CDS_4|498_bp atggtccttgttctgaagcttagtgcggataaacatgatgacttgaccaatgtggaccct ggccacagtaccctggggttttccaaaggcccctcacatacccatctggagcctttcagc cccagcacagggctacatcttgttgatgcagatgacatggatgggatggagctacactcg gatgtgaaagccatctctgccccaacttttcaccatgtacttattggcacaaatacggac agcctccagggcttcagaagacagcagctcatattcatcagacaccatgtggccacagag tgggaactcatccacttttgccttctttcgacccaaagatacagatctcagcagaaacga gactttggagcctatgtggcagagcaagacaaagacaaagaagaagaagaggaggaagag gaagaggaaggtggcaaagaggagggatattggaacacagagatagatacacacacagga gaaatggaaggaatgtga >gi568815591f:80035161_80317490|GENSCAN_predicted_peptide_5|434_aa MANTTDWTMHEITSQQSAGSTGNTANERDALSTTSEDKKPKDRYPSSPPGKVTRKSFKMR SCEKRESARKEGGIGRRKEKQKPTGICILKNLAWLPALGMLKRPELAFFGNFGSSPTPTR RGRKWWGRRGPRREAFERPLGERKDSPVLGARTRARRERRQALAFGTMGCTLSAEDKAAV ERSKMIDRNLREDGEKAAREVKLLLLGSINTRLFNSYRVSVLWTECVPPKLYVEALTHSV MAFGGGAFGRRPVGGGGGLLCVPVTSSESDASAATSILTSSEERIRPRGIRIIHEAGYSE EECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAEL AGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTTG IVETHFTFKDLHFK >gi568815591f:80035161_80317490|GENSCAN_predicted_CDS_5|1305_bp atggcaaataccacagactggacaatgcacgaaattaccagtcaacagtcagcagggtca acaggaaatacggcaaatgaaagagatgccttgagtacaacctcagaagacaaaaagcca aaagacaggtaccccagcagcccccctgggaaagttacgaggaaatccttcaaaatgcgt tcttgtgaaaaacgtgagagtgcaaggaaagagggagggataggaagaaggaaagagaag caaaagccgacagggatttgcattttgaaaaaccttgcgtggctgccggcgctgggcatg ctcaaacgcccagaactggcttttttcggcaactttggcagctctcccacccccacccga aggggcaggaagtggtgggggcggcgtggcccccgtcgggaggcgttcgaacgcccgcta ggagagagaaaggattcccctgtgcttggagcccgcactcgggcgcggagggagcggcgg caggctctcgctttcggcaccatgggctgcacgctgagcgccgaggacaaggcggcggtg gagcggagtaagatgatcgaccgcaacctccgtgaggacggcgagaaggcggcgcgcgag gtcaagctgctgctgctcggatctataaataccaggttatttaatagttatcgagtgtct gtgctgtggactgaatgtgtccctccgaaattgtatgttgaagccctaacccacagtgtg atggcatttggaggtggggcatttgggaggagaccagtgggcggagggggaggactcctc tgtgtgcctgttaccagtagcgaatctgatgcatctgcagcaacctcaattcttacctcc tcagaagaaagaattcggccgaggggcataagaattatccatgaagctggttattcagaa gaggagtgtaaacaatacaaagcagtggtctacagtaacaccatccagtcaattattgct atcattagggctatggggaggttgaagatagactttggtgactcagcccgggcggatgat gcacgccaactctttgtgctagctggagctgctgaagaaggctttatgactgcagaactt gctggagttataaagagattgtggaaagatagtggtgtacaagcctgtttcaacagatcc cgagagtaccagcttaatgattctgcagcatactatttgaatgacttggacagaatagct caaccaaattacatcccgactcaacaagatgttctcagaactagagtgaaaactacagga attgttgaaacccattttactttcaaagatcttcattttaagtga >gi568815591f:80035161_80317490|GENSCAN_predicted_peptide_6|81_aa MFDVGGQRSERKKWIHCFEGVTAIIFCVALSDYDLVLAEDEEMARCPSGNYMILIELPIA FLTSPSLGLDLTNPSPPALDE >gi568815591f:80035161_80317490|GENSCAN_predicted_CDS_6|246_bp atgtttgatgtgggaggtcagagatctgagcggaagaagtggattcattgcttcgaagga gtgacggcgatcatcttctgtgtagcactgagtgactacgacctggttctagctgaagat gaagaaatggcaaggtgcccttctgggaactacatgattctgatagagctgccaatcgca tttcttacttctccgtctctaggcttagacctgaccaatccgagtcctccagcattggac gagtga >gi568815591f:80035161_80317490|GENSCAN_predicted_peptide_7|94_aa MTKAILQLIRQKILRDSYKQLYVHKLVNLEEMDKFLEAHNLPGLNQEEIETLNRPISISE NESVITYEQNEALDQMDLQLNSTRYTKKNRHQSY >gi568815591f:80035161_80317490|GENSCAN_predicted_CDS_7|285_bp atgacaaaggcaatactacaacttatacgacagaagatcctcagagactcctataaacaa ctctatgtacacaaattagtaaatctggaggaaatggataaattcctggaggcacacaat ctcccaggattgaatcaggaagagattgaaactctgaataggccaatatcaatttctgaa aatgaatcagtaataacctacgaacaaaatgaagccctggaccagatggatttacagctg aattctaccagatatacaaagaaaaaccggcaccaatcctactga >gi568815591f:80035161_80317490|GENSCAN_predicted_peptide_8|87_aa MYIWQLKFFTTLCMPVNDSKCAWLHGMKGESVHFTGGENSDCRTLHWNLVLSCHSGKQQD KNSAGATEGAFRPTLARGKSSMPADRN >gi568815591f:80035161_80317490|GENSCAN_predicted_CDS_8|264_bp atgtacatctggcaactcaaatttttcaccacactgtgcatgcccgtgaatgacagcaaa tgtgcttggctgcatggcatgaagggagaatctgtgcatttcacgggtggagagaacagt gattgtaggactttgcattggaacttagtgctgtcctgtcacagtggaaagcaacaggat aagaattcagctggtgccacagagggagcatttagaccaactctagccagagggaaatca tccatgccagcagacagaaattga >gi568815591f:80035161_80317490|GENSCAN_predicted_peptide_9|155_aa MPSGNIPKGSEDGGKTGQREKLTHIAVTKGITLSEAHYEMSEAYFRNSLEMGRDWNSLEG SEEDRKVWESLELPRDLLNGFAQNADSNTDNKVQTEVASDGNEELVGNWSKDESSYVLAK RLAAFCPCSRDLWNFEVERDDLGFLAEEISKEQSI >gi568815591f:80035161_80317490|GENSCAN_predicted_CDS_9|468_bp atgccctcaggaaatatacctaaaggaagtgaagatggaggcaagactggacagagggag aagctgactcacattgcagttacaaagggcataaccttgagtgaggcacactatgaaatg tcagaagcctatttccgcaacagcttggagatgggcagggattggaacagtttggagggc tcagaagaagacagaaaagtgtgggaaagtttggaacttcctagagacttgttgaatggc tttgcccaaaatgctgatagcaatacggacaataaagtccagactgaggtggcctcagat ggaaatgaggaacttgttgggaactggagcaaagatgaatcttcttatgttttagcaaag agactggcagcattttgcccctgctctagagacttgtggaactttgaggttgagagagat gatttagggtttctggcagaagaaatttctaaggagcaaagcatttga