GENSCAN 1.0	Date run: 3-Nov-116	Time: 19:36:56

Sequence gi568815587f:61103565_61222400 : 118836 bp : 49.76% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -   2111   2106    6                               1.05
 1.03 Term -   5454   5355  100  2  1   90   44    74 0.224   0.80
 1.02 Intr -   7390   7216  175  2  1   84   50   235 0.613  18.30
 1.01 Init -   8910   8862   49  0  1   77   58    25 0.753  -2.49
 1.00 Prom -  10549  10510   40                              -6.16

 2.00 Prom +  12661  12700   40                              -3.06
 2.01 Init +  14122  14170   49  0  1   86   58    18 0.287  -2.29
 2.02 Intr +  14611  14916  306  2  0  105   80   224 0.971  19.72
 2.03 Intr +  15336  15413   78  1  0   94  110    57 0.929   8.02
 2.04 Intr +  15670  16011  342  2  0  115  111   245 0.970  24.90
 2.05 Intr +  18047  18340  294  0  0  112   84   334 0.988  32.38
 2.06 Intr +  19343  19468  126  0  0  145   51   236 0.999  26.25
 2.07 Intr +  20320  20373   54  2  0   80   98    82 0.994   7.35
 2.08 Intr +  21468  21587  120  1  0   44  115    67 0.378   5.47
 2.09 Term +  22187  22275   89  0  2   82   43    54 0.284  -1.88
 2.10 PlyA +  24183  24188    6                               1.05

 3.09 PlyA -  24854  24849    6                              -0.45
 3.08 Term -  25695  25612   84  0  0  135   48     4 0.249  -1.25
 3.07 Intr -  28225  28100  126  1  0   78   46   106 0.686   6.28
 3.06 Intr -  28975  28790  186  1  0  109   78   182 0.686  19.19
 3.05 Intr -  29773  29691   83  2  2   86   78    82 0.925   6.36
 3.04 Intr -  30643  30472  172  1  1   87   45   301 0.963  25.22
 3.03 Intr -  35271  35173   99  0  0   89   65   159 0.539  14.01
 3.02 Intr -  42003  41856  148  2  1   30   64    89 0.236   0.94
 3.01 Init -  47649  47567   83  0  2   59   51    85 0.110   2.34
 3.00 Prom -  49714  49675   40                              -5.76

 4.00 Prom +  55474  55513   40                              -7.36
 4.01 Init +  57714  58114  401  2  2   88  111   221 0.772  18.54
 4.02 Intr +  71601  71828  228  0  0   58   75   110 0.046   3.68
 4.03 Intr +  82933  83016   84  1  0   91   77    30 0.151   1.14
 4.04 Term +  91315  91426  112  1  1   94   49    62 0.299   0.93
 4.05 PlyA +  92963  92968    6                               1.05

 5.00 Prom +  99912  99951   40                              -0.76
 5.01 Init + 100001 100056   56  1  2   90   89   113 0.999  10.42
 5.02 Intr + 100543 100705  163  1  1   96   76   250 0.950  24.58
 5.03 Intr + 102946 103063  118  0  1   87   92   213 0.984  21.64
 5.04 Intr + 103933 104051  119  2  2   83   80   273 0.999  26.18
 5.05 Intr + 104727 104926  200  2  2   73   96   247 0.999  22.15
 5.06 Intr + 106278 106394  117  0  0  102   94   162 0.612  17.78
 5.07 Intr + 107526 107670  145  0  1   75   86   300 0.987  28.78
 5.08 Intr + 107773 107871   99  0  0  114   94   232 0.999  26.71
 5.09 Intr + 108989 109114  126  1  0   95   57   145 0.295  12.98
 5.10 Intr + 110597 110681   85  1  1   25   72    71 0.217  -1.41
 5.11 Term + 114722 114882  161  0  2  126   38    52 0.533   2.10
 5.12 PlyA + 116798 116803    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:61103565_61222400|GENSCAN_predicted_peptide_1|107_aa
MRFHHVGQAGLELLTSDEQQFLSSPSISITVTIIITITVTIIITITVTITNTLKVFTLPG
LLHINSPSPHHSAMKSCRPCTLGNGGKGVACLGFGRHPDSQACDNDR

>gi568815587f:61103565_61222400|GENSCAN_predicted_CDS_1|324_bp
atgaggtttcaccatgttggccaggctggtcttgaactcctgacctcagatgagcagcag
ttcctctcctctccttccatctctatcaccgtcaccatcatcatcaccatcaccgtcacc
atcatcatcaccatcactgtcaccatcaccaacactttaaaagttttcactcttccgggc
cttttacacattaactctcccagtcctcaccacagtgctatgaagtcctgccgcccctgc
accctgggaaatggaggcaagggtgtggcctgcttagggtttggcaggcacccggattca
caggcctgtgataatgaccgatga

>gi568815587f:61103565_61222400|GENSCAN_predicted_peptide_2|485_aa
MGFHHVGQAGLKLLTSDFQARLTRSNSKCQGQLEVYLKDGWHMVCSQSWGRSSKQWEDPS
QASKVCQRLNCGVPLSLGPFLVTYTPQSSIICYGQLGSFSNCSHSRNDMCHSLGLTCLEC
LIAEPQKTTPPTTRPPPTTTPEPTAPPRLQLVAQSGGQHCAGVVEFYSGSLGGTISYEAQ
DKTQDLENFLCNNLQCGSFLKHLPETEAGRAQDPGEPREHQPLPIQWKIQNSSCTSLEHC
FRKIKPQKSGRVLALLCSGFQPKVQSRLVGGSSICEGTVEVRQGAQWAALCDSSSARSSL
RWEEVCREQQCGSVNSYRVLDAGDPTSRGLFCPHQKLSQCHELWERNSYCKKVFVTCQDP
NPAGLAAGTVASIILALVLLVVLLVVCGPLAYKKLVKKFRQKKQRQWIGPTGMNQNMSFH
RNHTATVRSHAENPTASHVDNEYSQPPRNSHLSAYPALEGALHRSSMQPDNSSDSDYDLH
GAQRL

>gi568815587f:61103565_61222400|GENSCAN_predicted_CDS_2|1458_bp
atggggtttcaccatgttggccaggctggtctcaaactcctgacctcagatttccaggca
aggctcacccgttccaactcgaagtgccagggccagctggaggtctacctcaaggacgga
tggcacatggtttgcagccagagctggggccggagctccaagcagtgggaggaccccagt
caagcgtcaaaagtctgccagcggctgaactgtggggtgcccttaagccttggccccttc
cttgtcacctacacacctcagagctcaatcatctgctacggacaactgggctccttctcc
aactgcagccacagcagaaatgacatgtgtcactctctgggcctgacctgcttagagtgt
ctcattgcagaaccccagaagacaacacctccaacgacaaggcccccgcccaccacaact
ccagagcccacagctcctcccaggctgcagctggtggcacagtctggcggccagcactgt
gccggcgtggtggagttctacagcggcagcctggggggtaccatcagctatgaggcccag
gacaagacccaggacctggagaacttcctctgcaacaacctccagtgtggctccttcttg
aagcatctgccagagactgaggcaggcagagcccaagacccaggggagccacgggaacac
cagcccttgccaatccaatggaagatccagaactcaagctgtacctccctggagcattgc
ttcaggaaaatcaagccccagaaaagtggccgagttcttgccctcctttgctcaggtttc
cagcccaaggtgcagagccgtctggtggggggcagcagcatctgtgaaggcaccgtggag
gtgcgccagggggctcagtgggcagccctgtgtgacagctcttcagccaggagctcgctg
cggtgggaggaggtgtgccgggagcagcagtgtggcagcgtcaactcctatcgagtgctg
gacgctggtgacccaacatcccgggggctcttctgtccccatcagaagctgtcccagtgc
cacgaactttgggagagaaattcctactgcaagaaggtgtttgtcacatgccaggatcca
aaccccgcaggcctggccgcaggcacggtggcaagcatcatcctggccctggtgctcctg
gtggtgctgctggtcgtgtgcggcccccttgcctacaagaagctagtgaagaaattccgc
cagaagaagcagcgccagtggattggcccaacgggaatgaaccaaaacatgtctttccat
cgcaaccacacggcaaccgtccgatcccatgctgagaaccccacagcctcccacgtggat
aacgaatacagccaacctcccaggaactcccacctgtcagcttatccagctctggaaggg
gctctgcatcgctcctccatgcagcctgacaactcctccgacagtgactatgatctgcat
ggggctcagaggctgtaa

>gi568815587f:61103565_61222400|GENSCAN_predicted_peptide_3|326_aa
MTIDEVGPQSQAWCSVDVDRLDRGSRAHLADGSEEQVMAVALVRERDLSFPGVGDAVVNP
TRWHLPAQPEMLYEGGEGRMETLKDKTLQELEELQNDSEAIDQLALESPEVQDLQLEREM
ALATNRSLAERNLEFQGPLEISRSNLSDRYQELRKLVERCQEQKAKLEKFSSALQPGTLL
DLLQVEGMKIEEESEAMAEKFLEGEVPLETFLENFSSMRMLSHLRRVRVEKLQEVVRKPR
ASQELAGDAPPPRPPPPSHPRSTFGPAIAEIRGSTGSGVGAPCGLASAPRGLGGTRHWAV
SKAEDTGHQGQLEAIWLCRKRLQISL

>gi568815587f:61103565_61222400|GENSCAN_predicted_CDS_3|981_bp
atgaccattgatgaagttggtccccagagccaggcctggtgttctgtggatgtagataga
cttgatcgaggcagccgagcacacctcgctgatggctctgaggagcaggtgatggctgtg
gcattggtgagagaaagggacctgtcatttccaggggtgggtgatgctgtggtaaacccc
acaaggtggcacttgcctgcccagcctgagatgctgtatgagggcggtgagggaaggatg
gagacgctgaaggataagaccctgcaggagctggaggagttgcagaatgactcggaggcg
attgaccagctggccctggagtcccctgaggtccaggacctacagctggaacgggagatg
gcactggccaccaaccggagcctggcagagcggaacttggagttccagggtcccctggag
atcagccgctcaaacctctcggatagataccaggagctccggaagctcgtggagcggtgc
caggagcagaaggcaaagctggagaaattttcttcagcactgcagccagggaccttgtta
gaccttctgcaggtggaaggcatgaagatcgaagaagagtccgaggccatggctgagaag
ttcctggagggcgaggtgcccctggaaacgttcctggagaatttttcctccatgaggatg
ctgtcccacctgcgccgggttcgcgtggaaaagctccaggaagtggtgaggaagcccagg
gcttcccaggagctggccggcgatgcccctccaccccgtccaccacccccgtcccaccca
cgctcaacctttggcccagccatcgccgagattcgagggtcaactggaagtggcgttggt
gctccctgtggacttgcgtcggcacccagaggccttggagggacaaggcactgggcggtg
tccaaggctgaggacacaggacatcagggacagctggaagcaatttggctttgtaggaag
aggctgcaaatctccctctag

>gi568815587f:61103565_61222400|GENSCAN_predicted_peptide_4|274_aa
MRRAGGGAPGATAGAAPTVVLCPARLRQSRDPPQSPLTCQGRRRRRHRGLNRSRAHTSGV
RQARPLPDVTPLQRPAPQWRACSPRDPVGRPASRASELPRHRATHTRSRPPGPQPRERPR
DPSSARRTPRPRPRRGCPHCVWQLPILLSCCHKMAGLATELEPDLRRKRKELQLWHLQTP
ESDLRKLAAGTAIEHSLTFPFSELPKPMPSHLKSHDPSLAERTSMLDFLHVNSVTLPINE
NFIPDSQPGCLMLNHSLNIYTAPGPDLCKNEMLP

>gi568815587f:61103565_61222400|GENSCAN_predicted_CDS_4|825_bp
atgaggcgggcaggcgggggcgctccgggagccacggccggcgccgcccccaccgtcgtc
ctctgccccgcccgcctgcggcagagccgggaccctccccagtctccgctcacctgccag
ggccgccgtcgccgccgccaccgcgggttaaaccgcagccgcgcccacacttccggggtc
cgccaggcccgccccctaccggatgtgacccccttgcaacgtccggcgccccagtggcgc
gcgtgctcgccccgcgaccccgttggccgccccgcctcgcgcgcctctgagttaccacga
catcgtgccacacacacacgctcgcggccgcccgggccacaaccacgggagcgcccacga
gacccctccagcgctcgccgcacgcctcggccccgcccccgccggggctgcccccactgt
gtctggcagcttccaatattgctttcctgctgccataagatggcaggtctggcaacagaa
ctggagccggacttaaggagaaagaggaaggaacttcagctctggcacctgcagaccccc
gaaagtgacctacggaagctggcagcagggactgcaattgagcattctctcacctttccc
ttctcagagctccccaagcctatgcccagccatcttaagtcacatgaccccagtcttgct
gagcgcacatcaatgttagactttctgcatgtgaacagtgtgacattgcccatcaatgaa
aacttcatacctgacagtcagcctggatgcttaatgctcaaccattcactcaacatctac
actgccccaggtccagatctttgtaaaaatgagatgctcccctag

>gi568815587f:61103565_61222400|GENSCAN_predicted_peptide_5|462_aa
MKWLLLLGLVALSECIMYKVPLIRKKSLRRTLSERGLLKDFLKKHNLNPARKYFPQWKAP
TLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNRFNP
EDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFD
GILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLSADDQSGSVVIFGGIDSSYYTGS
LNWVPVTVEGYWQITVDSITMNGEAIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS
ENSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQSEGSCISGFQGMNLPTESGEL
WILGDVFIRQYFTVFDRANNQLLLLVTADPGFLLKTQAQRLELGSRKGGDHSHYLSRKTT
SRALKWSQKREEQEAIRWILFELSTRFPVLFSIADEKLTHIE

>gi568815587f:61103565_61222400|GENSCAN_predicted_CDS_5|1389_bp
atgaagtggctgctgctgctgggtctggtggcactctctgagtgcatcatgtacaaggtc
cccctcatcagaaagaagtccttgaggcgcaccctgtccgagcgtggcctgctgaaggac
ttcctgaagaagcacaacctcaacccagccagaaagtacttcccccagtggaaggctccc
accctggtagatgaacagcccctggagaactacctggatatggagtacttcggcactatc
ggcatcggaactcctgcccaggatttcaccgtcgtctttgacaccggctcctccaacctg
tgggtgccctcagtctactgctccagtcttgcctgcaccaaccacaaccgcttcaaccct
gaggattcttccacctaccagtccaccagcgagacagtctccatcacctacggcaccggc
agcatgacaggcatcctcggatacgacactgtccaggttggaggcatctctgacaccaat
cagatcttcggcctgagcgagacggaacctggctccttcctgtattatgctcccttcgat
ggcatcctggggctggcctaccccagcatttcctcctccggggccacacccgtctttgac
aacatctggaaccagggcctggtttctcaggacctcttctctgtctacctcagcgccgat
gaccagagtggcagcgtggtgatctttggtggcattgactcttcttactacactggaagt
ctgaactgggtgcctgttaccgtcgagggttactggcagatcaccgtggacagcatcacc
atgaacggagaggccatcgcctgcgctgagggctgccaggccattgttgacaccggcacc
tctctgctgaccggcccaaccagccccattgccaacatccagagcgacatcggagccagc
gagaactcagatggcgacatggtggtcagctgctcagccatcagcagcctgcccgacatc
gtcttcaccatcaatggagtccagtaccccgtgccacccagtgcctacatcctgcagagc
gaggggagctgcatcagtggcttccagggcatgaacctccccaccgaatctggagagctt
tggatcctgggtgatgtcttcatccgccagtactttaccgtcttcgacagggcaaacaac
cagctcctgcttctggtgacagccgacccaggctttctgctgaagacacaagcccagagg
ctggagctgggctctaggaagggcggggatcactcacattacctgagcagaaaaacaact
agcagagcactgaaatggtcccaaaagagggaagagcaagaggcaatcaggtggatcctt
ttcgagttatcaacaaggtttccagtcctgttttctattgctgatgagaagttaacacac
attgaatga