GENSCAN 1.0    Date run: 5-Nov-116    Time: 21:42:44

Sequence gi568815575r:25545725_25746327 : 200603 bp : 37.08% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2978   3017   40  1  1   86   94    30 0.678   3.89
 1.02 Term +   7594   7841  248  2  2   20   39   289 0.815  12.17
 1.03 PlyA +  11272  11277    6                               1.05

 2.00 Prom +  36476  36515   40                              -5.35
 2.01 Init +  36663  36708   46  2  1   53  100    54 0.398   4.00
 2.02 Term +  44980  45095  116  2  2   68   48   117 0.394   3.45
 2.03 PlyA +  45857  45862    6                               1.05

 3.05 PlyA -  46057  46052    6                               1.05
 3.04 Term -  59341  59132  210  1  0   45   46   185 0.883   6.31
 3.03 Intr -  60503  60430   74  0  2   50   68    45 0.003  -3.09
 3.02 Intr -  76188  76084  105  1  0   55   64   123 0.252   5.87
 3.01 Init -  88259  88145  115  2  1   65   35   107 0.433   3.52
 3.00 Prom -  93867  93828   40                              -5.35

 4.02 PlyA -  94662  94657    6                               1.05
 4.01 Sngl - 100603  99998  606  1  0  100   35   840 0.996  75.84
 4.00 Prom - 101447 101408   40                              -5.25

 5.04 PlyA - 101591 101586    6                               1.05
 5.03 Term - 112003 111593  411  1  0   45   43   133 0.603  -1.24
 5.02 Intr - 112295 112066  230  0  2   -7   72   175 0.808   3.07
 5.01 Init - 112981 112831  151  1  1   84   62   125 0.985   9.75
 5.00 Prom - 113946 113907   40                              -5.45

 6.05 PlyA - 114289 114284    6                               1.05
 6.04 Term - 118378 118226  153  1  0   45   55   130 0.537   2.34
 6.03 Intr - 131904 131715  190  2  1  -18   82   193 0.011   6.77
 6.02 Intr - 137522 137490   33  1  0  135   82    58 0.030   6.32
 6.01 Init - 158437 158346   92  1  2   94   77    78 0.631   7.31
 6.00 Prom - 158826 158787   40                              -9.55

 7.00 Prom + 158889 158928   40                              -7.35
 7.01 Sngl + 159527 160045  519  1  0   53   43   278 0.522  15.59
 7.02 PlyA + 161560 161565    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 131816 131715  102  2  0   82   82   123 0.808  11.39

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:25545725_25746327|GENSCAN_predicted_peptide_1|95_aa
MVEGEAGMYYMMAGRHPSGNVQRLQPMQVIRMEDVNVIGLQNHSLGRNLNFTPIPYLIQE
LTVYPYYSKCDFQILSIDKMWEPVGHEESQALHQT

>gi568815575r:25545725_25746327|GENSCAN_predicted_CDS_1|288_bp
atggtggaaggggaagcaggcatgtattacatgatggcaggcaggcatccaagtggaaat
gttcagagactgcagcccatgcaagtgatcaggatggaggatgtgaatgtcatcggacta
cagaatcatagtttaggaaggaatctcaattttactccaatcccctacctaatacaggaa
ctgactgtgtacccttattactcaaagtgtgatttccagatcctcagcattgacaagatg
tgggagcccgttggacatgaagaatctcaggccctacaccagacctaa

>gi568815575r:25545725_25746327|GENSCAN_predicted_peptide_2|53_aa
MRELMQMAFERPSDEALAVGVLPAQQQQPEFPSHLRRCASGSQDSMQSVKGKG

>gi568815575r:25545725_25746327|GENSCAN_predicted_CDS_2|162_bp
atgcgtgaactgatgcagatggcttttgagaggcccagtgatgaagccctggctgtggga
gtgctcccagctcagcagcagcaaccggagtttccttcacacctcagacgctgtgcttct
gggtcccaggatagcatgcagtctgttaaaggcaaaggttga

>gi568815575r:25545725_25746327|GENSCAN_predicted_peptide_3|167_aa
MAVAVVEQPSGIYTDCAHVGSTCSGLLSQSSGLQLAPVALTIANFHLFARIQSRVAFPLK
VRYLTFEGDVIRRDHLQTYLTTCSDSTKHTGPVCLQRARDQHENSRNSENPYKIPDKATI
SKKHSHSHQIVQGQCERKILKAATEKKQVTYKENPIRLIADLLSETL

>gi568815575r:25545725_25746327|GENSCAN_predicted_CDS_3|504_bp
atggcagtagcagtggtggaacaaccctctgggatctacacagactgtgctcatgttggt
agtacctgcagtgggctgctgagccagtcctcaggcctacagctggcacctgtggctctc
accattgccaattttcatttgtttgccagaattcagtccagagtagcatttcccctcaag
gtgcggtatctaacttttgaaggtgatgttattcgcagagaccacctgcagacatacctc
acaacctgctctgactctaccaaacacacaggaccagtgtgcctccagagagctagagat
caacatgaaaattcaagaaattcagagaacccctacaagataccagacaaggcgaccatc
tctaagaaacatagtcatagtcatcagattgtccaaggtcaatgtgaaagaaaaatctta
aaggcagctacagagaagaaacaggtcacttacaaagagaaccccatcaggctaatagca
gacctcttatcagaaaccttataa

>gi568815575r:25545725_25746327|GENSCAN_predicted_peptide_4|201_aa
MAATKDTHEDHDTSTENTDELNHDPQFEPIVSLPEQEIKTLEEDEEELFKMRAKWLRFAS
ENDLPEWKERGTGDVKLLKHKEKGAIRLLMQRDKTLKICANHYTTPMMELKPNAGSDRAW
VWNTHADFADECPKPELLAIRFLNAENAQKFKTKFEECRKEIEEREKKAGSGKNDHAEKV
AEKLEALSVKEDTKEDAEEKQ

>gi568815575r:25545725_25746327|GENSCAN_predicted_CDS_4|606_bp
atggcggccaccaaggacactcatgaggaccatgatacttccactgagaatacagacgag
ttgaaccatgaccctcagtttgagccaatagtttctcttcctgagcaagaaattaaaacg
ctggaagaagatgaagaggaactttttaaaatgcgggcaaaatggttgcgatttgcctcc
gagaacgatctcccagaatggaaggagcgaggcactggtgacgtcaagctcctgaagcac
aaggagaaaggggccatccgcctcctcatgcagagggacaaaaccctgaagatctgtgcc
aaccactacaccacgccgatgatggagctgaagcccaacgcaggtagcgaccgtgcctgg
gtctggaacacccacgctgacttcgccgacgagtgccccaagccagagctgctggccatc
cgcttcctgaatgctgagaatgcacagaaattcaaaacaaagtttgaagaatgcaggaaa
gagatcgaagagagagaaaagaaagcaggatcaggcaaaaacgatcatgccgaaaaagtg
gcggaaaagctagaagctctctcggtgaaggaagacaccaaggaggatgctgaggagaag
caataa

>gi568815575r:25545725_25746327|GENSCAN_predicted_peptide_5|263_aa
MSKKQMSLESFFEKGERPKTAEDSKTAKKKKAAFTRKYQESYLNYGFTATVANHIDKAKD
PFTVVEELILPAATDICHDLFGEAAVQKVALVLLSASTITRRIDKIAEDIEEQLLERINE
SPWYMIQEDVHEDMLCTLLFPTNNTAAELFKSLNDYISEELNWSFCVGICTDGTAAMTGW
LSGFTTQVKEVASECESTHWFIHKEMLASRKMSPELNNVMQDMIKIINYFKVHAFNFYLF
AQLCEEMDAQHTSSLVHRSEMAF

>gi568815575r:25545725_25746327|GENSCAN_predicted_CDS_5|792_bp
atgagtaaaaaacaaatgtcgctggagagcttctttgaaaagggggaaaggcccaagaca
gcagaagactctaagactgccaagaaaaagaaagctgcatttacaagaaaataccaagag
tcttacttgaattatgggttcactgcaacagtggctaaccacattgacaaagctaaggat
ccctttactgttgttgaagagttgatcctgcctgctgctacggacatttgtcatgacctt
ttcggagaggctgcagttcaaaaggtggcacttgttcttctttcggctagcaccataact
agacgaattgataaaatagcagaggatattgaggaacaattgttagagaggattaacgag
tcaccatggtacatgatccaggaggatgtgcacgaagatatgttatgtacacttttgttc
ccaactaacaatacagctgcagaactattcaagtctttgaatgattacatatcagaagaa
ctgaattggtcattttgtgttggtatatgcacagacggaacagctgccatgactggatgg
ctttctggtttcactactcaagtcaaagaggttgcttctgaatgtgaatccacacactgg
ttcatccataaagaaatgctggctagccgaaaaatgtcacccgaacttaacaatgttatg
caggatatgattaaaattatcaactactttaaagtacatgcctttaacttttatctgttt
gcgcagctctgtgaggagatggacgcacagcacacgtcttctcttgtacacagaagtgaa
atggctttctaa

>gi568815575r:25545725_25746327|GENSCAN_predicted_peptide_6|155_aa
MSKDHHRKLRHLVVDPSKELDVEQPHGLKQRSLRPGASSNPCLIQLWPAKTGLNKLHPTK
NGSNQSKPKSVMASQNHSNLVAACLNQCKLAVASHNRTKPVVAKQQPILVLIKPKISVFK
ADTIKRGQKHLVGVELSRHFVQGDGSCKREAKRQL

>gi568815575r:25545725_25746327|GENSCAN_predicted_CDS_6|468_bp
atgagcaaagaccatcatagaaaactgagacacttagttgtggatccaagcaaagagctt
gatgtcgaacagcctcatggactaaagcaaagatctttgagacctggtgccagcagcaat
ccctgtctaatccagttgtggccagccaaaactggtctaaacaagttgcatccaaccaag
aatggttcaaaccagtcaaagcctaagtcagtcatggctagtcaaaaccattcaaatctg
gttgcagcttgtctaaaccagtgtaaactggctgtggccagtcacaaccggactaaacca
gttgtggctaaacagcagcctattttggttttgataaaacccaagatttctgtattcaag
gctgacaccataaaacgggggcaaaaacatctggtgggtgtagaattgagtcgtcatttt
gtgcagggcgatggcagttgtaagagagaagcaaaacgacagctgtga

>gi568815575r:25545725_25746327|GENSCAN_predicted_peptide_7|172_aa
MSKQQSIEDVTWIILKAFSFMHSQRDGLKLELMFKREAEHKCSENLQPDNVIEKKISFSE
GKFKQAAEICISNEKLNANPQDNGENVSRACQRPWQQPLPSQAQRTRRKNGFLGWAQGPC
CSVLPQDMAPCFQAASAPVVAKRGQGTAQVVASEGASPKPWQVPRVAGPTGA

>gi568815575r:25545725_25746327|GENSCAN_predicted_CDS_7|519_bp
atgtctaagcagcaaagcattgaagatgtgacctggattattctaaaagcattcagtttt
atgcattcacaaagagatggtttgaaattggaacttatgtttaaaagggaagcagagcat
aaatgttcagaaaatttgcagcctgacaatgtgatagaaaagaaaatctcattttctgag
gggaaattcaagcaggctgcagaaatttgcataagtaacgagaaactgaatgctaatccc
caagacaatggggaaaatgtctccagggcatgtcagagaccttggcagcagcccctgcca
tcacaggcccagaggactaggaggaaaaatggcttcctgggctgggcccagggcccctgc
tgctctgtactgcctcaggacatggcaccctgcttccaagctgcttcagctccagtcgtg
gctaaaaggggccaaggtacagctcaggtggttgcttcagagggtgcaagccccaagcct
tggcaggttccacgtgttgctgggcctacaggtgcatag