GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:16:38 Sequence gi568815576f:40964052_41164633 : 200582 bp : 45.47% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 557 552 6 1.05 1.05 Term - 13388 13190 199 1 1 116 47 107 0.674 6.27 1.04 Intr - 32554 32443 112 2 1 73 110 20 0.393 2.14 1.03 Intr - 39329 39292 38 1 2 105 86 49 0.352 4.31 1.02 Intr - 45637 45543 95 1 2 102 49 56 0.215 1.86 1.01 Init - 46945 46916 30 1 0 34 102 18 0.241 -2.33 1.00 Prom - 52424 52385 40 -4.96 2.00 Prom + 57557 57596 40 -2.06 2.01 Init + 65715 65762 48 2 0 65 97 29 0.308 2.45 2.02 Intr + 70470 70580 111 2 0 50 79 92 0.394 5.08 2.03 Term + 85324 85551 228 0 0 87 49 102 0.201 2.83 2.04 PlyA + 90075 90080 6 1.05 3.00 Prom + 94742 94781 40 -3.76 3.01 Sngl + 100001 100585 585 1 0 99 46 737 0.996 66.79 3.02 PlyA + 100633 100638 6 1.05 4.04 PlyA - 102101 102096 6 1.05 4.03 Term - 110637 110113 525 0 0 -55 43 1090 0.873 84.46 4.02 Intr - 110910 110731 180 0 0 -30 51 294 0.296 13.96 4.01 Init - 111188 111057 132 2 0 95 52 164 0.231 13.64 4.00 Prom - 112094 112055 40 -5.76 5.00 Prom + 121318 121357 40 -6.56 5.01 Init + 128954 129047 94 1 1 71 92 126 0.859 11.84 5.02 Intr + 153136 153770 635 2 2 119 115 326 0.995 30.25 5.03 Intr + 161813 161989 177 1 0 72 101 180 0.988 17.82 5.04 Intr + 163436 163697 262 1 1 74 115 191 0.985 17.46 5.05 Intr + 167337 167582 246 1 0 93 80 175 0.694 14.63 5.06 Intr + 171762 171855 94 1 1 57 106 -1 0.214 -2.38 5.07 Intr + 173602 173739 138 1 0 24 93 171 0.231 10.78 5.08 Intr + 176089 176206 118 1 1 66 72 71 0.985 3.77 5.09 Intr + 176997 177171 175 2 1 22 94 141 0.988 7.61 5.10 Intr + 183786 183895 110 1 2 63 65 49 0.550 0.10 5.11 Intr + 184987 185124 138 0 0 104 55 40 0.599 2.96 5.12 Intr + 187782 187961 180 2 0 94 78 181 0.858 17.76 5.13 Intr + 188155 188299 145 0 1 48 70 85 0.892 2.56 5.14 Intr + 190944 191062 119 1 2 72 95 40 0.895 3.28 5.15 Intr + 193118 193357 240 1 0 112 116 210 0.999 23.95 5.16 Intr + 194313 194449 137 2 2 100 115 36 0.554 6.87 5.17 Intr + 196591 196671 81 1 0 65 111 16 0.352 0.35 5.18 Intr + 198672 198728 57 0 0 93 93 16 0.060 0.60 5.19 Intr + 200002 200079 78 1 0 86 106 -2 0.047 0.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:40964052_41164633|GENSCAN_predicted_peptide_1|157_aa MPLKDSVLQKPLRGQLGPEPVPTPMASLFQACLPGEVQDAAGFSTWYKKSGPEKDLFTWL FESWLLVNKLVLNPALHTTVFPCLGFARGPSRAPTTLEQVIVCPGDLVWDSPPMSDDCLH QVSQLACHIPKEDFPAPLLEVATQFYSMTPQPSSLST >gi568815576f:40964052_41164633|GENSCAN_predicted_CDS_1|474_bp atgcctctgaaagacagtgtcctgcaaaagcctttgcgaggtcagcttggccctgagcct gtgcccacacccatggcctccctctttcaggcctgtttacctggtgaggtgcaggatgct gcagggttctccacctggtacaagaagagtggcccagagaaagacctttttacttggctc ttcgaaagctggctcttggtcaacaaacttgtcctcaaccccgcccttcacaccacagtc ttcccctgcctggggtttgctcgggggcccagcagagctccaaccacactggagcaagtc atcgtctgccccggggacttggtctgggactccccacctatgtcagatgactgtcttcat caagtctcacaacttgcctgtcatatccccaaagaggacttccctgcccctctgttagaa gtggccacacagttttactctatgacacctcaaccctcttcactgtctacgtga >gi568815576f:40964052_41164633|GENSCAN_predicted_peptide_2|128_aa MLTFYQTSSFKVMGYKHFLTARVQAEHVPVARESPAGFESWCMQQEALGYTGTPTHHGEE QPLTMPTCLVSPVIYGPVGPSCWPWTQQLLKVKQALCEGCKSQDGPSAVVPSWGQGGDHV ASLKTNAS >gi568815576f:40964052_41164633|GENSCAN_predicted_CDS_2|387_bp atgctaacattctatcagaccagcagcttcaaagtgatgggctataagcacttccttact gcccgtgttcaggctgagcatgttcctgtggccagagaaagccctgcaggattcgagtcc tggtgcatgcagcaggaagccttggggtacacaggaacacccacccaccatggcgaggag cagccgctgactatgccgacctgtttagtgtcgccagtgatttatggcccagtggggccc agctgttggccgtggacacagcagctgctcaaggttaagcaggccctgtgtgagggatgc aagtcacaggatggacccagtgcagttgttccctcttggggtcaagggggcgatcatgtt gccagtctgaaaaccaacgcaagctaa >gi568815576f:40964052_41164633|GENSCAN_predicted_peptide_3|194_aa MPVARSWVCRKTYVTPRRSFEKSRLDQELKLIEEYGLRNKREVWRVKFTLAKIRKAAREL LTLDQKDPRRLFEGNALLWRLVCIGVLDEGKMKLDYILGLKIEDFLERRLQTQVFKLGLA KSIHHARVLIRQRHFRVRKQVVNIPSFIVRLDSQKHIDFSLCSPYGGGRPGRVKRKNAKK GQGGAGAGDHKEED >gi568815576f:40964052_41164633|GENSCAN_predicted_CDS_3|585_bp atgccagtggcccggagctgggtttgtcgcaaaacttacgtgaccccgcggagatccttt gagaaatctcgtctcgaccaagagctgaagctgatcgaagagtatgggctccggaataaa cgtgaggtctggagggtcaaatttaccctggccaagatccgcaaggccgcccgggaactg ctgacgcttgatcagaaggacccacggcgtctgttcgaaggcaatgccctgctgtggcgg ctggtctgcattggggtgctggatgagggcaagatgaagctggattacatcctgggcctg aagatagaggatttcttagagagacgcctacagacccaggtcttcaagctgggcttggcc aagtccatccaccacgctcgcgtgctgatccgccagcgccatttcagggtccgcaagcag gtggtgaacatcccgtccttcattgtccgcctggattcccagaagcacatcgacttctct ctgtgctctccctatgggggtggccgcccgggccgcgtgaagaggaagaatgccaagaag ggccagggtggggctggggctggagaccacaaggaggaggattaa >gi568815576f:40964052_41164633|GENSCAN_predicted_peptide_4|278_aa MDDDIAALVVDNSSGMCKAGFAGDDVPWAIFPSIVGCTRHQVMMLHVAPEEHPVLLTEAP LNPKANCEKMTQIMFETFNTPAMYVAIQAMLSLYASVRTAHVQGLHTTAEREIVRDIKKL CYVALDFEREMAMVASSSSLEKSYKLLDGQVITIGNERFHCPEALFQPSFLGMESCGIHK TTFNYIMKCDVDICKDLYTNTVLSGGTTMYPGITNRMQKKITALAPSMMKIKIIAPPERK YSVWISGSILASLSTFQQMWISKQKYDQSGPSIVHECF >gi568815576f:40964052_41164633|GENSCAN_predicted_CDS_4|837_bp atggatgatgatattgctgcgctcgttgttgacaacagctctggcatgtgcaaggccggc ttcgcgggtgacgatgtcccctgggccatcttcccttccatcgtggggtgcaccaggcac caggtcatgatgctgcatgtggctcccgaggagcaccctgtgctgctgaccgaggccccc ctgaaccccaaagccaactgtgagaagatgacccagatcatgtttgagaccttcaacacc ccagccatgtatgtggccatccaggccatgctgtccctgtacgcctccgtccgtaccgcc catgtacaagggcttcacaccacggccgagcgggaaatcgtgcgtgacatcaagaagctg tgctacgtcgccctggacttcgagcgggagatggccatggtggcctccagctcctccctg gagaagagctacaagctgctcgatggccaggtcatcaccatcggcaacgagcggttccac tgccccgaggcgctcttccagccttccttcctgggcatggaatcctgtggcatccacaaa actaccttcaactacatcatgaagtgtgacgtggacatctgcaaagacctgtacaccaac acagtgctgtctggcggcaccaccatgtaccctggcatcaccaacaggatgcagaagaag atcaccgccctagcacccagcatgatgaagatcaagatcattgctcctcccgagcgcaag tactccgtgtggatcagcggctccatcctggcctcgctgtctaccttccagcagatgtgg atcagcaagcagaagtatgaccagtccggcccctccatcgtccacgaatgcttctag >gi568815576f:40964052_41164633|GENSCAN_predicted_peptide_5|1075_aa MAENVVEPGPPSAKRPKLSSPALSASASDGTDFGSLFDLEHDLPDELINSTELGLTNGGD INQLQTSLGMVQDAASKHKQLSELLRSGSSPNLNMGVGGPGQVMASQAQQSSPGLGLINS MVKSPMTQAGLTSPNMGMGTSGPNQGPTQSTGMMNSPVNQPAMGMNTGMNAGMNPGMLAA GNGQGIMPNQVMNGSIGAGRGRQNMQYPNPGMGSAGNLLTEPLQQGSPQMGGQTGLRGPQ PLKMGMMNNPNPYGSPYTQNPGQQIGASGLGLQIQTKTVLSNNLSPFAMDKKAVPGGGMP NMGQQPAPQVQQPGLVTPVAQGMGSGAHTADPEKRKLIQQQLVLLLHAHKCQRREQANGE VRQCNLPHCRTMKNVLNHMTHCQSGKSCQAILTGAPVGLGNPSSLGVGQQSAPNLSTVSQ IDPSSIERAYAALGLPYQVNQMPTQPQVQAKNQQNQQPGQSPQGMRPMSNMSASPMGVNG GVGVQTPSLLSDSMLHSAINSQNPMMSENASVPSLGPMPTAAQPSTTGIRKQWHEDITQD LRNHLVHKLVQAIFPTPDPAALKDRRMENLVAYARKVEGDMYESANNRAEYYHLLAEKIY KIQKELEEKRRTRLQKQNMLPNAAGMVPVSMNPGPNMGQPQPGMTSSLNQFGQMSMAQPP IVPRQTPPLQHHGQLAQPGALNPPMGYGPRMQQPSNQGQFLPQTQFPSQGMNVTNIPLAP SSGQAPVSQLSQPAVSIEGQVSNPPSTSSTEVNSQAIAEKQPSQEVKMEAKMEVDQPEPA DTQPEDISESKVEDCKMESTETEERSTELKTEIKEEEDQPSTSATQSSPAPGQSKKKIFK PEELRQALMPTLEALYRQDPESLPFRQPVDPQLLGIPDYFDIVKSPMDLSTIKRKLDTGQ YQEPWQYVDDIWLMFNNAWLYNRKTSRVYKYCSKLSEVFEQEIDPVMQSLGYCCGRKLVL KASVLFNKWFLLQLEFSPQTLCCYGKQLCTIPRDATYYSYQNRYHFCEKCFNEIQGESVS LGDDPSQPQTTINKEQFSKRKNDTLDPELFVECTECGRKMHQICVLHHEIIWPAG >gi568815576f:40964052_41164633|GENSCAN_predicted_CDS_5|3225_bp atggccgagaatgtggtggaaccggggccgccttcagccaagcggcctaaactctcatct ccggccctctcggcgtccgccagcgatggcacagattttggctctctatttgacttggag cacgacttaccagatgaattaatcaactctacagaattgggactaaccaatggtggtgat attaatcagcttcagacaagtcttggcatggtacaagatgcagcttctaaacataaacag ctgtcagaattgctgcgatctggtagttcccctaacctcaatatgggagttggtggccca ggtcaagtcatggccagccaggcccaacagagcagtcctggattaggtttgataaatagc atggtcaaaagcccaatgacacaggcaggcttgacttctcccaacatggggatgggcact agtggaccaaatcagggtcctacgcagtcaacaggtatgatgaacagtccagtaaatcag cctgccatgggaatgaacacagggatgaatgcgggcatgaatcctggaatgttggctgca ggcaatggacaagggataatgcctaatcaagtcatgaacggttcaattggagcaggccga gggcgacagaatatgcagtacccaaacccaggcatgggaagtgctggcaacttactgact gagcctcttcagcagggctctccccagatgggaggacaaacaggattgagaggcccccag cctcttaagatgggaatgatgaacaaccccaatccttatggttcaccatatactcagaat cctggacagcagattggagccagtggccttggtctccagattcagacaaaaactgtacta tcaaataacttatctccatttgctatggacaaaaaggcagttcctggtggaggaatgccc aacatgggtcaacagccagccccgcaggtccagcagccaggcctggtgactccagttgcc caagggatgggttctggagcacatacagctgatccagagaagcgcaagctcatccagcag cagcttgttctccttttgcatgctcacaagtgccagcgccgggaacaggccaatggggaa gtgaggcagtgcaaccttccccactgtcgcacaatgaagaatgtcctaaaccacatgaca cactgccagtcaggcaagtcttgccaagcaattttgactggagcacccgttggacttgga aatcctagctctctaggggtgggtcaacagtctgcccccaacctaagcactgttagtcag attgatcccagctccatagaaagagcctatgcagctcttggactaccctatcaagtaaat cagatgccgacacaaccccaggtgcaagcaaagaaccagcagaatcagcagcctgggcag tctccccaaggcatgcggcccatgagcaacatgagtgctagtcctatgggagtaaatgga ggtgtaggagttcaaacgccgagtcttctttctgactcaatgttgcattcagccataaat tctcaaaacccaatgatgagtgaaaatgccagtgtgccctccctgggtcctatgccaaca gcagctcaaccatccactactggaattcggaaacagtggcacgaagatattactcaggat cttcgaaatcatcttgttcacaaactcgtccaagccatatttcctacgccggatcctgct gctttaaaagacagacggatggaaaacctagttgcatatgctcggaaagttgaaggggac atgtatgaatctgcaaacaatcgagcggaatactaccaccttctagctgagaaaatctat aagatccagaaagaactagaagaaaaacgaaggaccagactacagaagcagaacatgcta ccaaatgctgcaggcatggttccagtttccatgaatccagggcctaacatgggacagccg caaccaggaatgacttctagtttgaatcaatttggccagatgagcatggcccagccccct attgtaccccggcaaacccctcctcttcagcaccatggacagttggctcaacctggagct ctcaacccgcctatgggctatgggcctcgtatgcaacagccttccaaccagggccagttc cttcctcagactcagttcccatcacagggaatgaatgtaacaaatatccctttggctccg tccagcggtcaagctccagtgtctcaactttcccagccagctgtaagcattgaaggacag gtatcaaatcctccatctactagtagcacagaagtgaattctcaggccattgctgagaag cagccttcccaggaagtgaagatggaggccaaaatggaagtggatcaaccagaaccagca gatactcagccggaggatatttcagagtctaaagtggaagactgtaaaatggaatctacc gaaacagaagagagaagcactgagttaaaaactgaaataaaagaggaggaagaccagcca agtacttcagctacccagtcatctccggctccaggacagtcaaagaaaaagattttcaaa ccagaagaactacgacaggcactgatgccaactttggaggcactttaccgtcaggatcca gaatcccttccctttcgtcaacctgtggaccctcagcttttaggaatccctgattacttt gatattgtgaagagccccatggatctttctaccattaagaggaagttagacactggacag tatcaggagccctggcagtatgtcgatgatatttggcttatgttcaataatgcctggtta tataaccggaaaacatcacgggtatacaaatactgctccaagctctctgaggtctttgaa caagaaattgacccagtgatgcaaagccttggatactgttgtggcagaaagcttgtcctt aaggcctctgtgctttttaacaaatggtttcttttgcagttggagttctctccacagaca ctgtgttgctacggcaaacagttgtgcacaatacctcgtgatgccacttattacagttac cagaacaggtatcatttctgtgagaagtgtttcaatgagatccaaggggagagcgtttct ttgggggatgacccttcccagcctcaaactacaataaataaagaacaattttccaagaga aaaaatgacacactggatcctgaactgtttgttgaatgtacagagtgcggaagaaagatg catcagatctgtgtccttcaccatgagatcatctggcctgctggn