GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:51:16 Sequence gi568815583r:50178635_50409798 : 231164 bp : 41.83% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3794 4271 478 1 1 83 99 705 0.842 64.72 1.02 Intr + 18866 19075 210 0 0 13 98 149 0.895 6.36 1.03 Intr + 23853 24011 159 1 0 67 107 129 0.746 11.74 1.04 Intr + 26605 26729 125 2 2 113 92 109 0.996 13.18 1.05 Intr + 29404 29573 170 0 2 74 60 152 0.991 8.82 1.06 Intr + 30608 30773 166 2 1 72 86 17 0.641 -1.06 1.07 Intr + 31699 31824 126 0 0 51 75 117 0.935 6.66 1.08 Intr + 44331 44525 195 2 0 96 106 177 0.999 18.99 1.09 Intr + 47354 47444 91 1 1 39 87 144 0.999 8.05 1.10 Intr + 48346 48544 199 2 1 95 28 187 0.753 10.89 1.11 Intr + 50311 50408 98 1 2 130 76 105 0.750 12.23 1.12 Intr + 55234 55364 131 2 2 99 94 95 0.975 10.69 1.13 Term + 57286 57462 177 0 0 60 42 194 0.939 8.70 1.14 PlyA + 59485 59490 6 1.05 2.14 PlyA - 62965 62960 6 1.05 2.13 Term - 64372 63626 747 1 0 93 42 622 0.995 50.56 2.12 Intr - 64610 64509 102 2 0 79 85 116 0.948 9.85 2.11 Intr - 69709 69611 99 1 0 101 50 74 0.837 4.29 2.10 Intr - 73886 73796 91 1 1 65 101 96 0.632 7.68 2.09 Intr - 74140 73978 163 2 1 99 105 209 0.981 21.81 2.08 Intr - 75032 74966 67 2 1 127 84 42 0.974 5.26 2.07 Intr - 75639 75496 144 0 0 98 78 145 0.998 14.06 2.06 Intr - 76030 75896 135 1 0 104 109 193 0.988 22.84 2.05 Intr - 76994 76915 80 0 2 95 80 -47 0.876 -6.45 2.04 Intr - 79067 78791 277 2 1 72 82 258 0.947 19.77 2.03 Intr - 79883 79770 114 2 0 104 92 6 0.512 2.32 2.02 Intr - 84773 84601 173 0 2 105 64 230 0.716 21.04 2.01 Init - 86986 86899 88 1 1 78 68 107 0.543 8.55 2.00 Prom - 92812 92773 40 -4.05 3.11 PlyA - 96352 96347 6 1.05 3.10 Term - 100150 99998 153 1 0 54 32 253 0.998 13.24 3.09 Intr - 107549 107434 116 0 2 64 72 176 0.973 12.85 3.08 Intr - 111034 110849 186 2 0 100 96 243 0.965 25.04 3.07 Intr - 122268 122155 114 1 0 102 66 106 0.958 9.30 3.06 Intr - 122734 122587 148 1 1 87 72 103 0.963 7.49 3.05 Intr - 124489 124295 195 1 0 34 93 185 0.999 12.29 3.04 Intr - 125499 125332 168 0 0 65 95 165 0.998 14.12 3.03 Intr - 131164 131057 108 1 0 55 38 155 0.237 6.76 3.02 Intr - 167761 167738 24 1 0 123 113 -9 0.120 2.40 3.01 Init - 176097 175957 141 0 0 102 36 187 0.165 13.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:50178635_50409798|GENSCAN_predicted_peptide_1|774_aa MLSAIYTVLAGLLFLPLLVNLCCPYFFQDIGYFLKVAAVGRRVRSYGKRRPARTILRAFL EKARQTPHKPFLLFRDETLTYAQVDRRSNQVARALHDHLGLRQGDCVALLMGNEPAYVWL WLGLVKLGCAMACLNYNIRAKSLLHCFQCCGAKVLLVSPELQAAVEEILPSLKKDDVSIY YVSRTSNTDGIDSFLDKVDEVSTEPIPESWRSEVTFSTPALYIYTSGTTGLPKAAMITHQ RIWYGTGLTFVSGLKADDVIYITLPFYHSAALLIGIHGCIVAGATLALRTKFSASQFWDD CRKYNVTVIQYIGELLRYLCNSPQSCLEYAVAGGTNPTSDPQALDAREATSALAEQLNQV TPIHRHIWPSSSGESGASVGSLKFLSLHQLHSVQKREWMDLLQHRVESGQYDWGPRQREA KRKREGEQNQEGSPGKKWERETLISRTHTGEAEGLFVELPDFTWSLVKLESQTEQNTKKP NDRDHKVRLALGNGLRGDVWRQFVKRFGDICIYEFYAATEGNIGFMNYARKVGAVGRVNY LQKKIITYDLIKYDVEKDEPVRDENGYCVRVPKGEVGLLVCKITQLTPFNGYAGAKAQTE KKKLRDVFKKGDLYFNSGDLLMVDHENFIYFHDRVGDTFRWKGENVATTEVADTVGLVDF VQEVNVYGVHVPDHEGRIGMASIKMKENHEFDGKKLFQHIADYLPSYARPRFLRIQDTIE ITGTFKHRKMTLVEEGFNPAVIKDALYFLDDTAKMYVPMTEDIYNAISAKTLKL >gi568815583r:50178635_50409798|GENSCAN_predicted_CDS_1|2325_bp atgctttccgccatctacacagtcctggcgggactgctgttcctgccgctcctggtgaac ctctgctgcccatacttcttccaggacataggctacttcttgaaggtggccgccgtgggc cggagggtgcgcagctacgggaagcggcggccggcgcgcaccatcctgcgggcgttcctg gagaaagcgcgccagacgccacacaagccttttctgctcttccgcgacgagactctcacc tacgcgcaggtggaccggcgcagcaatcaagtggcccgggcgctgcacgaccacctcggc ctgcgccagggagactgcgtggcgctccttatgggtaacgagccggcctacgtgtggctg tggctggggctggtgaagctgggctgtgccatggcgtgcctcaattacaacatccgcgcg aagtccctgctgcactgcttccagtgctgcggggcgaaggtgctgctggtgtcgccagaa ctacaagcagctgtcgaagagatactgccaagccttaaaaaagatgatgtgtccatctat tatgtgagcagaacttctaacacagatgggattgactctttcctggacaaagtggatgaa gtatcaactgaacctatcccagagtcatggaggtctgaagtcactttttccactcctgcc ttatacatttatacttctggaaccacaggtcttccaaaagcagccatgatcactcatcag cgcatatggtatggaactggcctcacttttgtaagcggattgaaggcagatgatgtcatc tatatcactctgcccttttaccacagtgctgcactactgattggcattcacggatgtatt gtggctggtgctactcttgccttgcggactaaattttcagccagccagttttgggatgac tgcagaaaatacaacgtcactgtcattcagtatatcggtgaactgcttcggtatttatgc aactcaccacagagctgcctagaatacgctgttgctggagggactaaccccaccagcgac ccccaagcactggatgccagggaggcaacctctgccttagcagaacagctaaaccaagtg acacccatccacagacatatatggcccagctcatcaggagaatctggggccagtgttggg agcctaaaatttctgtctctgcatcagcttcattctgttcaaaaaagagaatggatggat ctgttgcagcacagagtggagagtggccaatatgactggggcccaagacagagggaggcc aaaagaaagagggaaggggagcagaatcaggaaggttcgccagggaagaagtgggaaagg gagaccctcatctcccgaacacatactggagaagctgaaggtctgtttgtagaacttccc gactttacctggagcctcgtcaagttagagagccaaaccgagcaaaatacaaagaaacca aatgaccgtgatcataaagtgagactggcactgggaaatggcttacgaggagatgtgtgg agacaatttgtcaagagatttggggacatatgcatctatgagttctatgctgccactgaa ggcaatattggatttatgaattatgcgagaaaagttggtgctgttggaagagtaaactac ctacagaaaaaaatcataacttatgacctgattaaatatgatgtggagaaagatgaacct gtccgtgatgaaaatggatattgcgtcagagttcccaaaggtgaagttggacttctggtt tgcaaaatcacacaacttacaccatttaatggctatgctggagcaaaggctcagacagag aagaaaaaactgagagatgtctttaagaaaggagacctctatttcaacagtggagatctc ttaatggttgaccatgaaaatttcatctatttccacgacagagttggagatacattccgg tggaaaggggaaaatgtggccaccactgaagttgctgatacagttggactggttgatttt gtccaagaagtaaatgtttatggagtgcatgtgccagatcatgagggtcgcattggcatg gcctccatcaaaatgaaagaaaaccatgaatttgatggaaagaaactctttcagcacatt gctgattacctacctagttatgcaaggccccggtttctaagaatacaggacaccattgag atcactggaacttttaaacaccgcaaaatgaccctggtggaggagggctttaaccctgct gtcatcaaagatgccttgtatttcttggatgacacagcaaaaatgtatgtgcctatgact gaggacatctataatgccataagtgctaaaaccctgaaactctga >gi568815583r:50178635_50409798|GENSCAN_predicted_peptide_2|759_aa MEPEEYRERGEQRASLALFPAATPAGTAAGREMVDYICQYLSTVRERRVTPDVQPGYLRA QLPESAPEDPDSWDSIFGDIERIIMPGVVHWQSPHMHAYYPALTSWPSLLGDMLADAINC LGFTWRVDQERGTWPNMTPSPEMFPLTPSGADMFLELVGAPQCPGAVQFEVPFPFSGIQP CVYRAGDERHGLVGKNAGTSRALLAPPPQQPGRRRPAETGSHYVVQAGLKLLDSRDPPKV LGLQSTVSESTLIALLAARKNKILEMKTSEPDADESCLNARLVAYASDQAHSSVEKAGLI SLVKMKFLPVDDNFSLRGEALQKAIEEDKQRGLVPVFVCATLGTTGVCAFDCLSELGPIC AREGLWLHIDAAYAGTAFLCPEFRGFLKGIEYADSFTFNPSKWMMVHFDCTGFWVKDKYK LQQTFSVNPIYLRHANSGVATDFMHWQIPLSRRFRSVKLWFVIRSFGVKNLQAHVRHGTE MAKYFESLVRNDPSFEIPAKRHLGLVVFRLKGPNCLTENVLKEIAKAGRLFLIPATIQDK LIIRFTVTSQFTTRDDILRDWNLIRDAATLILSQHCTSQPSPRVGNLISQIRGARAWACG TSLQSVSGAGDDPVQARKIIKQPQRVGAGPMKRENGLHLETLLDPVDDCFSEEAPDATKH KLSSFLFSYLSVQTKKKTVRSLSCNSVPVSAQKPLPTEASVKNGGSSRVRIFSRFPEDMM MLKKSAFKKLIKFYSVPSFPECSSQCGLQLPCCPLQAMV >gi568815583r:50178635_50409798|GENSCAN_predicted_CDS_2|2280_bp atggagcctgaggagtacagagagagaggtgagcaacgggcatccctggccctcttccct gctgctactcctgcaggaacggctgctgggagagagatggtggattacatctgccagtac ctgagcactgtgcgggagagacgtgtgacgccagacgtgcagcctggctacctgcgagcc cagctgcctgagagtgctcctgaggaccccgacagctgggacagcatctttggggacatt gaacgaatcatcatgcctggggtggtacattggcagagcccccatatgcacgcctactac ccagccctcacctcttggccctccctgctaggagacatgctggctgatgccatcaactgc ttgggattcacctggagagtggaccaggagagaggcacgtggcctaacatgaccccttct ccagaaatgttcccattgacaccatcaggggcagacatgtttctggagctcgtgggggca cctcagtgccctggggctgtgcagtttgaagttccttttcctttctcaggcatccagccc tgcgtgtacagagctggagatgaacgtcatggactggttggcaaaaatgctgggacttcc agagcacttcttgcaccaccaccccagcagccagggcggaggcgtcctgcagagacaggg tctcactatgttgtccaggctggtctcaaactcctggactcaagggatcctcccaaagtg ctgggattacagagcacggtcagtgaatccactttgattgccctgctggcagcaaggaag aacaaaatcctggaaatgaaaacgtctgagcccgatgctgatgagtcctgcctaaatgcc cgactcgtggcctatgcctctgaccaggctcactcctctgtggaaaaggctggtttgatt tcccttgtgaagatgaaatttctgcctgtggatgacaacttctcactccgaggggaagct cttcagaaggccatcgaggaagacaagcagcggggcttggtgcccgtctttgtctgtgca acactagggaccactggggtctgtgcatttgactgcctgtcagagctgggccccatctgt gcccgtgaggggctgtggctccacatcgatgctgcttatgcaggcactgccttcctgtgc cccgagttccgggggtttctgaaggggattgagtatgccgactccttcacctttaatcct tccaagtggatgatggtgcattttgactgtactgggttctgggtcaaggacaagtacaag ctgcagcagaccttcagtgtgaatcccatctacctcaggcatgccaactcaggcgtggcc accgacttcatgcactggcagatccccctgagccgacggtttcgctctgttaaactctgg ttcgtgattcggtccttcggggtgaagaatcttcaagcacatgtcagacatggtactgaa atggctaaatattttgaatctctggtcagaaacgacccttcctttgaaattcctgccaag aggcaccttggcctggtggtttttcgtctaaagggtcctaattgtctcacagaaaatgtg ttaaaggaaatagctaaagctggccgtctcttcctcatcccggccactatccaggacaag ttaatcatccgtttcactgtgacatcccagtttaccactagggatgacatcctgagagac tggaatctcattcgagatgctgccactctcatcctgagtcagcactgtacttcccaaccc agccctcgggttgggaacctcatctcccaaatcaggggtgccagagcctgggcctgtgga acgtcccttcagtctgtcagtggggcaggagatgatccagtccaggccaggaagatcatc aagcagcctcagcgtgtgggagccggtcccatgaaaagggaaaatggcctccatcttgaa accctgctggacccagttgatgactgcttttcagaagaggccccagatgccaccaagcac aagctgtcctccttcctgttcagttacttgtctgtgcagactaagaagaagacggtgcgc tccctcagttgcaacagtgtgccagtgagtgctcagaagccactgcccacagaggcctct gtgaagaatgggggctcctccagggtcagaatcttttccaggtttccagaagacatgatg atgctgaagaaaagtgccttcaaaaaactcatcaaattctacagcgtccccagctttcct gaatgcagctctcaatgtggactccagctgccctgttgccctctgcaggccatggtttag >gi568815583r:50178635_50409798|GENSCAN_predicted_peptide_3|450_aa MVAAAAPSSALACRGRLAATRLAGPERQPRGGDGVAPRQRRFCGRLEVTAYLWRKMSLVD LGKKLLEAARAGQDDEVRILMANGAPFTTDWLGTSPLHLAAQYGHYSTTEVLLRAGVSRD ARTKVDRTPLHMAASEGHASIVEVLLKHGADVNAKDMLKMTALHWATEHNHQEVVELLIK YGADVHTQSKFCKTAFDISIDNGNEDLAEILQIAMQNQINTNPESPDTVTIHAATPQFII GPGGVVNLTGLVSSENSSKATDETGVSAVQFGNSSTSVLATLAALAEASAPLSNSSETPV VATEEVVTAESVDGAIQQVVSSGGQQVITIVTDGIQLGNLHSIPTSGIGQPIIVTMPDGQ QVLTVPATDIAEETVISEEPPAKRQCIEIIENRVESAEIEEREALQKQLDEANREAQKYR QQLLKKEQEAEAYRQKLEAMTRLQTNKEAV >gi568815583r:50178635_50409798|GENSCAN_predicted_CDS_3|1353_bp atggtggcggccgccgccccctcctcggccctagcatgccgcggccgcctcgcggctacc cggcttgccggtcccgagcggcagccccggggtggcgatggggtcgcgccgaggcagcgg aggttctgcgggcgactggaggtcacagcatatttgtggaggaagatgtccctggtagat ttgggaaagaagcttttagaagcggcacgagcaggtcaagatgatgaagttcgtattttg atggcaaatggagctccctttactacagactggctgggaacttctccacttcatctagca gcacagtatggtcattattccaccacagaggtactgctgcgagctggtgtgagcagagat gccagaaccaaagtggaccgaacaccattacatatggcagcttctgagggccatgccagc atagtagaggttttacttaagcatggtgctgatgtcaatgcaaaggacatgttaaagatg acagctctccattgggccacagaacacaatcatcaagaggtggtggaacttttaatcaaa tatggtgctgatgtacacacgcaaagtaaattttgtaaaactgcatttgatatttcaata gacaatggaaatgaagatttagcagagatattacagattgctatgcagaaccaaatcaac acaaacccagagagtcctgacactgtgacaatacatgctgcaacaccacagtttatcatt ggacctggaggggtggtgaacctaacaggtctggtatcttcagaaaattcatccaaggca acagatgaaacgggtgtatctgctgttcagtttggaaactcttctacatcagtattagct acattagctgccttagctgaagcatctgctccattgtccaattcttcagaaactccagta gtggccacagaagaagtagttactgcagaatctgtggatggtgccattcagcaagtagtt agttcagggggtcagcaagtcatcacaatagttacagatggaattcagcttggaaatttg cactctattccaaccagtggaattggtcagcccatcattgtgaccatgccagatggacaa caagtattaacagtaccagcaacagacattgctgaagaaactgttataagtgaagaacca ccagctaagagacaatgtatcgaaataattgaaaaccgggtggaatctgcagaaatagaa gagagagaagctcttcagaaacagctggatgaagcaaatcgagaagcacaaaaatatcga cagcagctcctaaagaaagaacaggaagcagaggcctacagacagaagttggaagctatg actcgtcttcagactaataaagaagctgtttaa