GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:00:01 Sequence gi568815576r:38194109_38414157 : 220049 bp : 50.39% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4565 4622 58 1 1 104 7 70 0.049 1.97 1.02 Intr + 11506 11656 151 2 1 84 53 68 0.393 2.12 1.03 Intr + 14344 14521 178 1 1 66 22 93 0.270 0.42 1.04 Intr + 17547 17824 278 2 2 80 100 63 0.592 3.01 1.05 Intr + 19715 19781 67 2 1 115 100 30 0.909 5.81 1.06 Term + 20312 20770 459 1 0 104 39 992 0.998 90.99 1.07 PlyA + 21925 21930 6 1.05 2.09 PlyA - 22309 22304 6 1.05 2.08 Term - 27602 27361 242 0 2 120 43 353 0.998 30.19 2.07 Intr - 30871 30677 195 2 0 54 86 418 0.847 37.59 2.06 Intr - 31485 31316 170 2 2 74 105 339 0.991 33.79 2.05 Intr - 32762 32671 92 2 2 128 54 169 0.999 16.29 2.04 Intr - 36636 36561 76 2 1 124 89 25 0.020 5.72 2.03 Intr - 37226 37136 91 0 1 74 61 84 0.013 3.35 2.02 Intr - 52028 51827 202 2 1 76 72 532 0.754 49.26 2.01 Init - 53853 53662 192 0 0 100 80 311 0.991 28.47 2.00 Prom - 61802 61763 40 -7.16 3.04 PlyA - 62546 62541 6 1.05 3.03 Term - 64735 64658 78 1 0 101 38 36 0.355 -2.34 3.02 Intr - 69242 69089 154 1 1 47 80 123 0.621 7.47 3.01 Init - 70352 70108 245 2 2 51 89 185 0.993 10.11 3.00 Prom - 74964 74925 40 -3.26 4.00 Prom + 75018 75057 40 -8.76 4.01 Init + 76484 76493 10 1 1 94 109 13 0.323 3.37 4.02 Intr + 77952 78118 167 1 2 100 107 -18 0.344 0.98 4.03 Term + 79694 79882 189 1 0 76 40 125 0.298 3.95 4.04 PlyA + 80563 80568 6 1.05 5.11 PlyA - 82665 82660 6 1.05 5.10 Term - 90182 90055 128 0 2 108 43 69 0.393 2.84 5.09 Intr - 96547 96466 82 1 1 97 64 69 0.766 4.61 5.08 Intr - 100140 100001 140 1 2 97 99 -35 0.548 -1.32 5.07 Intr - 100426 100234 193 1 1 113 93 255 0.994 27.57 5.06 Intr - 104826 104237 590 1 2 41 -7 527 0.512 30.84 5.05 Intr - 105957 105787 171 1 0 97 76 357 0.999 35.31 5.04 Intr - 106844 106616 229 2 1 108 73 546 0.999 52.54 5.03 Intr - 108901 108753 149 2 2 52 101 283 0.995 25.95 5.02 Intr - 109140 109030 111 1 0 103 89 204 0.966 22.35 5.01 Init - 120049 119974 76 1 1 91 106 112 0.913 14.55 5.00 Prom - 127537 127498 40 -7.16 6.11 PlyA - 128869 128864 6 1.05 6.10 Term - 140826 140728 99 0 0 52 49 103 0.923 0.93 6.09 Intr - 142093 141994 100 0 1 29 61 115 0.623 3.01 6.08 Intr - 142598 142500 99 1 0 100 58 83 0.643 5.73 6.07 Intr - 152592 152421 172 1 1 41 95 104 0.291 5.40 6.06 Intr - 153432 153381 52 0 1 67 96 23 0.158 -0.52 6.05 Intr - 156891 156695 197 1 2 34 35 283 0.801 16.73 6.04 Intr - 157713 157661 53 2 2 79 75 28 0.160 -0.85 6.03 Intr - 167452 167291 162 0 0 53 110 96 0.853 7.39 6.02 Intr - 171046 171019 28 2 1 46 89 16 0.042 -5.33 6.01 Init - 184034 183968 67 2 1 46 98 82 0.668 6.23 6.00 Prom - 186527 186488 40 -3.76 7.00 Prom + 188539 188578 40 -4.06 7.01 Init + 190222 190572 351 0 0 79 86 95 0.758 5.56 7.02 Term + 190628 191323 696 1 0 75 53 261 0.987 14.85 7.03 PlyA + 192141 192146 6 1.05 8.03 PlyA - 192706 192701 6 1.05 8.02 Term - 197167 197157 11 2 2 97 54 3 0.098 -3.84 8.01 Init - 203308 203200 109 1 1 76 111 95 0.751 8.98 8.00 Prom - 213340 213301 40 -2.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 502 437 66 1 0 113 44 85 0.920 4.54 S.002 Intr - 769 582 188 2 2 104 34 110 0.810 6.61 S.003 Init - 34349 34215 135 2 0 42 99 124 0.929 7.05 S.004 Term - 50972 50764 209 0 2 55 43 109 0.833 0.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:38194109_38414157|GENSCAN_predicted_peptide_1|396_aa MALDLDQKGSYVYGKQSLQEDVMNLQTLEVSSAGFQGKKPGRGGGGRHRRQRTPGFQSAT ATKQWLGCLSICTVSANRPKEAAGEEEELETGPGGTELRFGLEPGLVTSWVMASETLTIH PSPVPAPCQGESEAQKEWLTYQNHTTRERDHPQKAPGVSGTKVHVLSISAPCPWETPRSC QNVSEAEPTREVNPVDSSRNQDLDLAFALCCPQNWVMLFPIRSAAQRAPSANMSVDPLSS KALKIKRELSENTPHLSDEALMGLSVRELNRHLRGLSAEEVTRLKQRRRTLKNRGYAASC RVKRVCQKEELQKQKSELEREVDKLARENAAMRLELDALRGKCEALQGFARSVAAARGPA TLVAPASVITIVKSTPGSGSGPAHGPDPAHGPASCS >gi568815576r:38194109_38414157|GENSCAN_predicted_CDS_1|1191_bp atggcccttgatcttgaccagaaagggagctatgtctatgggaagcagtccctgcaggaa gacgtcatgaacctgcagaccctggaggtcagctctgctggcttccaagggaagaagcca gggagaggtggtggtggccgacatagaagacagaggacccctgggttccagtccgccacg gctaccaaacagtggcttggttgcctcagcatctgtactgtctctgccaacaggcccaag gaagcagctggagaggaggaagaattagagacagggccaggaggtactgaactacgtttc ggcttggaacccggcttggtgacaagctgggtgatggcgtcagagactctgaccatccac ccttccccagtgcctgctccctgccagggggaaagtgaggcacagaaagagtggctgacc tatcagaaccacacaactagagagcgagaccacccccaaaaggccccaggtgtgtctggc accaaggtccatgtgctttccatttctgcaccttgtccctgggagacaccccggtcctgc cagaatgtgtcagaggctgagccaacccgtgaggtcaaccctgtagacagctcaaggaac caggacttggatctggcttttgccctctgctgtccacaaaactgggtgatgctctttccg atacggtctgcagcccagagggcaccttctgcaaacatgtctgtggatcccctatccagc aaagctctaaagatcaagcgagagctgagcgagaacacgccgcacctgtcggacgaggcg ctgatggggctgtcggtgcgcgagctgaaccggcatctgcgcgggctctccgccgaggag gtgacacggctcaagcagcggcgccgcacactcaaaaaccgtggctacgccgccagctgc cgcgtgaagcgcgtgtgccagaaggaggagctgcagaagcagaagtcggagctggagcgc gaggtggacaagctggcgcgcgagaacgccgccatgcgcctggagctcgacgcgctgcgc ggcaagtgcgaggcgctgcagggcttcgcgcgctccgtggccgccgcccgcgggcccgcc acgctcgtggcgccggccagcgtcatcaccatcgtcaagtccaccccgggctcggggtct ggccccgcccacggcccggaccccgcccacggcccggcctcctgctcctag >gi568815576r:38194109_38414157|GENSCAN_predicted_peptide_2|419_aa MTVRGDVLAPDPASPTTAAASPSVSVIPEGSPTAMEQPVFLMTTAAQAISGFFVWTALLI TCHQEGPQLTPHLCPQIYMHLRCYSCPNEQRYIVRILFIVPIYAFDSWLSLLFFTNDQYY VYFGTVRDCYEALVIYNFLSLCYEYLGGESSIMSEIRGKPIESSCMYGTCCLWGKTYSIG FLRFCKQATLQFCVVKPLMAVSTVVLQAFGKYRDGDFDVTSGYLYVTIIYNISVSLALYA LFLFYFATRELLSPYSPVLKFFMVKSVIFLSFWQGMLLAILEKCGAIPKIHSARVSVGEG TVAAGYQDFIICVEMFFAALALRHAFTYKVYADKRLDAQGRCAPMKSISSSLKETMNPHD IVQDAIHNFSPAYQQYTQQSTLEPGPTWRGGAHGLSRSHSLSGARDNEKTLLLSSDDEF >gi568815576r:38194109_38414157|GENSCAN_predicted_CDS_2|1260_bp atgacagtgaggggggatgtgctggccccggatccagcgtcgcccacgaccgcagcagcc tcgcccagcgtctccgtgatccccgagggcagccccactgccatggagcagcctgtgttc ctgatgacaactgccgctcaggccatctctggcttcttcgtgtggacggccctgctcatc acatgccaccaggagggtccccagctgaccccacacctctgcccccagatctacatgcac ctgcgctgctacagctgccccaacgagcagcgctacatcgtgcgcatcctcttcatcgtg cccatctacgcctttgactcctggctcagcctcctcttcttcaccaacgaccagtactac gtgtacttcggcaccgtccgcgactgctatgaggccttggtcatctataatttcctgagc ctgtgctatgagtacctaggaggagaaagttccatcatgtcggagatcagaggaaaaccc attgagtccagctgtatgtatggcacctgctgcctctggggaaagacttattccatcgga tttctgaggttctgcaaacaggccaccctgcagttctgtgtggtgaagccactcatggcg gtcagcactgtggtcctccaggccttcggcaagtaccgggatggggactttgacgtcacc agtggctacctctacgtgaccatcatctacaacatctccgtcagcctggccctctacgcc ctcttcctcttctacttcgccacccgggagctgctcagcccctacagccccgtcctcaag ttcttcatggtcaagtccgtcatctttctttccttctggcaaggcatgctcctggccatc ctggagaagtgtggggccatccccaaaatccactcggcccgcgtgtcggtgggcgagggc accgtggctgccggctaccaggacttcatcatctgtgtggagatgttctttgcagccctg gccctgcggcacgccttcacctacaaggtctatgctgacaagaggctggacgcacaaggc cgctgtgcccccatgaagagcatctccagcagcctcaaggagaccatgaacccgcacgac atcgtgcaggacgccatccacaacttctcacctgcctaccagcagtacacgcagcagtcc accctggagcctgggcccacctggcgtggtggcgcccacggcctctcccgctcccacagc ctcagtggcgcccgcgacaacgagaagactctcctgctcagctctgatgatgaattctag >gi568815576r:38194109_38414157|GENSCAN_predicted_peptide_3|158_aa MARLEALGPLGLCTPTPSHLACLLIFMGAWGDQLWLLVIVAQVTVHTSWDQDIRALRRVK GPSGPSLLLTAGKCKRVDQSTKGSLGRHRKSVIVMPLEGVLVMGDCLFAGDPMYLLIAEA GECTPFQEIKMGQNSERSEIPKVLLSLQQTNFCCKNLT >gi568815576r:38194109_38414157|GENSCAN_predicted_CDS_3|477_bp atggctagactggaagcacttggccctcttgggctctgcacccccaccccctcccacctg gcctgcctgctcatcttcatgggcgcctggggagaccaattatggctgcttgtcatagtg gctcaggtcaccgttcacacttcctgggaccaggacatcagagccctgagaagggtcaag gggccaagtgggcctagccttttactgacagctgggaaatgcaagcgtgtggaccagagc accaaggggtctctgggcagacaccgcaagtcggtgattgtgatgcctctggagggggtg ctggtcatgggagattgcttgtttgctggagatcccatgtaccttttgattgctgaagca ggtgaatgtacgcctttccaagaaattaaaatgggccagaattcagaacgctcagaaatc ccaaaggttcttctaagtctgcagcagactaatttctgctgcaaaaatctgacctga >gi568815576r:38194109_38414157|GENSCAN_predicted_peptide_4|121_aa MVFGIRHANTCFAPKCPGKEGGVKRLALYNTSRGDMYWESMPNPQHNCLALAWSHPALES PGQNSESAVNECPPSARYPIRQAGDPVGSTCKIDLDSDVCHSSSSATTLVQYLHRIIAEF C >gi568815576r:38194109_38414157|GENSCAN_predicted_CDS_4|366_bp atggtcttcggtatcaggcatgcaaatacctgctttgccccaaaatgccctggaaaggag ggaggagtaaaacgtctggccttatacaacacctccagaggggatatgtattgggaatcc atgcccaacccccagcataactgccttgccttggcctggagccacccagcactggagtcc ccaggccagaactcggagtcagctgtgaatgaatgccccccttctgcccgctacccaata aggcaggcaggagatcctgttggctctacctgcaaaatagatctagactcagatgtctgc cactcctcctccagtgctaccaccctggtccagtatttgcacaggattattgcagagttc tgctaa >gi568815576r:38194109_38414157|GENSCAN_predicted_peptide_5|622_aa MELRVGNKYRLGRKIGSGSFGDIYLGANIASGEEVAIKLECVKTKHPQLHIESKFYKMMQ GGVGIPSIKWCGAEGDYNVMVMELLGPSLEDLFNFCSRKFSLKTVLLLADQMISRIEYIH SKNFIHRDVKPDNFLMGLGKKGNLVYIIDFGLAKKYRDARTHQHIPYRENKNLTGTARYA SINTHLGIEQSRRDDLESLGYVLMYFNLGSLPWQGLKAATKRQKYERISEKKMSTPIEVL CKGYPSEFSTYLNFCRSLRFDDKPDYSYLRQLFRNLFHRQGFSYDYVFDWNMLKFVSHLE AKAGLGDWMGKALTQRARVSWWREGLPTDGRYELDSVVDLTGVPEPSSLIRADSLCHHGQ QGPGRALSPVGTRRIWLHLGSLLGARSRAWVVLAAPPAALIAAYTVLLLLFCTSQVATCT MAPRASLDPVMVLAFFLVMGPWGAARNPEDVDRERREHEREERMGQLRGSATRALPPGPP TGATANRLRSAAEPVASTPASRIQPAGNTSPRAISRVDRERKVSMRLHRGAPANVSSSDL TGRQEVSRIPASQGLSLQTRKWVLLGAAGVSEEQALALKAETDVETVMAKNAQSPWPLSQ SLQHKRPQEHNWSLEKSTHSKL >gi568815576r:38194109_38414157|GENSCAN_predicted_CDS_5|1869_bp atggagctacgtgtggggaacaagtaccgcctgggacggaagatcgggagcgggtccttc ggagatatctacctgggtgccaacatcgcctctggtgaggaagtcgccatcaagctggag tgtgtgaagacaaagcacccccagctgcacatcgagagcaagttctacaagatgatgcag ggtggcgtggggatcccgtccatcaagtggtgcggagctgagggcgactacaacgtgatg gtcatggagctgctggggcctagcctcgaggacctgttcaacttctgttcccgcaaattc agcctcaagacggtgctgctcttggccgaccagatgatcagccgcatcgagtatatccac tccaagaacttcatccaccgggacgtcaagcccgacaacttcctcatggggctggggaag aagggcaacctggtctacatcatcgacttcggcctggccaagaagtaccgggacgcccgc acccaccagcacattccctaccgggaaaacaagaacctgaccggcacggcccgctacgct tccatcaacacgcacctgggcattgagcaaagccgtcgagatgacctggagagcctgggc tacgtgctcatgtacttcaacctgggctccctgccctggcaggggctcaaagcagccacc aagcgccagaagtatgaacggatcagcgagaagaagatgtcaacgcccatcgaggtcctc tgcaaaggctatccctccgaattctcaacatacctcaacttctgccgctccctgcggttt gacgacaagcccgactactcttacctacgtcagctcttccgcaacctcttccaccggcag ggcttctcctatgactacgtctttgactggaacatgctgaaattcgtgagtcacctggag gccaaggcgggcttgggggactggatggggaaggccctgactcagagggccagagtgagc tggtggcgggaggggctcccgacagacgggaggtacgagctggacagtgtggttgacctc actggggtcccagagccctccagcctcatccgtgctgacagcttgtgccaccatgggcag cagggcccaggcagggcgctgtctcccgtcggaacaaggaggatctggcttcacctgggg tccctcctgggtgcccgcagcagggcatgggtggtgctggcggcgccccctgctgccctg attgctgcctacaccgtcctgctgctcctgttctgcacatcccaggtggctacctgcacc atggcccctcgggccagcctggacccagtcatggtgcttgctttcttcttggtcatgggc ccctggggtgcagcccggaatcccgaggatgtggaccgggagcggcgagaacacgaacgc gaggagaggatggggcagctacgggggtccgcgacccgagccctgccccctggcccaccc acgggggccactgccaaccggctccgcagtgccgccgagcccgtggcttccacgccagcc tcccgcatccagccggctggcaatacttctcccagagcgatctcgcgggtcgaccgggag aggaaggtgagtatgaggctgcacaggggtgcgcccgccaacgtctcctcctcagacctc actgggcggcaagaggtctcccggatcccagcctcacagggcctctccctgcagacccgg aagtgggtcttgctgggagcagcaggagtcagtgaggaacaagccctcgccctgaaggca gaaacagatgtggaaacagttatggctaagaatgctcagtccccatggcctctttcccag agcctacagcacaagcgcccacaggaacacaactggtcattggagaagtccacacattcc aagctctga >gi568815576r:38194109_38414157|GENSCAN_predicted_peptide_6|342_aa MIGEELLVVFEREKAENYKGKQDLVLYGDIVKLFGVFLVLLDVTLVLADLIFTDSKLYIP LEYRSISLAIALFFLMDVLLRVFVEGPVYTIGLPPSDLRAGKEETVLVRERHQQESQRFL LLSIITITIILITITVTVIIIITIMDFKEVQRISSSGQSESCSVEAGGWVELPSSFSAAF LILQSKELRACWGNSPRNTSLAPSVDPEEGAPRRSGPGPNSEGSSLQNQPLELRDSPTAL RTPWCWSGPERSRVGCEEKETGLSEAPHPSEVIPYSSAPQRSPHLQAQVCALRISKAGQQ ASGERQQQLGKGLGPTEGIEGPENGKCWGEKDGDELLGKPPE >gi568815576r:38194109_38414157|GENSCAN_predicted_CDS_6|1029_bp atgattggtgaagagttactagtggtatttgaacgtgagaaggcagaaaattacaagggc aagcaagacttggttctgtatggtgacattgtcaaactatttggagttttcctcgtctta ctggatgtcactctcgtccttgccgacctaattttcactgacagcaaactttatattcct ttggagtatcgttctatttctctagctattgccttattttttctcatggatgttcttctt cgagtatttgtagaagggcccgtctataccattgggctgcccccttcagacctccgggca ggaaaagaagagactgtactggtcagagaaagacatcagcaggagagccagagattcctc ctcctctccatcatcaccatcaccattattctcatcaccatcaccgtcaccgtcatcatc atcatcaccatcatggacttcaaagaagttcaaagaatttcctcctctggccagagtgaa tcctgttctgtggaagcaggaggctgggtagagctgccctcttccttctccgcagctttt ctcatcctgcaaagcaaggagctgagagcctgctggggaaacagtcctagaaacacctcg ctagcacccagtgtggacccagaggaaggggctcctaggagaagtggcccagggcccaac tcggaaggaagcagccttcagaatcagccgctggagctgcgggactcgcccacggccctc aggactccctggtgttggagcgggccggagagaagtagagtgggctgcgaggagaaggaa acaggcctttcggaggctcctcatccctcggaggtcatcccttactccagcgctccccag cggagtcctcacctgcaggcacaggtgtgcgcacttcgcatctcgaaggccggccagcaa gcttcaggtgagcgccagcagcagctggggaaaggacttggaccaactgaaggtattgag ggaccagagaacgggaagtgctggggagagaaggacggggatgagctcttgggaaaaccc ccagaatga >gi568815576r:38194109_38414157|GENSCAN_predicted_peptide_7|348_aa MIVYLENTIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS KRIKYLGIQLTRDVKDLFKENYKPLLKEIKKDTNKWKNIPCSWVGRINIVKMAILPKNWK KLKFIWNQKRAHIAKSILSQKNKAGGITLPDFKLYYEATVTKTAWYWYRNRDIDQWNTTE PSEIMPHIYNYLIFDKPEKNKKWGKDSLFNKWCWESWLAICRKLKLDPFLTPYTKINSRW IKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAK ETTIRVNRQTTKWEKIFATYSSDKGLISRICNELKQIYKKKTNNPIKK >gi568815576r:38194109_38414157|GENSCAN_predicted_CDS_7|1047_bp atgattgtatatctagaaaacaccattgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac accaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggag aactacaaaccactgctcaaggaaataaaaaaggatacaaacaaatggaagaacattcca tgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaagaattggaaa aaactaaagttcatatggaaccaaaaaagagcccacatcgccaagtcaatcctaagccaa aagaacaaagctggaggcatcacactacctgacttcaaactatactacgaggctacagta accaaaacagcatggtactggtaccgaaacagagatatagatcaatggaacacaacagag ccctcagaaataatgccgcatatctacaactatctgatctttgacaaacctgagaaaaac aagaaatggggaaaggattccctatttaataaatggtgctgggaaagctggctagccata tgcagaaagctgaaactggatcccttccttacgccttatacaaaaatcaattcaagatgg attaaagacttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcatt accattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggca acaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaa gaaactaccatcagagtgaacaggcaaactacaaaatgggagaaaattttcgcaacctac tcatctgacaaagggctaatatccagaatctgcaatgaactcaaacaaatttacaagaaa aaaacaaacaaccccatcaaaaagtga >gi568815576r:38194109_38414157|GENSCAN_predicted_peptide_8|39_aa MVEGRRALLGAWEAGFRAAWAVPPEGQDPTAGRAPQLLC >gi568815576r:38194109_38414157|GENSCAN_predicted_CDS_8|120_bp atggtggaggggcggagggcgctgctgggggcctgggaggctggatttagggctgcctgg gcggtaccgcccgaggggcaagacccgacagcggggcgcgcgccgcaacttctttgttga