GENSCAN 1.0 Date run: 5-Nov-116 Time: 22:56:51 Sequence gi568815586f:120625886_120839445 : 213560 bp : 46.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 5687 5726 40 -1.56 1.01 Init + 14801 15454 654 1 0 42 110 509 0.429 43.39 1.02 Term + 15636 15956 321 2 0 76 54 102 0.323 0.42 1.03 PlyA + 21270 21275 6 1.05 2.00 Prom + 22618 22657 40 -2.86 2.01 Init + 30235 30387 153 0 0 62 57 242 0.213 17.39 2.02 Intr + 33083 33340 258 1 0 72 81 116 0.800 6.96 2.03 Intr + 33993 34023 31 2 1 93 93 17 0.863 0.40 2.04 Intr + 34311 34454 144 1 0 94 84 337 0.999 34.15 2.05 Intr + 34846 34955 110 2 2 88 82 162 0.999 15.60 2.06 Intr + 35186 35333 148 1 1 103 100 217 0.983 24.21 2.07 Term + 40990 41015 26 2 2 93 54 28 0.082 -1.71 2.08 PlyA + 41417 41422 6 1.05 3.00 Prom + 42042 42081 40 -3.56 3.01 Init + 61412 61823 412 1 1 103 77 581 0.935 53.18 3.02 Intr + 68206 68384 179 2 2 117 100 302 0.997 33.94 3.03 Intr + 68939 69115 177 1 0 50 86 169 0.999 13.02 3.04 Intr + 69210 69267 58 2 1 106 78 45 0.994 3.76 3.05 Term + 70431 70660 230 1 2 66 54 314 0.979 22.59 3.06 PlyA + 75954 75959 6 1.05 4.00 Prom + 79410 79449 40 -4.66 4.01 Init + 84590 84833 244 1 1 82 70 601 0.991 53.60 4.02 Intr + 87389 87502 114 0 0 96 96 93 0.999 11.32 4.03 Intr + 90743 90854 112 0 1 77 83 171 0.999 14.94 4.04 Intr + 90985 91157 173 1 2 0 36 214 0.973 6.99 4.05 Intr + 94035 94143 109 1 1 66 100 166 0.976 14.94 4.06 Intr + 99940 100046 107 1 2 76 105 46 0.096 4.96 4.07 Intr + 101141 101304 164 0 2 87 94 127 0.812 12.89 4.08 Intr + 111101 111250 150 1 0 98 66 315 0.755 30.66 4.09 Intr + 111471 111582 112 2 1 100 82 113 0.999 11.85 4.10 Intr + 111952 112103 152 2 2 71 86 128 0.999 10.78 4.11 Intr + 112395 112565 171 2 0 98 89 203 0.998 21.54 4.12 Intr + 112648 112785 138 0 0 28 95 301 0.993 25.46 4.13 Intr + 112935 113030 96 2 0 25 44 160 0.814 5.51 4.14 Intr + 113255 113311 57 1 0 28 109 112 0.977 6.38 4.15 Intr + 113411 113524 114 1 0 94 18 337 0.264 27.94 4.16 Intr + 114395 114673 279 1 0 40 53 183 0.232 7.67 4.17 Intr + 134053 134208 156 0 0 99 80 81 0.956 8.61 4.18 Term + 135304 135429 126 0 0 70 44 117 0.974 3.78 4.19 PlyA + 136202 136207 6 -0.45 5.10 PlyA - 137365 137360 6 1.05 5.09 Term - 139185 139114 72 0 0 71 45 52 0.213 -2.89 5.08 Intr - 140487 140378 110 1 2 128 100 -17 0.553 3.50 5.07 Intr - 141708 141509 200 2 2 56 82 229 0.958 18.09 5.06 Intr - 142603 142440 164 1 2 72 116 138 0.999 13.77 5.05 Intr - 143174 143068 107 0 2 121 94 122 0.368 16.03 5.04 Intr - 156882 156770 113 2 2 82 101 5 0.073 1.22 5.03 Intr - 158287 158227 61 2 1 62 75 55 0.251 -0.61 5.02 Intr - 165672 165584 89 2 2 104 55 99 0.531 7.81 5.01 Intr - 185001 184924 78 2 0 116 110 -7 0.104 2.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100046 46 1 1 71 105 120 0.880 10.90 S.002 Term + 102441 102491 51 2 0 68 43 114 0.881 2.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:120625886_120839445|GENSCAN_predicted_peptide_1|324_aa MGGGDGAAFKRPGDGARLQRVLGLGSRREPRSLPAGGPAPRRTAPPPPGHASAGPAAMSS HIAKSESKTSLLKAAAAAASGGSRAPRHGPARDPGLPSRRLPGSCPATPQSSGDPSSRRP LCRPAPREEGARGSQRVLPQAHCRPREALPAAASRPSPSSPLPPARGRDGEERGLSPALG LRGSLRARGRGDSVPAAASEADPFLHRLRPMLSSAFGQHPVRCPRSGPLGARPSPLASLI PARTRQSWKRATSTTVLEICGGQGGALRAGGAKPTPLASGQYRQGGTQESKRFGVWGSRE GTSENTELLEGEGHWPLPIMSLKA >gi568815586f:120625886_120839445|GENSCAN_predicted_CDS_1|975_bp atgggcggcggcgacggggccgcatttaagcggccgggggacggcgcccgcctccagcgc gtcctcgggcttggctcccgccgggagccccgttctctgcccgccgggggccccgcgccg cgccgcaccgcgccgcccccgccgggccatgcgagcgcgggccccgccgcgatgagctcg cacatcgccaaaagcgagtccaagacgtcgctgctgaaggcggcggcggcggcggcgagc gggggcagccgggctccccgccacggccctgcccgggacccggggctgcctagccgccgg ctacccggctcctgcccggcgacgccgcagtcgtccggggaccccagttcgcggaggccc ctgtgccggccggcgccgcgagaggagggcgcgcgggggagccagcgtgtgctcccccag gcgcactgcaggccccgggaggcgctgccggccgcggcgtcccgaccttcgccgtcgtcg ccgctgccgccggcccgcgggcgggatggggaggaacggggactgtccccggcgctcggc ctccggggctctctgcgagcccggggccgcggggactccgttccagccgccgcgtccgag gcggacccgttcctccaccggctgcgccccatgctcagctccgcctttggccagcacccc gtgcgctgtccacgctccgggcctcttggggcaaggccctccccgctagcctccctcata cccgccagaacccgccagtcctggaagagagcgacctctacgaccgtcctggagatttgc gggggtcaaggaggggcgctccgagcaggtggcgccaaacccactcctctggcctctggc cagtaccgccagggagggactcaggagagcaagcggtttggagtctgggggtcaagagag gggacctcagagaacacagaattacttgagggcgagggacactggccactgcctatcatg tccctgaaggcctga >gi568815586f:120625886_120839445|GENSCAN_predicted_peptide_2|289_aa MVVQTSEEGLAADAELPGPLLMLAQNCAVMHNLLGPACIFLRKGFAENRQPAQGNWQLTN LSFLQACSLEMSLPSVLTPRFRVALIAQETWALLEAGDMCFLDPGSSQKHWWPPGSYVVS CDPMTGERTGMGVKLEKDRSLRPEEIEELREAFREFDKDKDGYINCRDLGNCMRTMGYMP TEMELIELSQQINMNLGGHVDFDDFVELMGPKLLAETADMIGVKELRDAFREFDTNGDGE ISTSELREAMRKLLGHQVGHRDIEEIIRDVDLNGDGRVDFEEFVRMMSR >gi568815586f:120625886_120839445|GENSCAN_predicted_CDS_2|870_bp atggtggtgcagacgagcgaggaggggctggcggctgacgccgagctcccgggaccgctc ctgatgctggcccagaactgcgcagtcatgcacaacctgctgggccctgcctgcattttc ctgcgcaagggcttcgctgagaacaggcagcctgcccagggcaactggcagctcaccaac ctgagcttcctccaggcctgctccttggagatgtctctgccatctgttctgacgcctaga ttcagggttgccttgattgcccaggagacctgggcactgcttgaggctggagacatgtgc tttttggaccctggctccagccagaagcactggtggcccccggggtcctatgtggtgtcc tgtgatcccatgacaggggaacgtacagggatgggggtcaaattggagaaggatagatca ctgcgaccagaggaaattgaagagctccgagaggccttcagagaattcgacaaggacaag gatggctacatcaactgccgggatctgggcaactgcatgcgcaccatgggctacatgccc accgagatggagctcatcgaactgtcccagcagatcaacatgaacctgggtggccatgta gattttgatgacttcgtggagctaatggggcctaaactcctggcagagacagcagatatg attggtgtaaaggaactgcgagatgctttccgagagtttgacaccaatggtgatggggaa ataagcaccagtgagctgcgagaggctatgaggaagctcctgggtcatcaggtgggacac cgagacatagaggaaattatccgagatgtggacctcaatggggatggacgagtggacttt gaagagtttgtccggatgatgtcccgctga >gi568815586f:120625886_120839445|GENSCAN_predicted_peptide_3|351_aa MLGAWAVEGTAVALLRLLLLLLPPAIRGPGLGVAGVAGAAGAGLPESVIWAVNAGGEAHV DVHGIHFRKDPLEGRVGRGESPPAEPRDPGPAVLGAAGRGLRAPSPFRPWGRVSGAKFLS AASSGARSELRAWHWLQASDYGMKLPILRSNPEDQILYQTERYNEETFGYEVPIKEEGDY VLVLKFAEVYFAQSQQKVFDVRLNGHVVVKDLDIFDRVGHSTAHDEIIPMSIRKGKLSVQ GEVSTFTGKLYIEFVKGYYDNPKVCALYIMAGTVDDVPKLQPHPGLEKKEEEEEEEEYDE GSNLKKQTNKNRVQSGPRTPNPYASDNSSLMFPILVAFGVFIPTLFCLCRL >gi568815586f:120625886_120839445|GENSCAN_predicted_CDS_3|1056_bp atgctgggagcctgggcggttgagggaaccgctgtggcgctcctgcgactgctgctgctg ctgctgccgccggcgatccggggacccgggctcggcgtggccggcgtggccggcgcggcg ggggccgggctgcccgagagcgtcatttgggcggtcaacgcgggtggagaggcgcatgtg gacgtgcacgggatccacttccgcaaggaccctttggaaggccgggtgggccgaggtgag agtccccctgccgagccgcgggatccagggcctgctgtgctgggcgcagccggccggggg ctgcgggccccgagcccctttcgaccctggggccgcgtctctggagcgaagtttctctct gcagcttcttcgggggcccgctctgagctcagggcctggcactggctccaagcctcagac tatggcatgaaactgccaatcctgcgttccaaccctgaggaccagatcctgtatcaaact gagcggtacaatgaggagacctttggctacgaagtgcccatcaaagaggagggggactac gtgctggtcttgaaatttgcagaggtctactttgcacagtcccagcaaaaggtatttgat gtacgattgaatggccacgtcgtggtgaaggacttggatatctttgatcgtgttgggcat agcacagctcacgatgaaattatacctatgagcatcagaaaggggaagctgagtgtccag ggggaggtgtccaccttcacagggaaactctacattgagtttgtcaaggggtactatgac aatcccaaggtctgtgcactctacatcatggctgggacagtggatgatgtaccaaagctt cagcctcatccgggattggagaagaaagaagaggaagaagaagaagaagaatatgatgaa gggtctaatctcaaaaaacagaccaataagaaccgggtgcagtcaggcccccgcacaccc aacccctatgcctcggacaacagcagcctcatgtttcccatcctggtggccttcggagtc ttcattccaaccctcttctgcctctgccggttgtga >gi568815586f:120625886_120839445|GENSCAN_predicted_peptide_4|857_aa MSGSNPKAAAAASAAGPGGLVAGKEEKKKAGGGVLNRLKARRQAPHHAADDGVGAAVTEQ ELLALDTIRPEHVLRLSRVTENYLCKPEDNIYSIDFTRFKIRDLETGTVLFEIAKPCVSD QEEDEEEGGGDVDISAGRFVRYQFTPAFLRLRTVGATVEFTVGDKPVSNFRMIERHYFRE HLLKNFDFDFGFCIPSSRNTCEHIYEFPQLSEDVIRLMIENPYETRSDSFYFVDNKLIMH NKADYAYNGGHTPEQRARSGRSRSLGLCLSPMAAALLARASGPARRALCPRAWRQLHTIY QSVELPETHQMLLQTCRDFAEKELFPIAAQVDKEHLFPAAQVKKMGGLGLLAMDVPEELG GAGLDYLAYAIAMEEISRGCASTGVIMSVNNSLYLGPILKFGSKEQKQAWVTPFTSGDKI GCFALSEPGNGSDAGAASTTARAEGDSWVLNGTKAWITNAWEASAAVVFASTDRALQNKG ISAFLVPMPTPGLTLGKKEDKLGIRGSSTANLIFEDCRIPKDSILGEPGMGFKIAMQTLD MGRIGIASQALGIAQTALDCAVNYAENRMAFGAPLTKLQVIQFKLADMALALESARLLTW RAAMLKDNKKPFIKEAAMAKLAASEAATAISHQAIQILGGMGYVTEMPAERHYRDARITE IYEGTSEIQRLDQPPGRRKEGPRLPESGSKAVEASSPRWENTEPREAAIVLTSALGGGPG QPVRGLALVLVPAGLSLGKGAPEAKVQALNPLGQCRSQQLEEKTGAVTGYVTICLGIKTP SRHDSNRRCEEIQSLMAKAQTQKLEAHVVATLQQLQERALDFKGKAEEETDDGWSGKNFV MGESLRMRQLESEPAEI >gi568815586f:120625886_120839445|GENSCAN_predicted_CDS_4|2574_bp atgagcgggtctaacccgaaggctgcggccgcggcgtcggcggctgggcccggggggctg gtggctggcaaggaggagaagaagaaggcgggcggcggcgtcctgaaccgcctgaaggcg cggcggcaggcgccccaccacgcggccgacgacggcgtcggggcagcggtcacggagcag gagctgctggcgctggacaccatccggcccgagcacgtcctgcgcctcagccgggtcacc gagaattatttatgtaaacccgaagacaacatctacagtattgatttcacccgcttcaaa attcgagatttggagacagggacagtactttttgagattgccaaaccttgcgtttcagac caggaggaggatgaggaggagggaggtggagacgtggacatcagcgcaggacgttttgtc cgctatcagttcacaccggcatttctccgcctccggacagtcggggctacggtggagttc acagtgggagacaaacctgtttcaaacttccggatgatcgaacggcactatttccgggaa cacttgctgaaaaactttgactttgattttggcttctgcatccccagcagtaggaacact tgtgaacatatctatgagtttccccagctttcggaggatgtcattcgtctaatgattgaa aatccttacgagacccgctctgacagcttctactttgttgacaacaagctgataatgcac aacaaggctgattatgcctataatggaggccacactccggaacagcgcgctcgcagcggg aggtcgcgaagcctgggactgtgtctgtcgcccatggccgccgcgctgctcgcccgggcc tcgggccctgcccgcagagctctctgtcctagggcctggcggcagttacacaccatctac cagtctgtggaactgcccgagacacaccagatgttgctccagacatgccgggactttgcc gagaaggagttgtttcccattgcagcccaggtggataaggaacatctcttcccagcggct caggtgaagaagatgggcgggcttgggcttctggccatggacgtgcccgaggagcttggc ggtgctggcctcgattacctggcctacgccatcgccatggaggagatcagccgtggctgc gcctccaccggagtcatcatgagtgtcaacaactctctctacctggggcccatcttgaag tttggctccaaggagcagaagcaggcgtgggtcacgcctttcaccagtggtgacaaaatt ggctgctttgccctcagcgaaccagggaacggcagtgatgcaggagctgcgtccaccacc gcccgggccgagggcgactcatgggttctgaatggaaccaaagcctggatcaccaatgcc tgggaggcttcggctgccgtggtctttgccagcacggacagagccctgcaaaacaagggc atcagtgccttcctggtccccatgccaacgcctgggctcacgttggggaagaaagaagac aagctgggcatccggggctcatccacggccaacctcatctttgaggactgtcgcatcccc aaggacagcatcctgggggagccagggatgggcttcaagatagccatgcaaaccctggac atgggccgcatcggcatcgcctcccaggccctgggcattgcccagaccgccctcgattgt gctgtgaactacgctgagaatcgcatggccttcggggcgcccctcaccaagctccaggtc atccagttcaagttggcagacatggccctggccctggagagtgcccggctgctgacctgg cgcgctgccatgctgaaggataacaagaagcctttcatcaaggaggcagccatggccaag ctggccgcctcggaggccgcgaccgccatcagccaccaggccatccagatcctgggcggc atgggctacgtgacagagatgccggcagagcggcactaccgcgacgcccgcatcactgag atctacgagggcaccagcgaaatccagcggctggaccagcctccaggcaggaggaaggaa ggccccagattgccagagtcggggagcaaagctgtggaggcctcgagccccaggtgggag aacacagaacctcgagaagcagccattgtgctcactagcgcgctcgggggcggccctggc cagccagtgcgggggcttgccctggtcctggtgcctgcagggctttctctgggcaaaggt gcccctgaagccaaggtccaagctcttaaccctctgggccagtgtcgatctcagcagctg gaagagaaaacaggagctgtcacaggatatgttactatctgcctggggataaaaacccca tcaaggcacgacagcaaccgcaggtgtgaggaaattcaatctctaatggcaaaggctcaa acacaaaagctagaagctcacgtggtggctacactccagcagctccaggaaagggctctg gatttcaagggaaaggctgaggaggaaacagatgatggctggagcgggaagaattttgtg atgggagagtctctgcgcatgagacagttggaatctgaacctgcagaaatttga >gi568815586f:120625886_120839445|GENSCAN_predicted_peptide_5|331_aa XAYSLVDSSQVSTFLISILLIVYGSFRSLNMDFENQDKEKDSNSSSGSFNGNSTNNNYQK VELGIEAIVSLYPRSCRISFGCCGRFTAAELLSFSLSVMLVLIWVLTGHWLLMDALAMGL CVAMIAFVRLPSLKVSCLLLSGLLIYDVFWVFFSAYIFNSNVMVKVATQPADNPLDVLSR KLHLGPNVGRDVPRLSLPGKLVFPSSTGSHFSMLGIGDIVMPGLLLCFVLRYDNYKKQAS GDSCGAPGPANISGRMQKVSYFHCTLIGYFVGLLTATVASRIHRAAQPALLYLVPFTLLP LLTMAYLKGDLRRMWSEPFHSKSSSSRFLEV >gi568815586f:120625886_120839445|GENSCAN_predicted_CDS_5|996_bp nnggcctattccctggtggattccagtcaagtgtctacatttctgatttccattcttctt atagtctatggtagtttcaggtcccttaatatggactttgaaaatcaagataaggagaaa gacagtaatagttcttctgggtctttcaatggcaacagcaccaataataattaccagaag gttgagcttggcatagaggccattgtcagcctctacccacgttcttgcaggatttccttt ggttgctgtggacgtttcactgctgctgagttgctgtcattctctctgtctgtcatgctc gtcctcatctgggttctcactggccattggcttctcatggatgcactggccatgggcctc tgtgtcgccatgatcgcctttgtccgcctgccgagcctcaaggtctcctgcctgcttctc tcagggcttctcatctatgatgtcttttgggtatttttctcagcctacatcttcaatagc aacgtcatggtgaaggtggccactcagccggctgacaatccccttgacgttctatcccgg aagctccacctggggcccaatgttgggcgtgatgttcctcgcctgtctctgcctggaaaa ctggtcttcccaagctccactggcagccacttctccatgttgggcatcggagacatcgtt atgcctggtctcctactatgctttgtccttcgctatgacaactacaaaaagcaagccagt ggggactcctgtggggcccctggacctgccaacatctccgggcgcatgcagaaggtctcc tactttcactgcaccctcatcggatactttgtaggcctgctcactgctactgtggcgtct cgcattcaccgggccgcccagcccgcccttctctatttggtgccatttactttattgcca ctcctcacgatggcctatttaaagggcgacctccggcggatgtggtctgagcctttccac tccaagtccagcagctcccgattcctggaagtatga