GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:28:21 Sequence gi568815591f:74074384_74295305 : 220922 bp : 50.33% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5595 5597 3 2 0 93 95 0 0.390 1.20 1.02 Intr + 6167 6254 88 1 1 68 70 47 0.370 0.44 1.03 Intr + 11365 11461 97 2 1 102 91 127 0.852 13.47 1.04 Intr + 15350 15428 79 2 1 55 101 25 0.325 0.05 1.05 Intr + 18153 18313 161 2 2 72 43 77 0.405 0.39 1.06 Intr + 22239 22377 139 0 1 89 89 172 0.918 17.87 1.07 Intr + 22697 22806 110 1 2 81 105 166 0.997 16.68 1.08 Intr + 24649 24855 207 1 0 73 84 223 0.982 18.59 1.09 Intr + 31492 31597 106 1 1 75 80 148 0.977 12.92 1.10 Intr + 31694 31860 167 1 2 91 89 158 0.994 14.96 1.11 Intr + 32627 32810 184 2 1 71 87 245 0.988 22.49 1.12 Intr + 33488 33574 87 1 0 94 78 156 0.997 15.37 1.13 Intr + 34522 34653 132 0 0 68 101 327 0.997 32.84 1.14 Intr + 37265 37324 60 1 0 108 101 79 0.999 10.13 1.15 Intr + 37550 37615 66 1 0 140 105 98 0.999 15.80 1.16 Intr + 41419 41575 157 0 1 55 113 317 0.954 30.48 1.17 Intr + 46200 46255 56 1 2 76 92 81 0.822 6.00 1.18 Intr + 46509 46666 158 2 2 63 105 271 0.999 25.11 1.19 Term + 46756 46918 163 1 1 83 52 169 0.994 10.21 1.20 PlyA + 48114 48119 6 1.05 2.00 Prom + 91024 91063 40 -0.36 2.01 Init + 100001 100059 59 1 2 56 72 209 0.663 16.88 2.02 Intr + 113228 113415 188 2 2 116 113 117 0.994 16.33 2.03 Intr + 113936 114022 87 0 0 55 75 100 0.906 5.34 2.04 Intr + 120358 120495 138 2 0 96 102 59 0.981 8.54 2.05 Term + 120786 120925 140 1 2 115 52 74 0.987 4.63 2.06 PlyA + 122690 122695 6 1.05 3.03 PlyA - 122732 122727 6 1.05 3.02 Term - 129095 129001 95 0 2 62 54 106 0.643 2.59 3.01 Init - 132191 132143 49 2 1 66 111 -5 0.249 0.79 3.00 Prom - 138211 138172 40 -1.56 4.00 Prom + 140343 140382 40 -7.76 4.01 Init + 141593 141686 94 1 1 104 115 110 0.960 13.84 4.02 Intr + 142442 142481 40 0 1 114 65 26 0.697 0.18 4.03 Intr + 145834 145871 38 1 2 151 96 14 0.746 6.41 4.04 Intr + 146321 146351 31 0 1 114 84 0 0.654 -0.61 4.05 Intr + 147254 147309 56 2 2 70 100 66 0.662 4.62 4.06 Intr + 149341 149400 60 2 0 127 94 104 0.999 13.61 4.07 Intr + 149635 149814 180 2 0 111 69 36 0.931 3.84 4.08 Term + 150256 150359 104 2 2 116 49 84 0.940 5.74 4.09 PlyA + 151007 151012 6 1.05 5.12 PlyA - 152837 152832 6 1.05 5.11 Term - 157833 157706 128 1 2 88 42 52 0.731 -0.96 5.10 Intr - 161262 161145 118 0 1 77 89 126 0.948 11.64 5.09 Intr - 163059 162979 81 0 0 112 58 120 0.996 11.23 5.08 Intr - 164605 164540 66 1 0 77 82 100 0.936 7.40 5.07 Intr - 165712 165555 158 2 2 84 91 251 0.947 24.73 5.06 Intr - 168863 168763 101 1 2 89 109 109 0.931 12.85 5.05 Intr - 172380 172279 102 2 0 58 80 88 0.941 4.29 5.04 Intr - 174735 174629 107 0 2 32 96 92 0.992 3.41 5.03 Intr - 175397 175356 42 2 0 79 101 44 0.901 3.24 5.02 Intr - 178115 178046 70 1 1 78 105 48 0.969 4.68 5.01 Init - 180000 179888 113 0 2 84 65 146 0.923 11.61 5.00 Prom - 183812 183773 40 -1.96 6.05 PlyA - 184510 184505 6 1.05 6.04 Term - 205458 205262 197 1 2 61 43 91 0.131 -0.53 6.03 Intr - 212599 212551 49 1 1 41 105 61 0.165 1.35 6.02 Intr - 214838 214679 160 1 1 132 100 66 0.714 12.19 6.01 Init - 215262 215189 74 0 2 73 15 114 0.292 1.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 115290 115354 65 1 2 85 92 92 0.972 7.74 S.002 Intr + 115439 115535 97 1 1 54 45 122 0.920 4.08 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:74074384_74295305|GENSCAN_predicted_peptide_1|739_aa MQKKSCMSNVTEYVRSDVSPGTVADKELLQGSELPVCASCGQRIYDGQYLQALNADWHAD CFRASHGSLSPVGGRKVRKNGRLGAGPEMPLAGPASRKVMTRLGLVQASLWLTYCLCEVP SRKPPVHNPQGCRIPGSISLAAGCCDCSASLSHQYYEKDGQLFCKKDYWARYGESCHGCS EQITKGLVMVAGELKYHPECFICLTCGTFIGDGDTYTLVEHSKLYCGHCYYQTVVTPVIE QILPDSPGSHLPHTVTLVSIPASSHGKRGLSVSIDPPHGPPGCGTEHSHTVRVQGVDPGC MSPDVKNSIHVGDRILEINGTPIRNVPLDEIDLLIQETSRLLQLTLEHDPHDTLGHGLGP ETSPLSSPAYTPSGEAGSSARQKPVLRSCSIDRSPGAGSLGSPASQRKDLGRSESLRVVC RPHRIFRPSDLIHGEVLGKGCFGQAIKVTHRETGEVMVMKELIRFDEETQRTFLKEVKVM RCLEHPNVLKFIGVLYKDKRLNFITEYIKGGTLRGIIKSMDSQYPWSQRVSFAKDIASGM AYLHSMNIIHRDLNSHNCLVRENKNVVVADFGLARLMVDEKTQPEGLRSLKKPDRKKRYT VVGNPYWMAPEMINGRSYDEKVDVFSFGIVLCEIIGRVNADPDYLPRTMDFGLNVRGFLD RYCPPNCPPSFFPITVRCCDLDPEKRPSFVKLEHWLETLRMHLAGHLPLGPQLEQLDRGF WETYRRGESGLPAHPEVPD >gi568815591f:74074384_74295305|GENSCAN_predicted_CDS_1|2220_bp atgcaaaaaaaatcttgcatgagcaatgtaactgaatatgtcagatctgatgtttcacct ggaactgtggctgacaaggagctcctacaaggaagcgagttgcccgtgtgtgcaagctgc ggccagaggatctatgatggccagtacctccaggccctgaacgcggactggcacgcagac tgcttcagagctagccatgggagcctgagtccagttggaggtaggaaagtcagaaaaaac ggccgcctcggagctggccctgagatgccccttgcaggccctgcctcccggaaggttatg accaggcttggactggtccaggcttccctttggctcacatactgcctctgcgaggtcccc tccaggaagcctcctgtgcacaacccccagggctgccgcatccctggtagcatctccttg gcagctgggtgttgtgactgcagtgcctccctgtcgcaccagtactatgagaaggatggg cagctcttctgcaagaaggactactgggcccgctatggcgagtcctgccatgggtgctct gagcaaatcaccaagggactggttatggtggctggggagctgaagtaccaccccgagtgt ttcatctgcctcacgtgtgggacctttatcggtgacggggacacctacacgctggtggag cactccaagctgtactgcgggcactgctactaccagactgtggtgacccccgtcatcgag cagatcctgcctgactcccctggctcccacctgccccacaccgtcaccctggtgtccatc ccagcctcatctcatggcaagcgtggactttcagtctccattgaccccccgcacggccca ccgggctgtggcaccgagcactcacacaccgtccgcgtccagggagtggatccgggctgc atgagcccagatgtgaagaattccatccacgtcggagaccggatcttggaaatcaatggc acgcccatccgaaatgtgcccctggacgagattgacctgctgattcaggaaaccagccgc ctgctccagctgaccctcgagcatgaccctcacgatacactgggccacgggctggggcct gagaccagccccctgagctctccggcttatactcccagcggggaggcgggcagctctgcc cggcagaaacctgtcttgaggagctgcagcatcgacaggtctccgggcgctggctcactg ggctccccggcctcccagcgcaaggacctgggtcgctctgagtccctccgcgtagtctgc cggccacaccgcatcttccggccgtcggacctcatccacggggaggtgctgggcaagggc tgcttcggccaggctatcaaggtgacacaccgtgagacaggtgaggtgatggtgatgaag gagctgatccggttcgacgaggagacccagaggacgttcctcaaggaggtgaaggtcatg cgatgcctggaacaccccaacgtgctcaagttcatcggggtgctctacaaggacaagagg ctcaacttcatcactgagtacatcaagggcggcacgctccggggcatcatcaagagcatg gacagccagtacccatggagccagagagtgagctttgccaaggacatcgcatcagggatg gcctacctccactccatgaacatcatccaccgagacctcaactcccacaactgcctggtc cgcgagaacaagaatgtggtggtggctgacttcgggctggcgcgtctcatggtggacgag aagactcagcctgagggcctgcggagcctcaagaagccagaccgcaagaagcgctacacc gtggtgggcaacccctactggatggcacctgagatgatcaacggccgcagctatgatgag aaggtggatgtgttctcctttgggatcgtcctgtgcgagatcatcgggcgggtgaacgca gaccctgactacctgccccgcaccatggactttggcctcaacgtgcgaggattcctggac cgctactgccccccaaactgccccccgagcttcttccccatcaccgtgcgctgttgcgat ctggaccccgagaagaggccatcctttgtgaagctggaacactggctggagaccctccgc atgcacctggccggccacctgccactgggcccacagctggagcagctggacagaggtttc tgggagacctaccggcgcggcgagagcggactgcctgcccaccctgaggtccccgactga >gi568815591f:74074384_74295305|GENSCAN_predicted_peptide_2|203_aa MADFDTYDDRAYSSFGGGRGSRGSAGGHGSRSQKELPTEPPYTAYVGNLPFNTVQGDIDA IFKDLSIRSVRLVRDKDTDKFKGCGSKEQEEEGIRAAFSCLQGFLKEVCLLCFRDDFLGG RGGSRPGDRRTGPPMGSRFRDGPPLRGSNMDFREPTEEERAQRPRLQLKPRTVATPLNQV ANPNSAIFGGARPREEVVQKEQE >gi568815591f:74074384_74295305|GENSCAN_predicted_CDS_2|612_bp atggcggacttcgacacctacgacgatcgggcctacagcagcttcggcggcggcagaggg tcccgcggcagtgctggtggccatggttcccgtagccagaaggagttgcccacagagccc ccctacacagcatacgtaggaaatctacctttcaatacggttcagggcgacatagatgct atctttaaggatctcagcataaggagtgtacggctagtcagagacaaagacacagataaa tttaaagggtgcgggagcaaagagcaggaagaagaagggatccgagcagcattctcctgc ctgcagggcttcctgaaggaagtgtgcctgctgtgcttcagggatgacttcttagggggc aggggaggtagtcgcccaggcgaccggcgaacaggcccccccatgggcagccgcttcaga gatggccctcccctccgtggatccaacatggatttcagagaacccacagaagaggaaaga gcacagagaccacgactccagcttaaacctcgaacagtcgcgacgcccctcaatcaagta gccaatcccaactctgctatcttcgggggtgccaggcctagagaggaagtcgttcaaaag gagcaagaatga >gi568815591f:74074384_74295305|GENSCAN_predicted_peptide_3|47_aa MPSALDSGPRFLPTVTEQNCSCHRKQKPNACGHQQNYRREEKLFNSC >gi568815591f:74074384_74295305|GENSCAN_predicted_CDS_3|144_bp atgccaagtgcactggattcaggaccacgtttccttcccacggtaaccgagcagaactgc tcctgccacagaaaacagaagcccaatgcttgcggccatcagcagaattatcggagagag gaaaaacttttcaacagctgctga >gi568815591f:74074384_74295305|GENSCAN_predicted_peptide_4|200_aa MSSGTELLWPGAALLVLLGVAASLCVRCSRPGAKRSEKIYQQRSLKDKLLQFYPSLEGSR HGSEEAYIDPIAMEYYNWGRFSKPPEDDDANSYENVLICKQKTTETGAQQEGIGGLCRGD LSLSLALKTGPTSGLCPSASPEEDEESEDYQNSASIHQWRESRKVMGQLQREASPGPVGS PDEEDGEPDYVNGEVAATEA >gi568815591f:74074384_74295305|GENSCAN_predicted_CDS_4|603_bp atgagctcggggactgaactgctgtggcccggagcagcgctgctggtgctgttgggggtg gcagccagtctgtgtgtgcgctgctcacgcccaggtgcaaagaggtcagagaaaatctac cagcagagaagtctgaaggacaagctgttgcaattctaccccagcctggagggaagcaga cacgggtcggaggaagcctacatagaccccattgccatggagtattacaactgggggcgg ttctcgaagcccccagaagatgatgatgccaattcctacgagaatgtgctcatttgcaag cagaaaaccacagagacaggtgcccagcaggagggcataggtggcctctgcagaggggac ctcagcctgtcactggccctgaagactggccccacttctggtctctgtccctctgcctcc ccggaagaagatgaggaatctgaggattatcagaactcagcatccatccatcagtggcgc gagtccaggaaggtcatggggcaactccagagagaagcatcccctggcccggtgggaagc ccagacgaggaggacggggaaccggattacgtgaatggggaggtggcagccacagaagcc tag >gi568815591f:74074384_74295305|GENSCAN_predicted_peptide_5|361_aa MEVEAVCGGAGEVEAQDSDPAPAFSKAPGSAGHYELPWVEKYRPVKLNEIVGNEDTVSRL EVFAREGNVPNIIIAGPPGTGKTTSILCLARALLGPALKDAMLELNASNDRGIDVVRNKI KMFAQQKVTLPKGRHKIIILDEADSMTDGAQQALRRTMEIYSKTTRFALACNASDKIIEP IQSRCAVLRYTKLTDAQILTRLMNVIEKERVPYTDDGLEAIIFTAQGDMRQALNNLQSTF SGFGFINSENVFKVCDEPHPLLVKEMIQHCVNANIDEAYKILAHLWHLGYSPEDIIGNIF RVCKTFQMAEYLKLEFIKVGNWIHSHENSGRSELSFADGRPPGKAVSEDNGPGGQLEQRL H >gi568815591f:74074384_74295305|GENSCAN_predicted_CDS_5|1086_bp atggaggtggaggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccct gcccctgccttcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaa aaatataggccagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggcta gaggtctttgcaagggaaggaaatgtgcccaacatcatcattgcgggccctccaggaacc ggcaagaccacaagcattctgtgcttggcccgggccctgctgggcccagcactcaaagat gccatgttggaactcaatgcttcaaatgacaggggcattgacgttgtgaggaataaaatt aaaatgtttgctcaacaaaaagtcactcttcccaaaggccgacataagatcatcattctg gatgaagcagacagcatgaccgacggagcccagcaagccttgaggagaaccatggaaatc tactctaaaaccactcgcttcgcccttgcttgtaatgcttcggataagatcatcgagccc attcagtcccgctgtgcagtcctccggtacacaaagctgaccgacgcccagatcctcacc aggctgatgaatgttatcgagaaggagagggtaccctacactgatgacggcctagaagcc atcatcttcacggcccagggagacatgaggcaggcgctgaacaacctgcagtccaccttc tcaggatttggcttcattaacagtgagaacgtgttcaaggtctgtgacgagccccaccca ctgctggtaaaggagatgatccagcactgtgtgaatgccaacattgacgaagcctacaag attcttgctcacttgtggcatctgggctactcaccagaagatatcattggcaacatcttt cgagtgtgtaaaactttccaaatggcagaatacctgaaactggagtttatcaaggtcgga aattggatacactcacatgaaaatagcggaaggagtgaactctcttttgcagatggcagg cctcctggcaaggctgtgtcagaagacaatggccccggtggccagttagagcagagactt cactga >gi568815591f:74074384_74295305|GENSCAN_predicted_peptide_6|159_aa MLRPLPARRCPPAAPPPGAGDPRDRPQLSFVGRPGFAHIHLRAGHLVVLTSRLVKRILQP CYACKDVSGHWEGRQLGKYYSEQQPAVNRPYLAAAQADSCTRNFHGTVSHLQTSSVTIRV GGQQIWGLGSAAFPNAASCDSILPLWQSNESVEDLQTRS >gi568815591f:74074384_74295305|GENSCAN_predicted_CDS_6|480_bp atgctgcggcccctccctgcgcggcgctgcccaccggctgcaccccctcctggcgccggg gatccccgcgataggccgcagctgtcctttgtggggcgtccagggttcgcccacatccac ctccgcgctgggcacctggtggtgcttacatcgcgtttggtgaaacgcatcctccagcca tgctatgcttgcaaagatgtctctggtcattgggaaggacggcagcttgggaagtattat tcagagcaacagcctgcggtcaacagaccttacctggccgctgctcaagctgactcatgc accaggaatttccacgggactgtgtcacacttgcaaacgtcatctgtgactatccgggtg ggaggacagcagatctggggactgggctctgcagccttcccaaatgcagcttcctgtgat tccattcttccactttggcagtcgaatgagtcagtggaagatttgcaaacacggagctaa