GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:26:50 Sequence gi568815587f:125251306_125531389 : 280084 bp : 47.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3315 3390 76 1 1 72 109 49 0.472 3.97 1.02 Intr + 3449 3500 52 2 1 115 68 34 0.196 3.01 1.03 Intr + 12016 12208 193 0 1 89 68 89 0.280 6.07 1.04 Term + 17711 17808 98 0 2 93 48 44 0.164 -0.97 1.05 PlyA + 17811 17816 6 1.05 2.14 PlyA - 18023 18018 6 1.05 2.13 Term - 21638 21572 67 1 1 107 47 113 0.988 6.51 2.12 Intr - 22696 22576 121 2 1 66 32 138 0.617 5.55 2.11 Intr - 27062 26960 103 2 1 100 57 54 0.006 3.25 2.10 Intr - 48553 48433 121 0 1 72 94 30 0.200 2.50 2.09 Intr - 57349 57069 281 1 2 61 59 172 0.703 7.88 2.08 Intr - 61173 61120 54 0 0 45 81 96 0.700 3.78 2.07 Intr - 61491 61379 113 1 2 49 90 49 0.574 1.30 2.06 Intr - 62723 62624 100 2 1 62 115 -4 0.596 -0.62 2.05 Intr - 63037 62869 169 0 1 75 67 92 0.102 5.75 2.04 Intr - 68170 68091 80 1 2 82 78 28 0.028 -0.45 2.03 Intr - 68518 68410 109 0 1 120 115 -22 0.027 3.99 2.02 Intr - 77005 76879 127 2 1 146 32 36 0.129 3.54 2.01 Init - 79723 79624 100 1 1 90 84 55 0.885 3.83 2.00 Prom - 84731 84692 40 -5.46 3.00 Prom + 85308 85347 40 -8.26 3.01 Sngl + 85867 86055 189 0 0 78 33 194 0.431 8.01 3.02 PlyA + 86081 86086 6 1.05 4.00 Prom + 88890 88929 40 -6.06 4.01 Init + 90543 90630 88 2 1 94 65 52 0.702 4.24 4.02 Intr + 100023 100087 65 1 2 54 109 69 0.029 4.04 4.03 Intr + 111652 111804 153 0 0 108 -20 91 0.198 0.57 4.04 Intr + 115595 115702 108 1 0 -3 26 154 0.553 0.58 4.05 Intr + 116541 116680 140 2 2 153 91 117 0.953 17.76 4.06 Intr + 134246 134417 172 2 1 79 63 308 0.995 27.35 4.07 Intr + 146569 146757 189 0 0 128 92 253 0.997 29.48 4.08 Intr + 158891 159020 130 1 1 73 67 99 0.863 6.57 4.09 Intr + 159474 159571 98 1 2 84 109 34 0.973 4.83 4.10 Intr + 160381 160560 180 0 0 50 101 102 0.777 7.76 4.11 Intr + 162137 162256 120 1 0 39 52 89 0.055 1.09 4.12 Intr + 177707 177783 77 1 2 89 105 85 0.595 8.61 4.13 Intr + 178658 178836 179 2 2 91 95 214 0.991 21.96 4.14 Term + 179861 180087 227 0 2 138 38 484 0.995 45.34 4.15 PlyA + 186146 186151 6 1.05 5.10 PlyA - 186453 186448 6 1.05 5.09 Term - 188732 188653 80 0 2 26 45 111 0.066 -1.47 5.08 Intr - 197262 197197 66 1 0 83 116 84 0.784 9.58 5.07 Intr - 201104 201029 76 2 1 99 79 106 0.999 9.89 5.06 Intr - 202905 202825 81 0 0 112 101 131 0.997 16.63 5.05 Intr - 204801 204530 272 1 2 62 100 458 0.792 41.66 5.04 Intr - 209361 209193 169 0 1 76 100 242 0.992 23.72 5.03 Intr - 212265 212179 87 0 0 83 75 159 0.950 14.27 5.02 Intr - 230328 230229 100 2 1 103 67 123 0.417 11.81 5.01 Init - 238472 238162 311 2 2 85 46 508 0.365 43.09 5.00 Prom - 240824 240785 40 -7.06 6.03 PlyA - 241081 241076 6 1.05 6.02 Term - 244282 244028 255 1 0 102 48 79 0.650 0.89 6.01 Init - 245082 244816 267 0 0 70 55 215 0.921 11.38 6.00 Prom - 246865 246826 40 -4.76 7.03 PlyA - 247704 247699 6 1.05 7.02 Term - 268176 268015 162 0 0 78 38 108 0.804 2.74 7.01 Init - 278683 278597 87 1 0 70 93 74 0.528 6.74 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 23384 23300 85 2 1 88 33 118 0.826 5.28 S.002 Init - 214443 214438 6 0 0 108 59 0 0.804 -0.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:125251306_125531389|GENSCAN_predicted_peptide_1|139_aa XCVGDHNRIVTLPVIRAGSKLSPRCRAEPCSHLTFCNIDDGVQQWGKPHEEERRGSKWRG SAGGPSLGKQQGLLATAPAPTPAVTAAVQLLASLLGERAKPLRGQHCDAAKGGPWNSDHK VSHGRKKDERDGNSCKFLI >gi568815587f:125251306_125531389|GENSCAN_predicted_CDS_1|420_bp ngttgtgtaggggatcacaaccgcattgtgacactacctgtcatccgagctgggtcaaag ctgagccccaggtgcagagctgagccctgttcccacctcaccttctgtaacattgatgac ggtgtccagcagtgggggaagccgcatgaggaggagcggcgtgggagcaaatggagggga tctgctggcgggcctagtttaggcaaacagcagggactgctggcgacggctccagccccc actcccgcggtaacagctgcagtccagctgctggcatccctgctgggggagcgtgctaaa cctctccggggtcagcactgtgatgctgccaaaggtggtccctggaacagtgaccacaaa gtgtctcatgggaggaagaaggatgaaagagatgggaattcttgtaaatttttaatctga >gi568815587f:125251306_125531389|GENSCAN_predicted_peptide_2|514_aa MAGSGEGAALRPGRHFQSSIGRTAGQEGLPYARGVYSLVGKIEGKEIIATYNVCVMLLLT VSLQIPQWPEQGSGPGAFAQATHVPGRCSFSCFAFTSQLKSHFLRGATVTSQILHFLQIE GSWQHRIEQVYQYNFSNSRTQSRSKLAMVTLRASDVAVRAPHCRCSEITEIPQVPLFGDS RRADHDTNPKFSNWKYNNGQCLLGVTCVPGIVPSALHTELTGLHSILCVISASVLPSPEH TCMVLLSPLNWHYGHVRLLHSMELLEQGDFPQNLVLSVNSDGFELLVIMKLLSGEDISDW GKASPESCEHIVEVAISFAGVINLALHSGELLLGEIMVAMTCGIWKRASIFSCASPDMSG GMQLPLSLPQSMSRTEQSFWEAPRSLLAQNRGSGKQSSPVGGCEGIQPYPQRSPSPNRGK PRLPYEDLFPQKPFLLGTALSLMSHILQKWEMGSYVFGPSLASLTISKKILILLVPAHMT IRVHVSRKISFQRPFSLRFGQTRQSVLNAASVTA >gi568815587f:125251306_125531389|GENSCAN_predicted_CDS_2|1545_bp atggcagggtcgggggagggggctgctctgaggcctgggcggcactttcagagcagcatc gggagaaccgctggccaggagggcctgccctacgccagaggtgtttacagtctagtgggc aaaatagaaggtaaagaaataattgccacctacaatgtgtgtgtgatgctgcttctgact gtcagcctccagatcccacaatggccagagcagggctcagggccaggggcctttgcacaa gctacccatgttcctggaaggtgctctttctcctgctttgctttcacatctcaactcaag agtcacttcctcaggggagctacagtgacctctcagatattgcattttttacaaattgaa ggttcatggcaacaccgcattgagcaagtttatcagtacaatttttccaacagcaggacc cagagcagatccaaactggccatggtgacgttgcgagcaagtgatgtcgcagtgagggca ccgcactgcaggtgctctgagatcactgaaatcccccaggtgcctctctttggtgattca cgacgagctgaccatgacaccaaccccaaattctccaactggaagtataataatggccag tgcttactgggggtcacttgtgtgcctggcattgtgcccagtgctttacacacagaactc acagggctgcactccatcctgtgcgtaatttctgcctctgtgctcccatcacctgagcac acctgtatggtgctgctgagcccgctgaattggcattatgggcacgtgcgtctcctccat tccatggagctcctcgagcagggtgacttcccacagaacctggtgctcagcgtgaactca gacggctttgaactcttggttattatgaaactcctgagtggggaagatatttctgattgg ggcaaagcctcccccgagtcctgtgaacacattgtggaggtcgccatctcctttgctggt gtaataaatctggccctgcattccggggagctgctcttgggagagatcatggtggccatg acatgtggcatttggaaaagagccagcatcttctcttgtgcttctcctgacatgtctgga ggaatgcagctcccgctgagtcttccccagagcatgagcaggactgagcagagcttttgg gaagccccacggtcgcttctggctcagaaccgaggctcaggaaagcagagcagcccggtg gggggatgcgaggggatccagccatatccccaacgctcaccctctccaaatcgggggaaa cccagacttccctacgaggacctgtttcctcagaagccctttctgttggggacagctttg tctcttatgtctcatatacttcagaagtgggagatgggaagctacgtctttggccccagc ctggcctcactcacaatatccaagaagatcctgattctgctggttcctgctcacatgacc atccgtgtccatgtgtctcggaagataagcttccaaaggccattctccctgcgcttcggc cagacccgacagtcagtgctcaatgctgcttctgtcactgcctga >gi568815587f:125251306_125531389|GENSCAN_predicted_peptide_3|62_aa MAIYSTTELSSVNGLWKGRYNVAAVEEPSEPCGVMTGALSSHSFLFSVLRLFGFLRLLGY NL >gi568815587f:125251306_125531389|GENSCAN_predicted_CDS_3|189_bp atggcaatatattcaaccacagaactaagcagtgtgaatggcctgtggaagggtcgttac aacgtggctgctgtggaagaaccttctgagccctgtggtgtcatgactggtgccctgagc agccactctttcttgttcagtgttctccgcctcttcggatttcttcgtttgctgggctac aacctctag >gi568815587f:125251306_125531389|GENSCAN_predicted_peptide_4|641_aa MAVIFPSPGLAFLKLRNTEPVYPPSFQGGAPALTMMATQNVPPPPYQDSPQNGKRRNLQT KNLGRVSLAASLRQNWFWLDGRQKSSGYSIYERDVHGGDGPRLEAGDEGVTKASEEKTDK YMKPNTTCGESYHRAMDRMTATAQPPSKAQAVHISAPSAAASTPVPSAPIDPQAQLEADK RAVYRHPLFPLLTLLFEKCEQATQGSECITSASFDVDIENFVHQQEQEHKPFFSDDPELD NLMVKAIQVLRIHLLELEKVNELCKDFCNRYITCLKTKMHSDNLLRNDLGGPYSPNQPSI NLHSQDLLQNSPNSMSGVSNNPQGIVVPASALQQGNIAMTTVNSQVVSGGALYQPVTMVT SQGQVVTQAIPQGAIQIQNTQVPSRCPHRLLMLIFSPRALKVNLDLTSLLDNEDKKSKNK RGVLPKHATNIMRSWLFQHLMLPAYLWEEGCIFCALTWHNQNESTTRSQLLVELSLAALI VHPYPTEDEKRQIAAQTNLTLLQVNNWFINARRRILQPMLDASNPDPAPKAKKIKSQHRP TQRFWPNSIAAGVLQQQGGAPGTNPDGSINLDNLQSLSSDSATMAMQQAMMAAHDDSLDG TEEEDEDEMEEEEEEELEEEVDELQTTNVSDLGLEHSDSLE >gi568815587f:125251306_125531389|GENSCAN_predicted_CDS_4|1926_bp atggctgtcatcttcccttccccagggctggctttcctgaagctcaggaacacagagcca gtgtacccgccctccttccaagggggtgcccccgctctgacgatgatggccacgcagaat gtcccgcccccaccctaccaggacagcccacagaatgggaaaagaaggaaccttcaaacc aagaatctgggcagggtttccttagctgcgagcttaaggcagaactggttctggctagat ggcagacagaagtccagtggctatagcatctacgagagggacgtgcatggaggtgatggc cccaggttggaagctggtgatgaaggagtgaccaaggcctcagaggagaagacagataag tacatgaagcccaacaccacatgtggggagagctaccatcgtgctatggacaggatgacg gcaaccgcccagccaccctccaaggcccaggctgtccacatctctgccccctcagctgct gccagcacacctgtgcccagtgcccccatcgacccccaggcccagctggaggctgacaag cgagctgtatacaggcaccctcttttcccgctcctgacgctgctgtttgagaaatgtgaa caggccacccagggctctgagtgcatcacctccgccagctttgatgtggacatcgagaac tttgtccaccagcaggaacaggagcacaaacccttcttcagcgatgacccagaactggac aatctgatggtgaaggcaatccaggtcctgagaatccacctgctggagctggagaaagtc aatgaactctgcaaggacttttgtaaccgttacatcacctgcctcaaaaccaagatgcac agcgacaacctgctcaggaatgatctaggggggccctactcccccaaccagccctccatc aaccttcactcacaggacctcctgcagaattcccccaattccatgtccggagtctccaat aacccccaggggattgtggtcccagcctcagcgctccagcagggcaacatcgccatgaca accgtcaactcacaagttgtgtcaggtggagccttataccaaccggttaccatggtaacc tcccagggtcaggtggtcacccaagcaatcccccagggagccatccagatccagaacaca caggtccccagccgctgccctcacaggctgctgatgctgattttctctccacgtgccctg aaggttaaccttgacctcacctccctcctggacaatgaggataagaagtccaagaacaaa cgaggagtcttgcccaagcatgccaccaatataatgcgttcttggctcttccagcatctc atgctcccagcctacctgtgggaggaaggatgcatcttctgtgcactgacctggcacaac caaaatgaaagcaccacccgctcccaactgctcgtggagctaagtctggctgcgctcatt gtgcacccctaccccacggaggatgagaagaggcagatcgcagcccagaccaacctcacc ctcctgcaagtaaacaactggttcatcaatgcccggaggcgcatcctgcagcccatgctt gatgccagcaacccagatcctgcccccaaagccaagaagatcaagtctcagcaccggccc acccaaagattctggcccaactccatcgctgcgggggtgctgcagcagcagggcggtgcc ccagggacaaaccccgatggttccatcaacttggacaacctgcagtccctgtcctcagac agtgccaccatggccatgcagcaggctatgatggctgcacacgatgactcattggatggg acagaagaagaggatgaggatgagatggaagaggaggaggaggaggagctggaggaggag gtcgacgagctgcagacgacaaatgtcagcgacctgggcttggaacacagtgactccctg gagtag >gi568815587f:125251306_125531389|GENSCAN_predicted_peptide_5|413_aa MEAPLVSLDEEFEDLRPSCSEDPEEKPQCFYGSSPHHLEDPSLSELENFSSEIISFKSME DLVNEFDEKLNVCFRNYNAKTENLAPVKNQLQIQEEEETLQDEEVWDALTDNYIPSLSED WRDPNIEALNGNCSDTEIHEKEEEEFNEKSENDSGINEEPLLTADQVIEEIEEMMQNSPD PEEEEEVLEEEDGGETSSQADSVLLQEMQALTQTFNNNWSYEGLRHMSGSELTELLDQVE GAIRDFSEELVQQLARRDELEFEKEVKNSFITVLIEVQNKQKEQRELMKKRRKEKGLSLQ SSRIEKGNQMPLKRFSMEGISNILQSGIRQTFGSSGTDKQYLNTVIPYEKKASPPSVEDL QMLTNILFAMKEDNEKVPTLLTDYILKVVAQGALVLADSGTSGLQVHLDKKTK >gi568815587f:125251306_125531389|GENSCAN_predicted_CDS_5|1242_bp atggaggccccactggtgagtctggatgaagagtttgaggaccttcgaccctcctgctcg gaggacccggaggagaagccccagtgtttctatggttcatctccccaccatctcgaggac ccctccctctccgagcttgagaatttttcttccgaaataatcagcttcaagtccatggag gacctcgtaaatgaatttgatgagaagctcaatgtctgctttcggaactacaacgccaag accgagaacctagctcccgtgaagaaccagttacagatccaagaggaggaggagaccctt caggacgaggaggtttgggatgctctgacagacaattacatcccttcactctcagaagac tggagggatccaaacatcgaggctctgaatggcaactgctctgacactgagatccatgag aaagaagaggaagagttcaatgagaagagtgaaaatgattccggtatcaacgaggagcct ctgctcacagcagatcaggtaattgaggagattgaggaaatgatgcagaactccccagac cctgaggaagaagaggaggttctggaagaagaggatggaggagaaacttcctcccaggca gactcggtcctcctgcaggagatgcaggcattgacacagaccttcaacaacaactggtcc tatgaagggctgaggcacatgtctgggtctgagctgaccgagctgctggaccaggtggag ggtgccatccgtgacttctcggaggagctggtgcagcagctggcccgccgggacgagctg gagtttgagaaggaagtgaagaactcctttatcacggtgcttattgaggttcagaacaag cagaaggagcagcgagaactgatgaaaaagaggcggaaagagaaagggctgagcctgcag agcagccggatagagaagggaaaccagatgcctctcaagcgcttcagcatggaaggcatc tccaacattctgcagagtggcatccgccagacctttggctcctcaggaactgacaaacag tatctgaacacagtcattccttacgagaagaaagcctctcctccctcagtggaagacctg cagatgctgacaaacattctctttgccatgaaggaggataatgagaaggtgcctactttg ctaacggactacattttaaaagtggtagcccagggagccctggtgcttgcggattctgga actagcggcctccaagtacatctggacaagaagacaaagtga >gi568815587f:125251306_125531389|GENSCAN_predicted_peptide_6|173_aa MTSSSRACQLGGCPSAGAPPEPGHERGEGVGWRRERSRAPPELRGERSLAGWLAGSPLRC AALRPLPRPVDGGVRLSPGIRLPPPGPAQSAEPSPDRAFLQCQNRWQRPQTPGERLFYYR RAGEGTGAAARWRGALPSGGPRALECGEQCLTASRRYHSPLFFPFAYPFSPRR >gi568815587f:125251306_125531389|GENSCAN_predicted_CDS_6|522_bp atgacgtccagctcccgcgcgtgccagttgggtgggtgcccgagcgcgggcgcgcccccg gagcccgggcacgagcgcggggaaggggtggggtggaggcgggagcggagccgggcgccg ccggagctgcgcggggagcgctcgctggctggctggctggctggctctccgctgcgctgc gctgcgctccggccgctcccgcggcccgtggatgggggtgtccggctgagccccgggatc cgcctccctccgccaggacccgcacagagtgcggagccttccccagaccgcgcattcctg cagtgtcagaaccgctggcagcggccgcagacgccgggagagcgattgttttactacaga cgggcgggagaagggaccggggcggctgcgcggtggcggggcgcgctgccgtcgggtggg ccccgggctctggaatgcggggagcagtgtttgacggcgtcgaggaggtatcacagcccg ctcttcttcccttttgcctatcccttctctcctcgccgctag >gi568815587f:125251306_125531389|GENSCAN_predicted_peptide_7|82_aa MTTTKTKQKMTSIDMDVEKLEPMYIAGGKYPYPVPIKRLAGRATQVADASGWGCKLLSTG GCKLLSTGGCKRLMQTATFIAQ >gi568815587f:125251306_125531389|GENSCAN_predicted_CDS_7|249_bp atgacaacaacaaaaacaaaacagaaaatgacaagcattgacatggatgtggagaaattg gaacccatgtacattgctggtgggaagtacccctatcctgtgcccataaaaagactagct ggcagagcaacacaagtggctgatgcaagtggttggggatgcaagctgctgagcactggg ggatgcaagctgctgagcactgggggatgcaagcggctgatgcaaactgccacctttatc gcccagtaa