GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:33:53

Sequence gi568815586r:42578873_42779124 : 200252 bp : 40.84% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  10287  10502  216  1  0   38   44   300 0.322  18.15
 1.02 Intr +  11476  11657  182  2  2  103    5   171 0.037   8.97
 1.03 Intr +  12114  12276  163  2  1   66   20   142 0.069   3.83
 1.04 Intr +  23141  23169   29  0  2   93   97   -10 0.001  -2.68
 1.05 Intr +  29316  29424  109  2  1  -34  131   125 0.071   3.54
 1.06 Intr +  31958  32157  200  0  2   49   66   123 0.625   4.35
 1.07 Intr +  35287  35403  117  0  0   63   54   108 0.420   4.64
 1.08 Term +  48166  48297  132  0  0   69   44   144 0.162   5.21
 1.09 PlyA +  49056  49061    6                               1.05

 2.04 PlyA -  50376  50371    6                               1.05
 2.03 Term -  58296  57874  423  0  0   43   31   317 0.508  15.81
 2.02 Intr -  62044  61964   81  1  0   79   50    71 0.373   1.32
 2.01 Init -  64304  64110  195  2  0   27   68   138 0.407   4.68
 2.00 Prom -  74980  74941   40                              -4.75

 3.03 PlyA -  75264  75259    6                              -0.45
 3.02 Term -  76529  75730  800  0  2   66   54   367 0.903  23.53
 3.01 Init -  78940  78886   55  1  1   60  121    18 0.412   3.80
 3.00 Prom -  97903  97864   40                              -4.75

 4.00 Prom + 102354 102393   40                              -3.75
 4.01 Init + 120185 120255   71  1  2   70  108    53 0.103   6.07
 4.02 Intr + 129860 130307  448  2  1   45   36   215 0.411   4.42
 4.03 Intr + 130414 130672  259  0  1   78    3   211 0.317   7.71
 4.04 Term + 132105 132574  470  1  2   30   48   182 0.530   2.55
 4.05 PlyA + 132661 132666    6                               1.05

 5.03 PlyA - 135092 135087    6                               1.05
 5.02 Term - 139378 139211  168  1  0   49   44   130 0.297   1.60
 5.01 Init - 143711 143532  180  2  0   70   41   135 0.318   6.24
 5.00 Prom - 144659 144620   40                              -6.15

 6.02 PlyA - 144828 144823    6                               1.05
 6.01 Sngl - 146058 145660  399  0  0   88   44   395 0.977  31.11
 6.00 Prom - 151749 151710   40                              -5.85

 7.04 PlyA - 152053 152048    6                               1.05
 7.03 Term - 155050 154967   84  1  0   51   41    99 0.353  -1.83
 7.02 Intr - 155896 155787  110  2  2   34   86   121 0.914   5.58
 7.01 Init - 157221 157113  109  0  1   34  105   115 0.807   8.23
 7.00 Prom - 174187 174148   40                              -2.85

 8.03 PlyA - 175049 175044    6                               1.05
 8.02 Term - 197677 197490  188  2  2   76   44   216 0.969  12.67
 8.01 Intr - 198417 198394   24  1  0  135   93     7 0.866   2.98

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  11476  11696  221  2  2  103   42   207 0.903  13.82

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:42578873_42779124|GENSCAN_predicted_peptide_1|382_aa
XPVSTPEKTKGRVGCDAANRSHAESLCGALAMLQPISAFPVYTAQQTDYDLLPDLLRHRP
SLGVTDSGGASEGWSESLPRNRRSDLSEQEAKVQPLVRSLPRLDGPGRLHENSVWGIVPA
KTSAHRNVQSLTQKNQNCQSQNFSEHQVFSIFVGNKAKEYSAKMEPMKRECKEIKANGGG
NCLNPGKCTVLSRCQDKIKNPLLGWSGSGPLSGNTTIKENGSGAFGELFGKKSARHIGSP
SPHQTQEPSWLHPVDPAPGPQVELPASPAPCARTPQPLGGRWDWAPWSRGSTRQGGSGRT
MGTLQNEDPTPHWGAEAYMPSQDDRKNGDLVHGQKQVVMVALMMAKVPGVTQRCSAQWWE
QGSQLGLHLLRSLPSCEGALGG

>gi568815586r:42578873_42779124|GENSCAN_predicted_CDS_1|1149_bp
naacctgtttccacgccggagaaaaccaagggaagagttggttgtgatgcagcaaacagg
agccatgcggagagcctctgcggagccctggcgatgctgcagccaatatctgctttccct
gtctacacagcccaacaaaccgattatgaccttctccctgatcttctgcgacaccgaccc
agtctgggtgtcacagactccggcggtgccagcgaaggctggtctgagtctctgccccgg
aaccggaggagtgatctttcagagcaagaagccaaagttcagcctctagtccgttccctt
ccccgcctcgacggccctggtcgtctccatgaaaactcagtgtgggggatcgtgcccgcg
aaaacctcagcccaccgcaatgtccagagcctgacccaaaaaaatcaaaactgccagtct
cagaatttctcagaacatcaagtcttttcgattttcgttgggaataaagcaaaggaatac
tcagcaaaaatggagccgatgaaaagagaatgcaaggaaataaaggcaaatggaggagga
aactgtctaaatccaggaaaatgcactgtactttctagatgccaagataagatcaagaat
cctctcttggggtggtctggatcgggacccctttctggtaacactaccattaaagaaaat
ggttctggagcctttggagaactctttggcaagaagtccgctagacatataggttctcca
agtccccaccagactcaggagcccagctggcttcacccagtggatcctgcaccagggccg
caggtggagctgcctgccagtcctgccccgtgtgctcgcactcctcagcccttgggtggt
cgatgggactgggcaccgtggagcaggggcagcactcgtcagggaggctctggccgcacg
atgggaacacttcagaatgaagacccaactccccactggggtgcagaagcttatatgcca
tctcaagatgacagaaagaatggggacttggtgcatggccaaaaacaagttgtgatggtt
gccttgatgatggcgaaggtgcctggtgtaactcagcgctgctctgctcagtggtgggag
cagggttctcagttggggcttcacctgcttcggtctctgccatcttgtgaaggagcctta
ggaggatag

>gi568815586r:42578873_42779124|GENSCAN_predicted_peptide_2|232_aa
MDSEGKDLKDKCGQEEKTKMELEENERKSSSINGELKKDFAIAQTVKQRQQKQYGKQKVS
ILFSVECIIMFSRLIIQQEVFLSSPEMNSEEMCTRLASDGAMSLTSSSSGRVEWMAAVTV
AAGTAAIGYLAYKRFYVKDHRNKAVINLHIQKDNPKTVHAFDMEDLGDNAVYCRFWRSKN
SHSVMGLTQNTTKRLESTWDLLSSRKKKLKWTVLILVVKLPDCLIRMTTTSG

>gi568815586r:42578873_42779124|GENSCAN_predicted_CDS_2|699_bp
atggattcagagggaaaagatctaaaggataagtgtgggcaagaggaaaagacaaaaatg
gaactagaagaaaatgagagaaagtcttcatcaataaatggagaattaaagaaagacttc
gctattgctcaaacagttaaacagaggcaacaaaaacaatatggaaaacaaaaagtctct
atcttattctcagtggaatgcatcatcatgttcagtaggctaatcatacagcaagaagtc
ttccttagttcaccagagatgaattcagaagaaatgtgcacacgccttgcaagcgacggc
gccatgagtctgacttccagttccagcggacgagttgaatggatggcagcagttaccgtt
gctgctgggacagctgcaattggttatctagcttacaaaagattttatgttaaagatcat
cgaaataaagctgtgataaaccttcacatccagaaagacaaccccaagacagtacatgct
tttgacatggaggatttgggagataatgctgtgtactgccgtttctggaggtccaaaaat
tcccattctgtgatgggtctcacacaaaacacaacgaagagactggagtcaacgtgggac
ctcttatcatcaagaaaaaagaaacttaagtggacagttttgatacttgtcgtgaaatta
cctgattgtttaattagaatgactaccacctctggctaa

>gi568815586r:42578873_42779124|GENSCAN_predicted_peptide_3|284_aa
MKEKMLRAAREKGQVTHKVLEVLARAIRQEKEIKGIQVGKEEVKLSLFADDVIVYLENPI
ISAQNLLKLIGNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYIGIQ
LTRDVKDLFKENYKTLLNEIKEDTNKWKNIPCSWEGRINIVKVAILPKGIYRFNAIRIKL
PMTLFTELEKTTLKFIWNQKRACIAKSILSQKNKAGGITLPDFKLYYKATVTKTTWYWYQ
NRDIDHCNRTQPSEIMLHIYNYLISDKPDKNKQWGKDFLFNKWC

>gi568815586r:42578873_42779124|GENSCAN_predicted_CDS_3|855_bp
atgaaggaaaaaatgttaagggcagccagagagaaaggtcaggttacccacaaagtgttg
gaagttctggccagggcaatcaggcaggagaaagaaataaagggtattcaagtaggaaaa
gaggaagtcaaattgtccctgtttgcagatgacgtgattgtgtatctagaaaaccccatc
atctcagcccaaaatctccttaaactgataggcaacttcagcaaagtctcaggatacaaa
atcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaa
atcatgagtgaactcccattcacaattgcttcaaagagaataaaatacataggaatccaa
cttacaagggatgtgaaggacctcttcaaggagaactacaaaacactgctcaatgaaata
aaagaggatacaaacaaatggaagaacattccatgctcatgggaaggaagaatcaatatc
gtgaaagtggccatactgcccaagggaatttatagattcaatgccatccgcatcaagcta
ccaatgactctcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaa
agagcctgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgcta
cctgacttcaaactatactacaaggctacagtaaccaaaacaacatggtactggtaccaa
aacagagatatagatcattgtaacagaacacagccctcagaaataatgctgcatatctac
aactatctgatctctgacaaacctgacaaaaacaagcaatggggaaaggatttcctattt
aataaatggtgttga

>gi568815586r:42578873_42779124|GENSCAN_predicted_peptide_4|415_aa
MQSTWYRGQNIPDLAVRELKDMGKAPGGRGSCGHSFSRLKRSCLLALKRAAALPAQCSSS
AKGQTASSSGSLTPLPPDWETPLSRGQQTPHTGELWLASGRCPSGTRLPEEGAGSNLCCS
AAFAGDTQENRVWSGPPADLQQRGLTVRRKTNKQQGIASTLTKRMPTQKPHLKDHNVSPA
REQNWMENEFDELTEVGFRRWVITNSSELKEHVLTQSKEAKNLEKRLEELLTRITSLEKN
INDLMELKNTAREPGEAYKTPHRTYSKIDHIIGSKTLLSKCKITEIITDSFSNHSTFKLE
LRITKLTQNRTTTWKLNNLLLNDYWVNNIKAKINKFFETNENTDTMYQNLWDTDKAVSRG
KFIALNAHRRKRERSKIDTLTSQLKELEKQEQMNSKASRRQEITKIRAELKEIET

>gi568815586r:42578873_42779124|GENSCAN_predicted_CDS_4|1248_bp
atgcagagcacatggtataggggtcagaatattccagacctggccgtgagagagctcaaa
gacatgggcaaagcacctgggggaaggggcagctgtgggcatagcttcagcagacttaaa
cgttcatgcctgctggctctgaagagagcagcggctctcccagcacagtgttcgagctct
gctaagggacagactgcctcctcaagtgggtccctgaccccactgcctcctgactgggag
acacctctcagcaggggtcaacagacacctcatacaggagagctctggctggcatctggc
aggtgcccctctgggacaaggcttccagaggaaggagcaggcagcaatctttgctgttct
gcagccttcgctggtgatacccaggaaaacagggtctggagtggacctccagcagacctg
cagcagaggggcctgactgttagaagaaaaactaacaagcagcaaggaatagcatcaaca
ttaacaaaaaggatgcccacacagaaaccccacctgaaggatcacaacgtctcaccagca
agggaacaaaactggatggagaatgagtttgacgaattgacagaagtaggcttcagaagg
tgggtaataacaaactcctccgagctaaaggagcatgttctaacccaaagcaaggaagct
aagaaccttgaaaaaaggttagaggaattgctaactagaataaccagtttagagaagaac
ataaatgacctgatggagctgaaaaacacagcacgagaacctggtgaagcatacaaaaca
ccacatcgcacttattctaaaattgaccacataattggaagtaaaacactcctcagcaaa
tgcaaaataacagaaatcataacagacagtttctcaaaccacagtacattcaaattagaa
ctcaggattacaaaactcactcaaaaccgcacaactacatggaaactgaacaacctgctc
ctgaatgactactgggtaaataacattaaggcaaaaataaataagttctttgaaaccaat
gagaacacagacacaatgtaccagaatctctgggacacagataaagcagtgtccagaggg
aaatttatagcactaaatgcccacaggagaaagcgagaaagatctaaaattgacacccta
acatcacaattaaaagaactagagaagcaagagcaaatgaattcaaaagctagcagaaga
caagaaataactaagatcagagcagaactgaaggagatagagacatga

>gi568815586r:42578873_42779124|GENSCAN_predicted_peptide_5|115_aa
MDKFLDTYTLPRLNQEEVESLNRPIIGSEIEAIINSLPTKKSPGPDGLTAKFYQRYKEEL
VQQYKKDERENVHLSRELQGKQCLGKSKEVKRRFSEELRFWEIMLTIDLKIPKGQ

>gi568815586r:42578873_42779124|GENSCAN_predicted_CDS_5|348_bp
atggataaattcctggacacatacaccctcccaagactaaaccaggaagaagttgaatct
ctgaatagaccaataataggctctgaaattgaggcaataattaatagcttaccaaccaaa
aagagtccaggaccagatggactcacagccaaattctaccagaggtacaaggaagaactg
gtccagcagtacaagaaggatgagagagaaaatgtccatctttcaagagagctgcaaggg
aagcagtgtctggggaagagcaaggaagtgaagagaagattcagtgaagagttgaggttt
tgggagattatgctgacaattgaccttaaaattccaaagggccagtga

>gi568815586r:42578873_42779124|GENSCAN_predicted_peptide_6|132_aa
MGEKQSRKAGNSKNQSTSPPPKERSSSPATEQSWTENDFDKSREEGFRQLNYSELKEEVR
THGKEVKNLENKLDKWLTRIINVEKSLKDLMELKTMARELHDECTSLSSRVHQVEEGVSV
MEDQMSEMKQEV

>gi568815586r:42578873_42779124|GENSCAN_predicted_CDS_6|399_bp
atgggggaaaaacagagcagaaaagctggaaactctaaaaatcagagcacctctcctcct
ccaaaggaacgcagctcctcaccagcaacggaacaaagctggacagagaatgactttgac
aagtcgagagaagaaggcttcagacaattgaactactctgagctaaaggaagaagttcga
acccacggcaaagaagtgaaaaaccttgaaaacaaattagacaaatggctaactagaata
atcaatgtagagaagtccttaaaggacctgatggagctgaaaaccatggcacgagagcta
catgatgaatgcacaagcctcagtagccgagtccatcaagtggaagaaggggtttcagtg
atggaagatcaaatgagtgaaatgaagcaagaagtttag

>gi568815586r:42578873_42779124|GENSCAN_predicted_peptide_7|100_aa
MRTNQGYGIVAVDSCWKHLLVSQRGMNINLEGCECSESEGNAARSSPDISSNPSRLAQAE
RVKHSKSRSAKNRTIAHALTPFIRDAEGQGVKDLEPDLMV

>gi568815586r:42578873_42779124|GENSCAN_predicted_CDS_7|303_bp
atgcgtacaaaccaaggctacggcatcgtggctgtagacagttgttggaagcatctgctc
gtatctcaaagaggaatgaatatcaatcttgaagggtgtgagtgttctgagagtgaaggg
aatgctgccaggtcttcccctgacatatcatccaatccaagccgactagcccaagcagaa
cgggtcaaacattccaaaagtagatctgccaaaaacaggaccattgcacacgcattaact
ccttttattagggatgcagaagggcaaggagtcaaagatttggaaccagacctcatggtg
tga

>gi568815586r:42578873_42779124|GENSCAN_predicted_peptide_8|70_aa
XRVKLVISGWFASSDTYRDFESRTFTGHEESTKEMEKRSSNHEQKEGFLRLLHHGCDPFI
RDMAYTTKAL

>gi568815586r:42578873_42779124|GENSCAN_predicted_CDS_8|213_bp
ngtcgagtgaaacttgtgatctcaggttggtttgcctcttctgatacctacagagacttt
gagtcccggacgttcacaggacatgaggaaagcacaaaagagatggaaaaacgatcaagt
aaccatgaacaaaaggaggggttcctccgcctgcttcatcatggctgtgacccgtttatc
agggatatggcttacacaactaaagctctatag