GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:52:15 Sequence gi568815579f:29712688_29923774 : 211087 bp : 49.76% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2949 3095 147 1 0 96 47 73 0.330 4.21 1.02 Intr + 12233 12391 159 0 0 83 116 0 0.236 2.26 1.03 Intr + 15464 15553 90 0 0 76 61 47 0.255 0.77 1.04 Intr + 15987 16125 139 1 1 103 53 51 0.117 2.62 1.05 Intr + 22288 22312 25 1 1 71 111 14 0.027 0.03 1.06 Intr + 26691 26797 107 2 2 6 72 128 0.263 2.01 1.07 Intr + 28380 28450 71 0 2 49 77 51 0.832 -1.07 1.08 Term + 28636 28751 116 2 2 95 53 76 0.904 3.53 1.09 PlyA + 29679 29684 6 1.05 2.00 Prom + 40978 41017 40 -5.36 2.01 Init + 62191 62259 69 0 0 78 81 52 0.362 4.56 2.02 Intr + 63357 63433 77 2 2 98 86 10 0.213 0.11 2.03 Intr + 68886 69256 371 0 2 84 45 152 0.650 5.25 2.04 Term + 78852 78985 134 1 2 115 55 52 0.742 2.85 2.05 PlyA + 81635 81640 6 1.05 3.00 Prom + 83301 83340 40 -3.16 3.01 Init + 85382 85428 47 1 2 69 66 50 0.176 1.06 3.02 Intr + 99306 99465 160 0 1 11 105 95 0.361 3.49 3.03 Intr + 99845 99891 47 1 2 106 101 27 0.749 3.01 3.04 Intr + 100002 100089 88 0 1 62 62 139 0.949 8.67 3.05 Intr + 100291 100350 60 0 0 29 96 98 0.845 3.63 3.06 Intr + 104450 104595 146 1 2 100 78 117 0.964 10.98 3.07 Intr + 104719 104854 136 1 1 91 83 -19 0.876 -1.53 3.08 Intr + 108015 108161 147 2 0 69 84 33 0.702 1.43 3.09 Intr + 109035 109130 96 2 0 109 98 28 0.981 6.11 3.10 Intr + 109309 109443 135 0 0 111 107 36 0.993 8.56 3.11 Intr + 109553 109664 112 1 1 95 111 48 0.994 7.75 3.12 Intr + 109759 109916 158 2 2 85 65 91 0.958 6.23 3.13 Intr + 110968 111055 88 0 1 76 95 8 0.352 -0.16 3.14 Intr + 111432 111569 138 1 0 87 78 29 0.292 2.24 3.15 Term + 125144 125190 47 0 2 118 48 39 0.005 0.27 3.16 PlyA + 125347 125352 6 -0.45 4.12 PlyA - 125643 125638 6 1.05 4.11 Term - 127387 127308 80 2 2 93 48 52 0.370 -0.37 4.10 Intr - 127816 127507 310 1 1 31 37 163 0.484 1.39 4.09 Intr - 129751 129638 114 1 0 103 60 21 0.657 1.44 4.08 Intr - 132995 132830 166 1 1 -22 96 151 0.041 4.86 4.07 Intr - 149243 149132 112 0 1 54 25 138 0.017 3.54 4.06 Intr - 161011 160942 70 1 1 80 3 43 0.085 -6.25 4.05 Intr - 161419 161321 99 1 0 112 74 30 0.517 4.31 4.04 Intr - 162013 161830 184 0 1 58 82 81 0.331 4.29 4.03 Intr - 163904 163753 152 2 2 96 49 75 0.166 3.36 4.02 Intr - 170470 170343 128 2 2 78 60 56 0.038 2.20 4.01 Init - 189182 188939 244 2 1 72 53 158 0.043 8.70 4.00 Prom - 191099 191060 40 -5.16 5.00 Prom + 208006 208045 40 -3.76 5.01 Sngl + 208495 208965 471 0 0 59 48 547 0.961 43.92 5.02 PlyA + 209468 209473 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 69682 69773 92 2 2 97 45 70 0.914 1.48 S.002 Init - 132925 132830 96 1 0 60 96 78 0.802 6.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:29712688_29923774|GENSCAN_predicted_peptide_1|284_aa XWAQKRAQGHLEVKGAASVFGECVGRSQISSVPEPALEPLGLGVAKGEAANSPETLRLFW LPPLTVAINRTSTLSLSYACFSYSESTRFPRRRDSGATMQLEEAVQCLHVPDSPFDVYSP IVIHLNVYSPLQAPPALPLFSVPGPNELGAPDLLGDFPKVSITQGVGATDFDCWVYSGQR GFLADLNMPVSYGPMGAGHREGAFDAVEQGGGIVHAKWPLKNWTFDTLTPTGFPRACMAS LTIHIHGGELLTYRGRAASSSDSLALSEGSGICSHERNVCRVTP >gi568815579f:29712688_29923774|GENSCAN_predicted_CDS_1|855_bp ncctgggcacagaagagagcacaggggcatctggaggtcaagggtgctgcgtcagtattt ggtgaatgtgtgggccggagccagatttcctctgttccagagcctgctctggagcctctg ggtctgggggttgcaaaaggagaagcagctaactccccggaaaccctgcgtttattctgg ctgccccctttgactgtggcaataaatagaacttcaaccctctcactttcttatgcctgc ttttcttacagtgaaagcaccagatttccgcgtaggagggactcgggggcaacgatgcaa ttggaagaggctgtgcaatgccttcatgtccctgattcaccttttgacgtctacagcccg attgtaatacacttaaatgtgtattctccactccaagctcctccagccctccctctgttc agtgtgcctggcccaaatgaactgggagctccagaccttctcggggacttccccaaagtc tctataacacaaggtgttggtgccacagactttgactgttgggtctacagtggccagaga ggctttttggcagaccttaacatgcctgtctcctatggacctatgggagctggacacagg gaaggagcctttgatgccgtggaacaaggtggaggaattgtccatgcgaagtggcccctg aagaactggacttttgacacgcttacccccaccggctttccccgcgcctgcatggcctct ctaaccatccacatccatggaggtgagctcctcacgtacaggggtcgcgctgcctcctcc tctgactccctggctctgagcgaaggctctggcatctgtagccatgagaggaatgtttgc cgagtgaccccatga >gi568815579f:29712688_29923774|GENSCAN_predicted_peptide_2|216_aa MSLPAGKVPSGKGALLACYSPQKMERRRQFGEETSLSTISPGSNIPGERHTVFVLEISLV APCADWQTGTWAHTRAAHTQPQPPHWHGSSGGLCRCRATEMEAGRVESQLAPVPSLPEEL HTAVPGGIGDAFQHWRAGTEHGTGKDKGLGWVAPGAIGAWLVAPEHQDVGRLVHQVLKTS VPSSNRVLAGGLEDCWGLAAAVSGAEARWPDISFRC >gi568815579f:29712688_29923774|GENSCAN_predicted_CDS_2|651_bp atgtccctgccagctggcaaggtgccctcaggaaaaggtgcccttctggcctgctacagt ccccaaaagatggaaaggagaaggcagtttggggaagaaaccagcctcagcaccatctcc cctgggtctaacattccaggagaaaggcatacggtctttgtgctggaaatttccctcgtg gctccatgcgcggactggcagacgggcacgtgggcccacacgcgggcagcacacacgcag ccgcagccccctcattggcatggctctagtgggggcctctgcaggtgccgggctacagag atggaagcggggagagtggaatcacagttggcacctgtgccgagcttgcctgaggagctg cacactgcggtgcctggcggaatcggagatgctttccagcactggagggcagggacggag cacgggaccgggaaggacaagggcttgggttgggtcgcccctggggccattggggcctgg cttgtggcacccgagcaccaggatgtgggaaggctggtccatcaggttctaaagacctct gttccttcttccaaccgcgtcctggctggtggccttgaggactgctggggtctcgctgcg gctgtgtctggagctgaagcgagatggcctgacatcagtttccgttgttga >gi568815579f:29712688_29923774|GENSCAN_predicted_peptide_3|534_aa MRVGQPDTTGPGAEDRSSRRGRQRGVGGRRGSRHRRALGPPTPGAAAATAVAAAACRDWS APSAADKTLASGRSSPIMPRERRERDAKERDTMKEDGGAEFSARSRKRKANVTVDPDEEM AKIDRTARDQCGSQPWDNNAVCADPCSLIPTPDKEDDDRVYPNSTCKPRIIAPSRGSPLP VLSWANREEVWKIMLNKEKTYLRDQHFLEQHPLLQPKMRAILLDWLMEVCEVYKLHRETF YLAQDFFDRYMATQENVVKTLLQLIGISSLFIAAKLEEIYPPKLHQFAYVTDGACSGDEI LTMELMIMKALKWRLSPLTIVSWLNVYMQVAYLNDLHEVLLPQYPQQIFIQIAELLDLCV LDVDCLEFPYGILAASALYHFSSSELMQKVSGYQWCDIENCVKWMVPFAMVIRETGSSKL KHFRGVADEDAHNIQTHRDSLDLLDKARAKKAMLSEQNRASPLPSGLLTPPQSGVLGSVV PSGAGGCGQALCRAHSQLGRGLPSPHYQLTVYNAFDELFCLLVIPEVTFGPQSL >gi568815579f:29712688_29923774|GENSCAN_predicted_CDS_3|1605_bp atgagagtgggacagccagacaccacgggtcctggagctgaggacaggagcagccggcgc ggccgccagcgcggtgtagggggcaggcgcggatcccgccaccgccgcgcgctcggcccg ccgactcccggcgccgccgccgccactgccgtcgccgccgccgcctgccgggactggagc gcgccgtccgccgcggacaagaccctggcctcaggccggagcagccccatcatgccgagg gagcgcagggagcgggatgcgaaggagcgggacaccatgaaggaggacggcggcgcggag ttctcggctcgctccaggaagaggaaggcaaacgtgaccgttgatccagatgaagaaatg gccaaaatcgacaggacggcgagggaccagtgtgggagccagccttgggacaataatgca gtctgtgcagacccctgctccctgatccccacacctgacaaagaagatgatgaccgggtt tacccaaactcaacgtgcaagcctcggattattgcaccatccagaggctccccgctgcct gtactgagctgggcaaatagagaggaagtctggaaaatcatgttaaacaaggaaaagaca tacttaagggatcagcactttcttgagcaacaccctcttctgcagccaaaaatgcgagca attcttctggattggttaatggaggtgtgtgaagtctataaacttcacagggagaccttt tacttggcacaagatttctttgaccggtatatggcgacacaagaaaatgttgtaaaaact cttttacagcttattgggatttcatctttatttattgcagccaaacttgaggaaatctat cctccaaagttgcaccagtttgcgtatgtgacagatggagcttgttcaggagatgaaatt ctcaccatggaattaatgattatgaaggcccttaagtggcgtttaagtcccctgactatt gtgtcctggctgaatgtatacatgcaggttgcatatctaaatgacttacatgaagtgcta ctgccgcagtatccccagcaaatctttatacagattgcagagctgttggatctctgtgtc ctggatgttgactgccttgaatttccttatggtatacttgctgcttcggccttgtatcat ttctcgtcatctgaattgatgcaaaaggtttcagggtatcagtggtgcgacatagagaac tgtgtcaagtggatggttccatttgccatggttataagggagacggggagctcaaaactg aagcacttcaggggcgtcgctgatgaagatgcacacaacatacagacccacagagacagc ttggatttgctggacaaagcccgagcaaagaaagccatgttgtctgaacaaaatagggct tctcctctccccagtgggctcctcaccccgccacagagcggtgttctgggctccgttgta ccaagtggagcaggtggttgcgggcaagcgttgtgcagagcccatagccagctgggcagg gggctgccctctccacattatcagttgacagtgtacaatgcctttgatgaactgttttgc ctcctggtgatccctgaggtgacctttggccctcagtccctctaa >gi568815579f:29712688_29923774|GENSCAN_predicted_peptide_4|552_aa MKRSRCRDRPQPPPPDRREDAVQRAAELSQSLPPRRRAPPGRQRLEERTGPAGPEGKEQP PALASQSAEIAASARLPPRLGTPGPPGSWEPQRDGSRGQPLWMDPLLGILNSKHQPHICK QSLQQHNNGVECVGAPLPLDHLGEVNGTHLASSAAEKWGGYIKRKTERRVNERRRKQALP IRQEDMGGDLKVWGQLWESLHTGRELTNLLELRGKTPGNNDVGGTVSRVAEPSVHQDPQP ASSEVNSVTVCKYGCRQIAASCPQPRASGMPGRRHQMAADALRSSGREGAGPGISRKDSV SLELLGLQLEEHLQDNEANIMKSKAQEEVGRAVEHPAGGEYGAQSSGLRGGVKMWLSGVQ TASMFRKHVENSLATVRTIRIIRMQGQGPEGWTLRCWFKEEENGNPNTGRSKRHQRGKLG ALGAVTPRLSPGAEVAVLIISTAAECASFQQLFGVNVHVEPSTRASLLASATLLSVGSTK DARRGLDHQLCKSAPSVWEEVVWRAESPVGVEEKKEEKKVGSAGDGGHPRAPCMEIPELP ELQVEVAELGPH >gi568815579f:29712688_29923774|GENSCAN_predicted_CDS_4|1659_bp atgaagcggagccgctgccgcgaccgaccgcagccgccgccgcccgaccgccgggaggat gcagttcagcgggcagcggagctgtctcagtctttgccgccgcgccggcgagcgccgccc gggaggcagcggctggaggagcggacgggccccgcggggcccgagggcaaggagcagccg cctgccttggcctcccaaagtgccgagattgcagcctctgcccggctgccaccccgtctg ggaactccaggccctcctggctcctgggagccacagagagatggctcccgtgggcagccg ctgtggatggatcctttattgggcatcttgaattccaagcaccagccccacatttgcaag cagagcctccagcaacacaacaacggtgtagagtgcgtgggggccccactgccactggac cacttgggagaggtgaatggtactcacctcgcctcttctgcagcagagaagtggggagga tatatcaagaggaaaacagagaggagagtcaatgagagacggaggaaacaggccttgcct ataaggcaggaggatatggggggagacctcaaggtttgggggcagctgtgggagtctctg cacactggcagagaattaactaacctcttggagcttagagggaaaaccccggggaacaat gatgtcgggggcacggtcagcagagtggccgagccatctgtacaccaggacccgcagcca gccagctctgaggttaattctgtgacagtgtgcaaatacggctgccgccaaatcgcagcc tcgtgcccgcagcctcgcgcctcagggatgcccgggcgccgccaccagatggcagcagac gctttgcgatcgagcggccgggagggcgcggggccggggatttctcgcaaggacagtgta agcctggagctgctggggctacaactggaagaacacttgcaggataacgaagccaatata atgaaaagcaaagcccaggaagaggtgggaagggccgtggaacacccggccggcggggaa tacggagctcagagcagcgggctgcgcgggggcgtaaaaatgtggctgtcaggggtccaa acagcatcaatgttcagaaagcacgtggaaaacagtttagctactgttagaactatcagg atcatccgcatgcagggtcagggtcctgaaggttggaccttacggtgttggtttaaggaa gaggaaaatggaaatccaaacacagggagatctaagagacaccagagagggaaactaggg gccttaggggctgtcacaccgagactttctccaggagctgaggtggcagtgctgataatt tctacagcagcagagtgtgccagtttccagcagctgttcggggtgaatgtgcatgtggag ccttccaccagagcatcgctactcgcctcggcaactttactcagtgttggatccaccaag gatgccaggaggggcctcgaccaccagctgtgcaaatcagcccctagtgtctgggaggag gtggtctggagggcagaaagcccagtgggggttgaggagaagaaggaagagaagaaggtt ggcagtgcgggagatgggggtcaccctcgagccccctgcatggaaattccagaactgcct gagttgcaggtggaggtggctgagttgggaccccattag >gi568815579f:29712688_29923774|GENSCAN_predicted_peptide_5|156_aa MVNPTKFFNEPWGRISIQLFADKFPKTAENVCALSIGEKGFGYKGSCFHRIIPGFMCHGG DFTHHNGSGGKYIYGEKFDDENFILKQTGSGILSKENAGPNTNGSQFFICSAKSEWLDGE HVFFGKVKEGMNIVEAMEGFGSRNGKTSKKITIADC >gi568815579f:29712688_29923774|GENSCAN_predicted_CDS_5|471_bp atggtcaaccccaccaagttcttcaatgagccctggggccgcatctccatccagctgttt gcagacaagtttccaaagacagcagaaaatgtttgtgctctgagcattggagagaaagga tttggttataagggttcctgctttcacagaattattccggggtttatgtgtcacggtggt gacttcacacaccataatggcagtggtggcaagtacatctatggggagaaatttgatgat gagaacttcatcctgaagcagacaggttctggcatcttgtccaaggaaaatgctggaccc aacacaaacggttcccagtttttcatctgcagtgccaagagtgagtggttggatggtgag catgtgttctttggcaaggtgaaagaaggcatgaatattgtggaggccatggagggtttt gggtccaggaatggcaagaccagcaagaagatcaccattgctgactgttga