GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:44:31

Sequence gi568815591f:66841866_67053372 : 211507 bp : 45.34% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 PlyA -    253    248    6                               1.05
 1.06 Term -   3273   2622  652  2  1   28   54   415 0.116  25.61
 1.05 Intr -  37618  37566   53  1  2   89   50    68 0.013   0.81
 1.04 Intr -  38577  38460  118  2  1   30  123    51 0.235   3.27
 1.03 Intr -  64702  64610   93  0  0   71   76    52 0.092   1.38
 1.02 Intr -  66969  66728  242  0  2  114   69    98 0.883   6.85
 1.01 Init -  67178  67041  138  2  0   87  108    56 0.707   7.66
 1.00 Prom -  68251  68212   40                              -6.56

 2.00 Prom +  68977  69016   40                              -5.46
 2.01 Init +  72716  72961  246  1  0   86   12   385 0.055  28.30
 2.02 Intr +  99983 100159  177  1  0   95   65   291 0.923  27.62
 2.03 Intr + 103111 103396  286  0  1   79   85   183 0.992  14.01
 2.04 Intr + 106679 106829  151  0  1   71   75   143 0.848  10.52
 2.05 Intr + 109087 109270  184  1  1  109   95   181 0.986  20.69
 2.06 Intr + 111361 111504  144  0  0  103  105    27 0.975   6.38
 2.07 Term + 113637 113657   21  2  0  124   31    16 0.597  -2.19
 2.08 PlyA + 114263 114268    6                               1.05

 3.06 PlyA - 114668 114663    6                               1.05
 3.05 Term - 116316 116227   90  0  0   52   51    73 0.697  -2.28
 3.04 Intr - 116726 116574  153  2  0   95   52   133 0.796  10.67
 3.03 Intr - 128828 128790   39  2  0   81   80    33 0.026   0.22
 3.02 Intr - 129212 129177   36  2  0   95   87    11 0.040   0.16
 3.01 Init - 132833 132753   81  2  0   76   77    30 0.233   1.67
 3.00 Prom - 133471 133432   40                              -0.76

 4.08 PlyA - 134729 134724    6                               1.05
 4.07 Term - 146634 146506  129  0  0   79   47   187 0.999  11.88
 4.06 Intr - 149436 149272  165  0  0   17   60   192 0.992   9.46
 4.05 Intr - 151552 151352  201  1  0   51  109   200 0.999  17.88
 4.04 Intr - 152476 152347  130  0  1   32   94   107 0.948   6.40
 4.03 Intr - 153557 153425  133  0  1   34   67   244 0.932  16.60
 4.02 Intr - 155104 154888  217  1  1   96   78   196 0.696  17.48
 4.01 Init - 156244 156233   12  1  0   49  100    -8 0.418  -3.42
 4.00 Prom - 156612 156573   40                              -3.76

 5.00 Prom + 160578 160617   40                              -6.36
 5.01 Init + 164865 164876   12  2  0  114   68     3 0.494   1.27
 5.02 Intr + 167718 167819  102  2  0   75   73   117 0.996   9.27
 5.03 Intr + 172502 172696  195  1  0   60  100   182 0.999  16.21
 5.04 Intr + 175988 176278  291  1  0  105   59   247 0.974  20.83
 5.05 Intr + 183035 183157  123  1  0   64   81   176 0.976  15.28
 5.06 Term + 183339 183368   30  2  0   73   47    22 0.637  -5.55
 5.07 PlyA + 183520 183525    6                               1.05

 6.04 PlyA - 183619 183614    6                               1.05
 6.03 Term - 190436 190398   39  2  0  125   42    22 0.404  -1.51
 6.02 Intr - 208605 208558   48  0  0  129  116   -24 0.865   3.38
 6.01 Intr - 210041 209898  144  2  0   38  115   131 0.977  11.28

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  72716  73078  363  1  0   86   37   364 0.941  27.18
S.002 Term -  79975  79704  272  2  2    6   48   146 0.926  -1.95
S.003 Init -  80088  80004   85  0  1   77   83    67 0.879   6.08

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:66841866_67053372|GENSCAN_predicted_peptide_1|431_aa
MRPELPVMNWCFLTHLAIKWVVHSSIPSSDGGGIYVIGLQLVLKAQPEMMASWGVPYDQL
TEEEKTRGWFTGGSARYAGTTQKWTAAALQPLSGTSLMDSSEEKSFQWTELQAVHLVVHF
AWKEMARTSDSKFFRFGTQNGFLAPQLADSLLWNLVILNDRNPLHTAVAPGSLCILIKRQ
GSCSKTGGVYDATKQRSAATGKVDSKGTIPCVTDNQQHTEHGGEVAVTRHHSEDKTAQLQ
ALYRHSDDAEGLALLRRRGPAPVTSAKALPSDDREPRPSCGTNRNSRSGCRVVQKRACAD
AWPQPAGQCSQAPTRRPTPVSYKEELSHCLCYYYRRDFPACSVGRSKGLTLSEQALHTKR
LSPGGHKQKVGQKLLNGHCKPASSGCSRCRTLASPEAGYVRPPTPPPARDRGPRSVPRRP
APLILGRLGSL

>gi568815591f:66841866_67053372|GENSCAN_predicted_CDS_1|1296_bp
atgagacctgaactacctgtcatgaactggtgttttctgacccacctagccataaagtgg
gtcgtgcacagcagcattccatcatcagatggaggtggtatatatgtgatcgggctccag
ctggtcctgaaggcacaacctgaaatgatggcctcatggggagttccctatgatcagttg
acagaggaagagaagactaggggctggttcacaggtggttctgcacgatatgcaggcacc
acccaaaaatggacagctgcagcactacagcccctttctgggacatcccttatggacagc
agtgaagaaaaatctttccagtggacagaacttcaagcagtgcacctggttgtgcacttt
gcatggaaggaaatggccagaacatcggactccaagttcttcaggtttgggactcagaat
ggcttccttgctcctcagcttgcagacagcctattgtggaaccttgtgatcctaaatgac
cgaaatccactccatacagcagtggcccctggcagcctttgcattctcatcaagaggcag
ggtagctgttcaaagacaggtggtgtctatgatgcaactaagcagcgctcggcagccacg
gggaaagtggacagcaaagggaccattccctgtgtcactgataaccagcaacacacagaa
catggcggggaagtcgcggttaccaggcaccactctgaagacaaaacagcccagctccag
gcactgtaccgtcactctgacgacgcggaaggcctggcgctattgcgtcgccgagggccc
gcgcctgtgacgtcagcgaaggcgctccccagtgacgacagagagccacgcccctcatgt
ggaaccaatcggaactcgaggagcggctgccgggtcgtccagaagcgcgcatgcgcagac
gcgtggccacagccggccggtcagtgttcgcaggctccgacccggcggccaacaccagta
agctacaaggaggagctttcccactgcctgtgctactactaccggcgcgacttcccagcc
tgcagcgtagggcgcagcaagggcctgacgctgagcgagcaggcgctgcacaccaagcgg
ttgtcgcctggcgggcacaaacagaaggtcgggcagaagctccttaatggccactgcaag
ccggctagctccggctgcagccgttgccgcacactcgcctcacctgaggctgggtacgtg
cgccccccaacacctcctccagccagggaccggggaccccgcagcgtcccccgccgcccg
gcgccgctcatcctgggcaggctcggctccctctga

>gi568815591f:66841866_67053372|GENSCAN_predicted_peptide_2|402_aa
MAPAKKGGEKKKGHSAINEVVTREYTINIHKRIHEVGFKKRAPRALKEIQKYAMKEMGTL
DMHIDTRLNKAVWAKGISHTMSVELIRIMFSINPLENLKVYISSRPPLVVFMISVSAMAI
AFLTLGYFFKIKEIKSPEMAEDWNTFLLRFNDLDLCVSENETLKHLTNDTTTPESTMTSG
QARASTQSPQALEDSGPVNISVSITLTLDPLKPFGGYSRNVTHLYSTILGHQIGLSGREA
HEEINITFTLPTAWSSDDCALHGHCEQVVFTACMTLTASPGVFPVTVQPPHCVPDTYSNA
TLWYKIFTTARDANTKYAQDYNPFWCYKGAIGKVYHALNPKLTVIVPDDDRSLINLHLMH
TSYFLFVMVITMFCYAVIKGRPSKLRQSNPEFCPEKVALAEA

>gi568815591f:66841866_67053372|GENSCAN_predicted_CDS_2|1209_bp
atggctcccgcaaagaagggtggtgagaagaaaaagggccattctgccatcaacgaggtg
gtgacccgagaatacaccatcaacattcacaagcgcatccatgaagtgggcttcaagaag
cgtgcccctcgggcactcaaagaaattcagaaatatgccatgaaggagatgggaactcta
gacatgcacattgataccaggctcaacaaagctgtctgggccaaaggaatatcccatacc
atgtctgtggagctgatcagaataatgttcagcatcaaccccctggagaacctgaaggtg
tacatcagcagtcggcctcccctggtggtcttcatgatcagcgtaagcgccatggccata
gctttcctgaccctgggctacttcttcaaaatcaaggagattaaatccccagaaatggca
gaggattggaatacttttctgctacggttcaatgatttggacttgtgtgtatcagagaat
gaaaccctcaagcatctcacaaacgacaccacaactccggaaagtacaatgaccagcggg
caggcccgagcttccacccagtccccccaggccctggaggactcgggcccggtgaatatc
tcagtctcaatcaccctaaccctggacccactgaaacccttcggagggtattcccgcaac
gtcacccatctgtactcaaccatcttagggcatcagattggactttcaggcagggaagcc
cacgaggagataaacatcaccttcaccctgcctacagcgtggagctcagatgactgcgcc
ctccacggtcactgtgagcaggtggtattcacagcctgcatgaccctcacggccagccct
ggggtgttccccgtcactgtacagccaccgcactgtgttcctgacacgtacagcaacgcc
acgctctggtacaagatcttcacaactgccagagatgccaacacaaaatacgcccaagat
tacaatcctttctggtgttataagggggccattggaaaagtctatcatgctttaaatccc
aagcttacagtgattgttccagatgatgaccgttcattaataaatttgcatctcatgcac
accagttacttcctctttgtgatggtgataacaatgttttgctatgctgttatcaagggc
agacctagcaaattgcgtcagagcaatcctgaattttgtcccgagaaggtggctttggct
gaagcctaa

>gi568815591f:66841866_67053372|GENSCAN_predicted_peptide_3|132_aa
MPRDQTYVRLQPSKICRRFSQTLVVLLPCRRASSPSAMIVLALPMGARNCSWKTPIRLEE
EDAEQINSDVLFCKAHMHILQTRKINLEVNAITDQLLTIMQQESEMPEGELVLLHGAALN
SKRTPSVAHFES

>gi568815591f:66841866_67053372|GENSCAN_predicted_CDS_3|399_bp
atgccgagagatcaaacgtacgtaagacttcagcccagcaagatctgccgcagattttct
caaaccctggtggtccttttgccatgcagacgtgcctcttcgccttctgccatgattgta
ctggcattaccaatgggggctcgaaactgcagctggaaaacacccatccggctggaggaa
gaagacgcggagcaaataaattcagatgttttattctgtaaagcacatatgcacatactt
caaacacggaaaattaatctggaagtgaatgccataacagatcagctactgaccatcatg
cagcaagagtccgagatgcctgagggtgagctggtgctcctgcacggggccgccctgaac
agcaagcgcactccgtctgttgcacacttcgaatcctga

>gi568815591f:66841866_67053372|GENSCAN_predicted_peptide_4|328_aa
MILKSRQETRISDLDSPMIRTGTARRTELPRRVSGVPSACRPVGSHDTARPVLIGSEDEE
VGPEESPELRERKPVPAAMSIFTPTNQIRLTNVAVVRMKRAGKRFEIACYKNKVVGWRSG
VEKDLDEVLQTHSVFVNVSKGQVAKKEDLISAFGTDDQTEICKQILTKGEVQVSDKERHT
QLEQMFRDIATIVADKCVNPETKRPYTVILIERAMKDIHYSVKTNKSTKQQALEVIKQLK
EKMKIERAHMRLRFILPVNEGKKLKEKLKPLIKVIESEDYGQQLEIVCLIDPGCFREIDE
LIKKETKGKGSLEVLNLKDVEEGDEKFE

>gi568815591f:66841866_67053372|GENSCAN_predicted_CDS_4|987_bp
atgatattaaagagccgacaggagactaggatctcggacctggatagcccgatgattcgc
actggtaccgcgagacgcaccgagctacctcgccgcgttagcggcgtaccgagtgcctgc
agacctgtgggcagccatgacactgccaggccggtcctgattggttcagaggacgaagag
gtgggcccagaggaaagcccagagctccgggaaaggaagccagtgcccgccgcgatgtcg
atcttcacccccaccaaccagatccgcctaaccaatgtggccgtggtacggatgaagcgt
gccgggaagcgcttcgaaatcgcctgctacaaaaacaaggtcgtcggctggcggagcggc
gtggaaaaagacctcgatgaagttctgcagacccactcagtgtttgtaaatgtttctaaa
ggtcaggttgccaaaaaggaagatctcatcagtgcgtttggaacagatgaccaaactgaa
atctgtaagcagattttgactaaaggagaagttcaagtatcagataaagaaagacacaca
caactggagcagatgtttagggacattgcaactattgtggcagacaaatgtgtgaatcct
gaaacaaagagaccatacaccgtgatccttattgagagagccatgaaggacatccactat
tcggtgaaaaccaacaagagtacaaaacagcaggctttggaagtgataaagcagttaaaa
gagaaaatgaagatagaacgtgctcacatgaggcttcggttcatccttccagtcaatgaa
ggcaagaagctgaaagaaaagctcaagccactgatcaaggtcatagaaagtgaagattat
ggccaacagttagaaatcgtatgtctgattgacccgggctgcttccgagaaattgatgag
ctaataaaaaaggaaactaaaggcaaaggttctttggaagtactcaatctgaaagatgta
gaagaaggagatgagaaatttgaatga

>gi568815591f:66841866_67053372|GENSCAN_predicted_peptide_5|250_aa
MPGQGFATVLAEAVTSLDLPVAIINLKEYDPDDHLIEEVTSKNVCVFLVATYTDGLPTES
AEWFCKWLEEASIDFRFGKTYLKGMRYAVFGLGNSAYASHFNKVGKNVDKWLWMLGAHRV
MSRGEGDCDVVKSKHGSIEADFRAWKTKFISQLQALQKGERKKSCGGHCKKGKCESHQHG
SEEREEGSHEQDELHHRDTEEEEPFESSSEEEFGGEDHQSLNSIVDVEDLGKIMDHVKKE
KACESQLIRG

>gi568815591f:66841866_67053372|GENSCAN_predicted_CDS_5|753_bp
atgcccggccagggattcgcaacagttcttgctgaagcagttacatccctggatctgcct
gtggccattattaatctaaaagaatatgatccagatgatcatctgatagaagaggtgact
agtaaaaatgtctgtgtcttcctggttgcgacatacactgacggcctaccaactgaaagt
gcagagtggttctgcaaatggttagaggaagcatccattgattttcgatttggcaaaact
tacctgaagggtatgagatatgcggtatttggcctgggaaattctgcctatgctagccac
ttcaacaaggttggcaaaaatgttgacaagtggctctggatgcttggcgcgcatcgtgtg
atgagtcgaggggagggcgactgcgacgtggttaaaagcaagcacggcagcattgaggcc
gacttcagagcatggaagaccaagttcatctcccagctgcaggcacttcagaaaggggag
agaaagaagtcctgtggcggccactgcaagaaaggcaaatgtgaatctcaccaacatggc
tcagaggagagggaggaaggatctcatgagcaggatgaattgcatcatagagacaccgag
gaggaagaaccctttgagagctccagtgaagaagagtttggtggtgaggaccatcagagc
ctaaattccattgttgatgttgaagatttgggcaaaattatggatcatgtgaagaaagaa
aaggcctgtgaaagtcagctcatcagaggataa
>gi568815591f:66841866_67053372|GENSCAN_predicted_peptide_6|76_aa
QKITELEDTVIENEGKVKPFSDIRKLKEFTTNKPALQEMFKKSLESPQETNIITHSCNYN
LLNKLLPNPTEIQNSS

>gi568815591f:66841866_67053372|GENSCAN_predicted_CDS_6|231_bp
caaaaaattactgaacttgaagacacagtgatagaaaatgaaggcaaagtaaagccattt
tcagatatacgaaagctgaaagaattcaccaccaacaaacctgcactacaagagatgttc
aaaaagtccttggagtcgccacaggaaactaatattattactcattcatgcaactataac
ctccttaataagctgctcccaaaccctacggagatccaaaacagcagctaa