GENSCAN 1.0 Date run: 8-Nov-116 Time: 07:38:40

Sequence gi568815591r:66888374_67095417 : 207044 bp : 44.48% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1068   1148   81  2  0   60   87    70 0.467   5.07
 1.02 Term +  14410  14628  219  0  0  109   41    78 0.117   2.24
 1.03 PlyA +  17895  17900    6                               1.05

 2.04 PlyA -  17945  17940    6                              -0.45
 2.03 Term -  18194  18098   97  1  1   71   44    61 0.295  -2.56
 2.02 Intr -  20461  20220  242  1  2  114   69    98 0.882   6.85
 2.01 Init -  20670  20533  138  0  0   87  108    56 0.707   7.66
 2.00 Prom -  21743  21704   40                              -6.56

 3.00 Prom +  22469  22508   40                              -5.46
 3.01 Init +  26208  26453  246  2  0   86   12   385 0.055  28.30
 3.02 Intr +  53475  53651  177  2  0   95   65   291 0.923  27.62
 3.03 Intr +  56603  56888  286  1  1   79   85   183 0.992  14.01
 3.04 Intr +  60171  60321  151  1  1   71   75   143 0.848  10.52
 3.05 Intr +  62579  62762  184  2  1  109   95   181 0.986  20.69
 3.06 Intr +  64853  64996  144  1  0  103  105    27 0.975   6.38
 3.07 Term +  67129  67149   21  0  0  124   31    16 0.597  -2.19
 3.08 PlyA +  67755  67760    6                               1.05

 4.06 PlyA -  68160  68155    6                               1.05
 4.05 Term -  69808  69719   90  1  0   52   51    73 0.697  -2.28
 4.04 Intr -  70218  70066  153  0  0   95   52   133 0.796  10.67
 4.03 Intr -  82320  82282   39  0  0   81   80    33 0.026   0.22
 4.02 Intr -  82704  82669   36  0  0   95   87    11 0.040   0.16
 4.01 Init -  86325  86245   81  0  0   76   77    30 0.233   1.67
 4.00 Prom -  86963  86924   40                              -0.76

 5.08 PlyA -  88221  88216    6                               1.05
 5.07 Term - 100126  99998  129  1  0   79   47   187 0.999  11.88
 5.06 Intr - 102928 102764  165  1  0   17   60   192 0.992   9.46
 5.05 Intr - 105044 104844  201  2  0   51  109   200 0.999  17.88
 5.04 Intr - 105968 105839  130  1  1   32   94   107 0.948   6.40
 5.03 Intr - 107049 106917  133  1  1   34   67   244 0.932  16.60
 5.02 Intr - 108596 108380  217  2  1   96   78   196 0.696  17.48
 5.01 Init - 109736 109725   12  2  0   49  100    -8 0.418  -3.42
 5.00 Prom - 110104 110065   40                              -3.76

 6.00 Prom + 114070 114109   40                              -6.36
 6.01 Init + 118357 118368   12  0  0  114   68     3 0.494   1.27
 6.02 Intr + 121210 121311  102  0  0   75   73   117 0.996   9.27
 6.03 Intr + 125994 126188  195  2  0   60  100   182 0.999  16.21
 6.04 Intr + 129480 129770  291  2  0  105   59   247 0.974  20.83
 6.05 Intr + 136527 136649  123  2  0   64   81   176 0.976  15.28
 6.06 Term + 136831 136860   30  0  0   73   47    22 0.636  -5.55
 6.07 PlyA + 137012 137017    6                               1.05

 7.06 PlyA - 137111 137106    6                               1.05
 7.05 Term - 143928 143890   39  0  0  125   42    22 0.404  -1.51
 7.04 Intr - 162097 162050   48  1  0  129  116   -24 0.865   3.38
 7.03 Intr - 163533 163390  144  0  0   38  115   131 0.978  11.28
 7.02 Intr - 187940 187710  231  2  0   78   93    76 0.475   5.07
 7.01 Init - 193242 193186   57  0  0   71   75    19 0.413   0.21
 7.00 Prom - 199188 199149   40                              -1.76

 8.02 PlyA - 199478 199473    6                               1.05
 8.01 Term - 201064 200884  181  0  1   60   55   135 0.479   4.48

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  26208  26570  363  2  0   86   37   364 0.941  27.18
S.002 Term -  33467  33196  272  0  2    6   48   146 0.926  -1.95
S.003 Init -  33580  33496   85  1  1   77   83    67 0.879   6.08

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:66888374_67095417|GENSCAN_predicted_peptide_1|99_aa
MRARDTAVPHGDSQLVFGSSLWKGCTLVRLLRPDGLFRPGSSLTMASFKGPAFPSWRPLQ
AQIGKPPLSWWLSPGLLSRPSTLSRLYLQAQPLPADSRF

>gi568815591r:66888374_67095417|GENSCAN_predicted_CDS_1|300_bp
atgagagccagagatacagcggtgccccatggagacagccagctggtgtttggtagttcc
ttgtggaaaggctgtactttggtccggctcctgcgtcctgacggcctctttaggcccggc
tcgtccctcacaatggcctctttcaaaggcccagcttttccttcgtggcggcctctccag
gcccaaattggcaagcctcccctgtcctggtggctgtctcccggcctactctccaggccc
agcacactctcgcggctgtacctccaggcccagcccctgcctgcagacagccgcttttga

>gi568815591r:66888374_67095417|GENSCAN_predicted_peptide_2|158_aa
MRPELPVMNWCFLTHLAIKWVVHSSIPSSDGGGIYVIGLQLVLKAQPEMMASWGVPYDQL
TEEEKTRGWFTGGSARYAGTTQKWTAAALQPLSGTSLMDSSEEKSFQWTELQAVHLVVHF
AWKEMARTSDSKFFRFGTQNGFLAPQLADSLLWNLVIL

>gi568815591r:66888374_67095417|GENSCAN_predicted_CDS_2|477_bp
atgagacctgaactacctgtcatgaactggtgttttctgacccacctagccataaagtgg
gtcgtgcacagcagcattccatcatcagatggaggtggtatatatgtgatcgggctccag
ctggtcctgaaggcacaacctgaaatgatggcctcatggggagttccctatgatcagttg
acagaggaagagaagactaggggctggttcacaggtggttctgcacgatatgcaggcacc
acccaaaaatggacagctgcagcactacagcccctttctgggacatcccttatggacagc
agtgaagaaaaatctttccagtggacagaacttcaagcagtgcacctggttgtgcacttt
gcatggaaggaaatggccagaacatcggactccaagttcttcaggtttgggactcagaat
ggcttccttgctcctcagcttgcagacagcctattgtggaaccttgtgatcctgtga

>gi568815591r:66888374_67095417|GENSCAN_predicted_peptide_3|402_aa
MAPAKKGGEKKKGHSAINEVVTREYTINIHKRIHEVGFKKRAPRALKEIQKYAMKEMGTL
DMHIDTRLNKAVWAKGISHTMSVELIRIMFSINPLENLKVYISSRPPLVVFMISVSAMAI
AFLTLGYFFKIKEIKSPEMAEDWNTFLLRFNDLDLCVSENETLKHLTNDTTTPESTMTSG
QARASTQSPQALEDSGPVNISVSITLTLDPLKPFGGYSRNVTHLYSTILGHQIGLSGREA
HEEINITFTLPTAWSSDDCALHGHCEQVVFTACMTLTASPGVFPVTVQPPHCVPDTYSNA
TLWYKIFTTARDANTKYAQDYNPFWCYKGAIGKVYHALNPKLTVIVPDDDRSLINLHLMH
TSYFLFVMVITMFCYAVIKGRPSKLRQSNPEFCPEKVALAEA

>gi568815591r:66888374_67095417|GENSCAN_predicted_CDS_3|1209_bp
atggctcccgcaaagaagggtggtgagaagaaaaagggccattctgccatcaacgaggtg
gtgacccgagaatacaccatcaacattcacaagcgcatccatgaagtgggcttcaagaag
cgtgcccctcgggcactcaaagaaattcagaaatatgccatgaaggagatgggaactcta
gacatgcacattgataccaggctcaacaaagctgtctgggccaaaggaatatcccatacc
atgtctgtggagctgatcagaataatgttcagcatcaaccccctggagaacctgaaggtg
tacatcagcagtcggcctcccctggtggtcttcatgatcagcgtaagcgccatggccata
gctttcctgaccctgggctacttcttcaaaatcaaggagattaaatccccagaaatggca
gaggattggaatacttttctgctacggttcaatgatttggacttgtgtgtatcagagaat
gaaaccctcaagcatctcacaaacgacaccacaactccggaaagtacaatgaccagcggg
caggcccgagcttccacccagtccccccaggccctggaggactcgggcccggtgaatatc
tcagtctcaatcaccctaaccctggacccactgaaacccttcggagggtattcccgcaac
gtcacccatctgtactcaaccatcttagggcatcagattggactttcaggcagggaagcc
cacgaggagataaacatcaccttcaccctgcctacagcgtggagctcagatgactgcgcc
ctccacggtcactgtgagcaggtggtattcacagcctgcatgaccctcacggccagccct
ggggtgttccccgtcactgtacagccaccgcactgtgttcctgacacgtacagcaacgcc
acgctctggtacaagatcttcacaactgccagagatgccaacacaaaatacgcccaagat
tacaatcctttctggtgttataagggggccattggaaaagtctatcatgctttaaatccc
aagcttacagtgattgttccagatgatgaccgttcattaataaatttgcatctcatgcac
accagttacttcctctttgtgatggtgataacaatgttttgctatgctgttatcaagggc
agacctagcaaattgcgtcagagcaatcctgaattttgtcccgagaaggtggctttggct
gaagcctaa

>gi568815591r:66888374_67095417|GENSCAN_predicted_peptide_4|132_aa
MPRDQTYVRLQPSKICRRFSQTLVVLLPCRRASSPSAMIVLALPMGARNCSWKTPIRLEE
EDAEQINSDVLFCKAHMHILQTRKINLEVNAITDQLLTIMQQESEMPEGELVLLHGAALN
SKRTPSVAHFES

>gi568815591r:66888374_67095417|GENSCAN_predicted_CDS_4|399_bp
atgccgagagatcaaacgtacgtaagacttcagcccagcaagatctgccgcagattttct
caaaccctggtggtccttttgccatgcagacgtgcctcttcgccttctgccatgattgta
ctggcattaccaatgggggctcgaaactgcagctggaaaacacccatccggctggaggaa
gaagacgcggagcaaataaattcagatgttttattctgtaaagcacatatgcacatactt
caaacacggaaaattaatctggaagtgaatgccataacagatcagctactgaccatcatg
cagcaagagtccgagatgcctgagggtgagctggtgctcctgcacggggccgccctgaac
agcaagcgcactccgtctgttgcacacttcgaatcctga

>gi568815591r:66888374_67095417|GENSCAN_predicted_peptide_5|328_aa
MILKSRQETRISDLDSPMIRTGTARRTELPRRVSGVPSACRPVGSHDTARPVLIGSEDEE
VGPEESPELRERKPVPAAMSIFTPTNQIRLTNVAVVRMKRAGKRFEIACYKNKVVGWRSG
VEKDLDEVLQTHSVFVNVSKGQVAKKEDLISAFGTDDQTEICKQILTKGEVQVSDKERHT
QLEQMFRDIATIVADKCVNPETKRPYTVILIERAMKDIHYSVKTNKSTKQQALEVIKQLK
EKMKIERAHMRLRFILPVNEGKKLKEKLKPLIKVIESEDYGQQLEIVCLIDPGCFREIDE
LIKKETKGKGSLEVLNLKDVEEGDEKFE

>gi568815591r:66888374_67095417|GENSCAN_predicted_CDS_5|987_bp
atgatattaaagagccgacaggagactaggatctcggacctggatagcccgatgattcgc
actggtaccgcgagacgcaccgagctacctcgccgcgttagcggcgtaccgagtgcctgc
agacctgtgggcagccatgacactgccaggccggtcctgattggttcagaggacgaagag
gtgggcccagaggaaagcccagagctccgggaaaggaagccagtgcccgccgcgatgtcg
atcttcacccccaccaaccagatccgcctaaccaatgtggccgtggtacggatgaagcgt
gccgggaagcgcttcgaaatcgcctgctacaaaaacaaggtcgtcggctggcggagcggc
gtggaaaaagacctcgatgaagttctgcagacccactcagtgtttgtaaatgtttctaaa
ggtcaggttgccaaaaaggaagatctcatcagtgcgtttggaacagatgaccaaactgaa
atctgtaagcagattttgactaaaggagaagttcaagtatcagataaagaaagacacaca
caactggagcagatgtttagggacattgcaactattgtggcagacaaatgtgtgaatcct
gaaacaaagagaccatacaccgtgatccttattgagagagccatgaaggacatccactat
tcggtgaaaaccaacaagagtacaaaacagcaggctttggaagtgataaagcagttaaaa
gagaaaatgaagatagaacgtgctcacatgaggcttcggttcatccttccagtcaatgaa
ggcaagaagctgaaagaaaagctcaagccactgatcaaggtcatagaaagtgaagattat
ggccaacagttagaaatcgtatgtctgattgacccgggctgcttccgagaaattgatgag
ctaataaaaaaggaaactaaaggcaaaggttctttggaagtactcaatctgaaagatgta
gaagaaggagatgagaaatttgaatga

>gi568815591r:66888374_67095417|GENSCAN_predicted_peptide_6|250_aa
MPGQGFATVLAEAVTSLDLPVAIINLKEYDPDDHLIEEVTSKNVCVFLVATYTDGLPTES
AEWFCKWLEEASIDFRFGKTYLKGMRYAVFGLGNSAYASHFNKVGKNVDKWLWMLGAHRV
MSRGEGDCDVVKSKHGSIEADFRAWKTKFISQLQALQKGERKKSCGGHCKKGKCESHQHG
SEEREEGSHEQDELHHRDTEEEEPFESSSEEEFGGEDHQSLNSIVDVEDLGKIMDHVKKE
KACESQLIRG

>gi568815591r:66888374_67095417|GENSCAN_predicted_CDS_6|753_bp
atgcccggccagggattcgcaacagttcttgctgaagcagttacatccctggatctgcct
gtggccattattaatctaaaagaatatgatccagatgatcatctgatagaagaggtgact
agtaaaaatgtctgtgtcttcctggttgcgacatacactgacggcctaccaactgaaagt
gcagagtggttctgcaaatggttagaggaagcatccattgattttcgatttggcaaaact
tacctgaagggtatgagatatgcggtatttggcctgggaaattctgcctatgctagccac
ttcaacaaggttggcaaaaatgttgacaagtggctctggatgcttggcgcgcatcgtgtg
atgagtcgaggggagggcgactgcgacgtggttaaaagcaagcacggcagcattgaggcc
gacttcagagcatggaagaccaagttcatctcccagctgcaggcacttcagaaaggggag
agaaagaagtcctgtggcggccactgcaagaaaggcaaatgtgaatctcaccaacatggc
tcagaggagagggaggaaggatctcatgagcaggatgaattgcatcatagagacaccgag
gaggaagaaccctttgagagctccagtgaagaagagtttggtggtgaggaccatcagagc
ctaaattccattgttgatgttgaagatttgggcaaaattatggatcatgtgaagaaagaa
aaggcctgtgaaagtcagctcatcagaggataa

>gi568815591r:66888374_67095417|GENSCAN_predicted_peptide_7|172_aa
MGHHYMNKCLHLGIPEGEQKGYTRQHFCHESTLNKGDRDIYNLKHSPYCCYQFPLAGMGP
HILYLSQLASNLELFKRGKGRGEQRKEEVNCGMLRKQKITELEDTVIENEGKVKPFSDIR
KLKEFTTNKPALQEMFKKSLESPQETNIITHSCNYNLLNKLLPNPTEIQNSS

>gi568815591r:66888374_67095417|GENSCAN_predicted_CDS_7|519_bp
atgggacaccattacatgaacaaatgtttgcatttaggaattccagaaggagaacagaaa
gggtacactcgccagcacttctgccatgagagtacactgaacaaaggagacagggacatt
tataacctgaaacattcaccctactgctgttaccagtttccattggctggaatgggacct
cacattctgtatttgtcccaattggctagcaacttagaacttttcaaaagaggcaaaggc
agaggagaacaaaggaaggaggaagtaaattgtggaatgctcagaaagcaaaaaattact
gaacttgaagacacagtgatagaaaatgaaggcaaagtaaagccattttcagatatacga
aagctgaaagaattcaccaccaacaaacctgcactacaagagatgttcaaaaagtccttg
gagtcgccacaggaaactaatattattactcattcatgcaactataacctccttaataag
ctgctcccaaaccctacggagatccaaaacagcagctaa

>gi568815591r:66888374_67095417|GENSCAN_predicted_peptide_8|60_aa
XGVFRTQRFDLYQQASPPDALHWIPKPWEWTGPPPREGPSQKAEEPGSRGDKEPGLPPPH

>gi568815591r:66888374_67095417|GENSCAN_predicted_CDS_8|183_bp
nntggagtcttccgcacccagcgcttcgacctttaccagcaggcctccccaccagatgcc
ctgcactggatacctaagccttgggaatggacagggccgccacctcgagaagggccctcc
caaaaggcagaggagcctgggtcccgaggggacaaggagcctggtttgcccccaccccac
tga