GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:29:51 Sequence gi568815594r:69741997_69957644 : 215648 bp : 35.77% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 948 1126 179 1 2 44 93 76 0.381 2.42 1.02 Term + 2294 2509 216 1 0 121 45 146 0.376 9.76 1.03 PlyA + 2669 2674 6 -0.45 2.05 PlyA - 3131 3126 6 1.05 2.04 Term - 5819 5632 188 0 2 74 37 114 0.273 1.57 2.03 Intr - 12802 12674 129 2 0 90 99 91 0.877 10.15 2.02 Intr - 13265 13074 192 0 0 95 109 94 0.859 10.74 2.01 Init - 20812 20692 121 1 1 68 74 85 0.698 5.50 2.00 Prom - 23626 23587 40 -7.05 3.03 PlyA - 24036 24031 6 1.05 3.02 Term - 31501 31434 68 2 2 121 43 59 0.260 1.82 3.01 Init - 41123 40979 145 2 1 89 59 100 0.415 7.54 3.00 Prom - 43535 43496 40 -7.55 4.03 PlyA - 43652 43647 6 1.05 4.02 Term - 49985 49876 110 0 2 97 48 101 0.983 4.69 4.01 Init - 58887 58761 127 0 1 43 65 119 0.320 5.47 4.00 Prom - 59004 58965 40 -4.25 5.00 Prom + 64678 64717 40 -3.05 5.01 Init + 64930 65146 217 0 1 60 56 176 0.448 10.50 5.02 Intr + 75972 76094 123 1 0 94 0 94 0.028 0.84 5.03 Intr + 84355 84470 116 2 2 67 70 80 0.310 3.35 5.04 Intr + 84965 85066 102 1 0 102 88 55 0.730 6.35 5.05 Term + 85721 86038 318 1 0 62 42 201 0.873 6.80 5.06 PlyA + 86207 86212 6 1.05 6.02 PlyA - 86682 86677 6 -3.24 6.01 Sngl - 87571 87104 468 1 0 48 33 334 0.955 19.78 6.00 Prom - 87820 87781 40 -13.20 7.08 PlyA - 88520 88515 6 -0.45 7.07 Term - 88894 88574 321 1 0 22 54 246 0.880 8.44 7.06 Intr - 89194 89018 177 1 0 80 96 113 0.732 10.49 7.05 Intr - 96561 96493 69 0 0 65 73 67 0.006 1.36 7.04 Intr - 96888 96798 91 2 1 26 91 39 0.001 -2.92 7.03 Intr - 102345 102165 181 1 1 84 76 172 0.029 13.60 7.02 Intr - 113430 113305 126 1 0 62 52 66 0.459 0.13 7.01 Init - 115648 115504 145 1 1 79 59 139 0.979 10.43 7.00 Prom - 128407 128368 40 -4.95 8.02 PlyA - 129816 129811 6 1.05 8.01 Sngl - 141896 141585 312 2 0 73 42 160 0.450 5.58 8.00 Prom - 147383 147344 40 -4.95 9.02 PlyA - 147574 147569 6 1.05 9.01 Sngl - 173611 173132 480 1 0 9 48 258 0.800 9.73 9.00 Prom - 178337 178298 40 -2.35 10.04 PlyA - 178647 178642 6 1.05 10.03 Term - 181876 181704 173 2 2 52 42 164 0.907 5.21 10.02 Intr - 182894 182664 231 0 0 38 8 202 0.111 4.02 10.01 Init - 183099 182997 103 0 1 47 68 98 0.675 2.22 10.00 Prom - 185637 185598 40 -3.65 11.04 PlyA - 186407 186402 6 1.05 11.03 Term - 204918 204804 115 2 1 108 47 75 0.333 2.46 11.02 Intr - 210062 210013 50 2 2 92 91 34 0.447 0.66 11.01 Init - 215610 215278 333 0 0 61 84 183 0.653 12.54 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 75972 76141 170 1 2 94 41 117 0.817 4.66 S.002 Term - 100110 99998 113 1 2 108 48 133 0.974 9.04 S.003 Init + 190560 190610 51 2 0 83 95 41 0.863 4.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:69741997_69957644|GENSCAN_predicted_peptide_1|131_aa XDTEEHSGTQKLGDTRNHRALKRVSQPWLRELLGLGSLKGHRSSLFLSSVFLVTHKVVSK RRRSGPGQLCTKEPDTVEALDLEVGPARPCKGKGGAVGCYGVMGHRGPTAAFAVPAAAPA TTTHVSPLQPM >gi568815594r:69741997_69957644|GENSCAN_predicted_CDS_1|396_bp nctgataccgaggaacacagtggcacccagaagcttggagacaccaggaaccacagagcc ctaaagagggtgtcacaaccctggctcagggagctcctaggtttgggctccttgaagggc cacaggtcttctctctttctctcttctgtctttcttgtcacacacaaggtggtgagcaag cggaggcgcagcggccctggccaactttgtacaaaggaacccgacacagtggaggctctg gacctggaagtgggtcctgcccggccatgcaaaggtaagggtggtgcagtaggctgctat ggggtcatggggcacaggggtcccactgccgcctttgctgttcctgcagccgctcctgcc accaccacccatgtctccccactgcagccgatgtga >gi568815594r:69741997_69957644|GENSCAN_predicted_peptide_2|209_aa MTLNEHAAFKRLFNKAHLAPPLIHSTLSGHSTCFREHRVGEQLSIYYIFVQSGIKMLSPK DILRKDLKLVHGYPMTCAFASNWEKIEQFHSRPDDIVIATYPKSGTTWVSEIIDMILNDG DIEKCKRGFITEKVPMLEMTLPGLRTSAPREGVSETDMEQPTLTMDLQDPSCRRLLDAQR HFSWQGELPRELAETVLYAVWSPEGLVQE >gi568815594r:69741997_69957644|GENSCAN_predicted_CDS_2|630_bp atgactctcaatgagcatgctgccttcaagcgtctgtttaacaaagcacatcttgcaccg cccttaatccattcaaccctgagtggacacagcacatgtttcagagagcacagggttggg gaacaactgtcaatctattatatatttgtacaatctggtattaaaatgctttccccaaaa gatattctgcgaaaagatctgaagttggtccatggttatcccatgacctgtgcttttgca agcaactgggaaaaaattgaacagttccatagcagaccagatgacattgtgatagccact tatcctaaatcaggtactacttgggttagtgaaattatagacatgattctaaatgatgga gatattgaaaaatgtaagcgaggttttattactgaaaaagttccaatgttggaaatgact ctccctggattaagaacatcagctcctagagaaggagtgagtgaaacagacatggaacag cccactcttaccatggacctccaggatcctagctgcaggagactccttgatgcccagaga cattttagctggcagggagaacttcccagagaattggcagagacagtactctacgctgtg tggagtccagagggtttggtgcaggaatag >gi568815594r:69741997_69957644|GENSCAN_predicted_peptide_3|70_aa MEQIVGCRPWAQPSGVSGSAVTHGRAEPRDVTPHRASSTGTGTRHPWKGEPFILREQRDE DPSSVTFSAE >gi568815594r:69741997_69957644|GENSCAN_predicted_CDS_3|213_bp atggagcagatcgtgggctgcaggccctgggcccagccatcgggggtgtcaggctcagca gtcacccatggcagggcagaacccagggatgtgactccacacagagcctcctctacaggc acaggaaccaggcacccttggaagggagaaccctttatcctaagagagcagagggatgaa gatccatcttctgtaactttttctgctgaataa >gi568815594r:69741997_69957644|GENSCAN_predicted_peptide_4|78_aa MIYMAWNAKDVAVSYYYFRQMAKMHPDPGTWEEFLEKFMTGKCVSGDWKNQFTVAQYEKF EDYEKKMKESTLQFRSEI >gi568815594r:69741997_69957644|GENSCAN_predicted_CDS_4|237_bp atgatctacatggcatggaatgccaaagatgtggctgtgtcttattattatttccgccag atggcaaaaatgcacccagatcctggaacttgggaggagttcctggaaaaattcatgact ggaaagtgtgtttcaggagactggaagaatcaattcactgtagcccagtatgagaaattt gaagattatgaaaaaaaaatgaaagagtccacactgcagtttcgatcagaaatctaa >gi568815594r:69741997_69957644|GENSCAN_predicted_peptide_5|291_aa MWRRLWNWVRGRNWKSLEGLEEDRKMGESLELPRDLLNGIDENADNDIHNDNEIQAKVVS DGDEKLIGSWNKVPQCGDHLPMGQQETTYLLTPLMCTPYMDCTDMLQGPSLSHRGQAWVQ VFENVLVRTAKSNIPRSSLWSRRKTSVSAAVSKETSKEISKEPQKSPSYRLCRLQAVGGG EFGSTQGQEEQAQKEKRDQRKAAALVMALRQTNFGGSERTENGAGQLPVRACYQCGLLGH FKKDCPMRNKPPPHPCLLCQPLIHVCYADAITGRHTAPEGKGSLGQKPPTR >gi568815594r:69741997_69957644|GENSCAN_predicted_CDS_5|876_bp atgtggaggcgactttggaactgggtaagagggagaaattggaaaagtttggagggccta gaagaagacaggaaaatgggtgaaagtttggaacttcctagagacttgttgaatggcatt gacgaaaatgctgacaatgatatacacaatgacaatgaaatccaggctaaggtggtctca gatggagatgagaaacttattgggagctggaataaagtccctcagtgtggtgaccacctc cctatgggacagcaagagacaacctacttactcaccccactaatgtgcacaccctacatg gactgcacagatatgctgcagggaccatctctcagtcacagaggacaggcatgggtgcag gtttttgagaatgtgttggtaaggactgctaaatccaacattcctcggtcctccttgtgg tctaggaggaaaactagtgtttctgctgctgtgtcaaaggaaacaagcaaagaaatctcc aaggaaccacaaaaatcccccagctatcggttatgtcgccttcaagctgtagggggaggg gaatttggctcaacccagggccaagaggaacaggcccaaaaggaaaagcgagatcagaga aaggccgcagccttagtcatggccctcagacaaacaaactttggtggttcagagaggaca gaaaatggagcaggccaattacccgtgagggcttgttatcagtgtggtttactaggacac tttaaaaaagattgtccaatgagaaacaagccaccccctcatccatgtctgctatgccaa cccctcattcatgtctgctatgccgatgcaatcactggaaggcacactgccccagagggc aaaggttctttaggccagaagcccccaaccagatga >gi568815594r:69741997_69957644|GENSCAN_predicted_peptide_6|155_aa MVLTLTVLKAEEWRLYAEKTLEAGVNDMHTLLTKIPRVRAESNLLGLTVNQAPIVVDLKP GATLVWVHQYPLLLSKAIWGIHKHLERLHKHRIIVKCQSLWNTPLLSVRKLSSEYRLMQD LHAVNQAIVTIHPVVPNLYTLMGHIPASATWFLQS >gi568815594r:69741997_69957644|GENSCAN_predicted_CDS_6|468_bp atggtgttaacccttactgtcctgaaagctgaggaatggaggctatatgcagaaaagacc ctggaagcaggagtaaatgatatgcacacattacttactaaaattcccagagttcgggct gaaagtaacctgcttggactgacagtaaatcaggcaccaattgtagtggacttaaaacca ggagcaactctggtttgggtccatcagtacccactactactttccaaggccatatgggga atccacaaacacttagagcgacttcataagcatagaatcatagtcaaatgccagtcactg tggaatacccctctcttgtcagtgaggaagctgtctagtgaatataggctaatgcaggat ttgcatgcggtaaaccaagctatagtgaccatccacccagtggtgccaaacctgtatact ttaatgggacatattccagcaagtgccacttggtttttacagtcctag >gi568815594r:69741997_69957644|GENSCAN_predicted_peptide_7|369_aa MNSELDYYEKFEEVHGILMYKDFVKYWDNVEAFQARPDDLVIATYPKSGTTWVSEIVYMI YKEGDVEKCKEDVIFNRIPFLECRKENLMNGYQKRGDKIDTFPGKEAIRGACGQDYTSYF VPRDEEQSIHKLHNTARRNYEPEIVALHEKGVRSILGLSYKGQVQYEGKSNFGVEKPYLS QGPVKVQRHKRPPSAPFADTYLSLVPINPPLPRPTAFAAGSSVRPTAPAAHKELNPSGYM LPASPEKEESELQKVRAESLESQLFSITDLMNWKHHAPSYTEKPQAFIDLMQSIIQTTNP TWPDWKQLLLTLFNMEECQRLTQAALQWLEANAPADTVNAQAYAQGQFPETDPNWDPNDA SQLQHLQQY >gi568815594r:69741997_69957644|GENSCAN_predicted_CDS_7|1110_bp atgaattctgaacttgactattatgaaaagtttgaagaagtccatgggattctaatgtat aaagattttgtcaaatattgggataatgtggaagcgttccaggcaagaccagatgatctt gtcattgccacctaccctaaatctggtacaacctgggttagtgaaattgtgtatatgatc tataaagagggtgatgtggaaaagtgcaaagaagatgtaatttttaatcgaatacctttc ctggaatgcagaaaagaaaacctcatgaatggatatcagaaaagaggtgataaaattgat acatttcctggaaaggaagccatcagaggagcttgtggacaggattatacatcatacttc gttccaagagatgaagaacaatccatccacaaattacacaacactgccagacgaaattat gaaccagaaattgtcgcccttcatgagaaaggtgtgagatccatattgggactttcttat aaagggcaagtgcagtatgaggggaagagtaactttggagtggagaaaccctacctcagc cagggtccagtaaaagtacagcgtcataaaagacctccttctgctccctttgctgacacc taccttagcctcgttccaatcaacccccctttaccaagaccaactgcttttgctgctggt tcatctgtgaggccaactgccccagctgcccacaaggagttaaatccgagtgggtacatg ctcccagcctcacctgagaaggaggaatcagagctacagaaagttagggcggaaagccta gaaagtcagctcttttcaatcactgatctcatgaactggaagcatcatgctccctcctac acagagaagccacaggcttttatagacttgatgcagtccattattcagacaaccaatcca acctggccagattggaaacagcttcttctgacactgtttaatatggaagagtgccagaga ctgacacaggcagccctccagtggctagaagcaaatgctcctgcagacacagttaatgcc caggcatatgcgcagggccagttccctgaaacagaccccaattgggacccaaatgacgca tcccagctgcagcatctgcagcagtactga >gi568815594r:69741997_69957644|GENSCAN_predicted_peptide_8|103_aa MAKSGQCTAQGIDSEEASPKPWWLTHGVGLVGIQKSIIEVWKPPPRFQKMYRNAWMSREK FAASVEPSWGTSARAMQKGNVGLELPQRVPTRTLPSEVVRRRL >gi568815594r:69741997_69957644|GENSCAN_predicted_CDS_8|312_bp atggctaaaagtggccaatgtacagctcagggcattgattctgaggaggcaagtcccaag ccttggtggcttacacatggtgttgggcttgtgggtatacagaagtcaataattgaggtt tggaaacctccccctagatttcagaagatgtatagaaatgcctggatgtccagggagaag tttgctgcaagtgtggagccctcatggggaacctctgctagggcaatgcagaagggaaat gtggggttggagctgccacaaagagtccccacaaggacattgcctagtgaagttgtgaga agacggctatga >gi568815594r:69741997_69957644|GENSCAN_predicted_peptide_9|159_aa MSLLGRIGSQKHKAKSHNRPSENWERQKLVVAESKSKSLKTREANSAAISLWPKAQGSPQ TAGASPSVQRIKNLESDVQRQEEWKEAFSTGERRKPEDSARKAMLPSSGCFILVVLAAVL VVSTHTEGRSSSSSPLSQMSVSSGNTLTDIPRNNTLPTT >gi568815594r:69741997_69957644|GENSCAN_predicted_CDS_9|480_bp atgagtttattagggagaattggctcacaaaagcacaaggcaaagtcccataataggcca tctgaaaactgggagagacagaaactggtagtggctgagtccaagtccaaaagcctcaaa accagggaagccaacagtgcagcgatcagtctgtggccaaaagcccaagggtccccacaa accgctggtgcaagtcccagtgtccaaaggataaagaacctagagtctgacgtccaaagg caggaggagtggaaagaagcattcagcacaggagaaagaaggaagccagaagactcagca agaaaagctatgctaccttcttcaggctgttttattctagttgtgctggcagctgttttg gtggtgtccacccacactgaaggtcggtcttcctcttccagtccactgagtcaaatgtca gtctcctctggcaacaccctcacagacatacccagaaacaatactttaccaaccacttag >gi568815594r:69741997_69957644|GENSCAN_predicted_peptide_10|168_aa MKLRTLAVSVTALKVARLEFVPSDVQMCSEFLPSGGLVGLLASGVKLQTFTVSVTAHKSS VDPNSEQQQDLLQRRKEQSFHSVEGDPSGLPHGLRQPAFILLSGPTHILLIAEEAGSGLG QHRKGLPQCSGGLKGSSSAVKVGAQAEEAQRASEGCEDCQRAVVVTSQ >gi568815594r:69741997_69957644|GENSCAN_predicted_CDS_10|507_bp atgaagctgcggaccctcgcggtgagtgttacagctcttaaggtggcgcgtctggagttt gttccttctgatgttcagatgtgttcggagtttcttccttctggtgggctcgtgggcttg ctggcttcaggagtgaagctgcagaccttcacggtgagtgttacagctcataaaagcagt gtggacccaaacagtgagcagcagcaagatttattgcaaagaaggaaagaacaaagcttc cacagtgttgaaggggaccccagcgggttgccacatgggctcaggcagcctgcttttatt ctcttatctggccccacccacatcctgctgattgctgaggaagccggctccggccttggc cagcacagaaaggggctcccacagtgcagcggtgggctgaagggctcctcaagtgccgtc aaagtgggagcccaggcagaggaggcgcagagagcgagcgagggctgtgaggactgccag cgtgctgtcgtcgtcacctctcagtag >gi568815594r:69741997_69957644|GENSCAN_predicted_peptide_11|165_aa MPVLKSPTIPFFDPQIPKLTDLENLHLPLPLLQPLMQQVPQPIPQTLALPPQPLWSVPQP KVLPIPQQVVPYPQRAVPVQALLLNQELLLNPTHQIYPVTQPLAPVHNPISVYNDKDKDY SMTLIKVCSDNVLMENVLQTTDYKLKILDKFINAFYDLAASAIQK >gi568815594r:69741997_69957644|GENSCAN_predicted_CDS_11|498_bp atgcctgtccttaaatctccaacgatacccttttttgaccctcaaatcccaaaactcact gatcttgaaaatctgcatcttcctctgcctctgctccagcccttgatgcagcaggtccct cagcctattcctcagactcttgcacttccccctcagcccctgtggtctgttcctcagccc aaagtcctgcctatcccccagcaagtggtgccctaccctcagagagctgtgcctgttcaa gcccttctgctcaaccaagaacttctacttaaccccacccaccagatctaccctgtgact cagccacttgccccagttcataaccccattagtgtgtataatgataaagacaaagattat tcaatgacacttattaaagtatgctctgacaatgtcctcatggaaaatgttctacaaacc actgactataaattaaagatattggacaagtttataaatgccttttatgatctggctgct agtgccatccagaagtga