GENSCAN 1.0    Date run: 5-Nov-116    Time: 14:01:42

Sequence gi568815595r:125132344_125413640 : 281297 bp : 40.70% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   9314   9678  365  1  2   60    5   338 0.211  17.27
 1.02 Intr +  14175  14297  123  0  0   72  100    44 0.093   2.78
 1.03 Intr +  15053  15152  100  2  1   22   85    70 0.045  -0.71
 1.04 Term +  15243  15410  168  2  0  130   51   136 0.905  11.00
 1.05 PlyA +  20205  20210    6                               1.05

 2.00 Prom +  20347  20386   40                              -7.45
 2.01 Init +  21803  22012  210  1  0   53   55   141 0.387   6.13
 2.02 Intr +  28225  28359  135  0  0   93   56    74 0.510   4.54
 2.03 Intr +  30081  30291  211  2  1   55   46   188 0.013   8.86
 2.04 Intr +  31751  31912  162  0  0   30   51   140 0.015   3.53
 2.05 Term +  33465  33586  122  1  2   45   34   101 0.003  -1.94
 2.06 PlyA +  34153  34158    6                               1.05

 3.10 PlyA -  34675  34670    6                               1.05
 3.09 Term -  36101  36045   57  2  0   67   38   102 0.353  -0.19
 3.08 Intr -  37569  37279  291  0  0   67   64   168 0.260   8.71
 3.07 Intr -  45296  45259   38  0  2   99   99    35 0.834   2.76
 3.06 Intr -  45712  45400  313  1  1   17  115   256 0.046  16.03
 3.05 Intr -  55085  54894  192  2  0   93   94   271 0.898  26.97
 3.04 Intr -  58178  58032  147  2  0  143   89    70 0.968  12.11
 3.03 Intr -  79125  78956  170  1  2    7   96   104 0.067   1.84
 3.02 Intr -  80241  80090  152  2  2   95   68    95 0.156   7.09
 3.01 Init -  82542  82436  107  0  2   40   48   142 0.117   5.14
 3.00 Prom -  85828  85789   40                              -6.15

 4.08 PlyA -  92419  92414    6                               1.05
 4.07 Term - 101596  99998 1599  1  0   27   42  1361 0.780 114.54
 4.06 Intr - 101986 101868  119  2  2   66  110    55 0.667   4.76
 4.05 Intr - 145466 145383   84  0  0   38  105    84 0.253   3.97
 4.04 Intr - 146904 146781  124  0  1   70   68    18 0.095  -2.76
 4.03 Intr - 155885 155760  126  2  0   78   69   164 0.899  13.46
 4.02 Intr - 160338 160117  222  0  0  103   88    64 0.130   5.30
 4.01 Init - 181297 180965  333  1  0   62   95   386 0.958  34.04
 4.00 Prom - 184300 184261   40                              -4.15

 5.00 Prom + 188276 188315   40                              -3.45
 5.01 Init + 213079 213219  141  0  0   55   52   107 0.382   3.98
 5.02 Intr + 224953 225018   66  0  0  140   48    30 0.001   2.38
 5.03 Intr + 226659 226817  159  2  0   71   50    99 0.004   3.66
 5.04 Term + 242963 243217  255  1  0  129   48   147 0.489   9.40
 5.05 PlyA + 246925 246930    6                               1.05

 6.05 PlyA - 247175 247170    6                               1.05
 6.04 Term - 264829 264747   83  2  2  110   41    90 0.918   3.38
 6.03 Intr - 265396 265207  190  1  1   31   99   128 0.888   6.44
 6.02 Intr - 268139 268002  138  2  0  103  100   113 0.979  13.74
 6.01 Intr - 270940 270846   95  2  2   96    7   116 0.834   3.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 138956 139108  153  1  0  124   47    97 0.888   6.14
S.002 Sngl + 178608 178829  222  2  0   93   49   189 0.944  10.47

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:125132344_125413640|GENSCAN_predicted_peptide_1|251_aa
MMLGLQAAGCGLRAAGCCRGGGFRRRSLLGPPPLDQRPTTSQTALSCEWTMGEAGATENS
QRGGRAPDGARRVGLGGRRKSGSARKGKGEAASGKGPARRRPKGVGGSGRPRRLREAPQQ
RNTLPRDPQRWHPASLLKAWAPPCRTDVGIEDKGGFLAKDGGSSERRKPEVAPSCASVCS
FASRCVPFLQALMESEVAAVTGVLDESSDHTSCPTSCLDSLGSVLMTDQPREHEGRARGG
ERMLLLLTERD
>gi568815595r:125132344_125413640|GENSCAN_predicted_CDS_1|756_bp
atgatgctggggctgcaggctgcgggctgcgggctgcgggctgcgggctgctgccgcggc
gggggcttccggcggcgctctcttctgggtcccccacccctggaccagcgaccgacgacc
agccagacagccctttcctgcgaatggacaatgggagaggctggcgcaaccgagaatagc
cagcgcggaggaagggctccggacggagctaggagggtggggctcggagggcgcaggaag
agcggctctgcgaggaaagggaaaggagaggccgcttctgggaagggacccgcacgacga
cgcccgaagggcgtcgggggaagtggtaggccccggagactgcgcgaggctcctcagcaa
aggaacaccctccccagggatcctcagagatggcaccctgcatccttgcttaaggcctgg
gctcctccctgccgtacagatgtgggaatagaggacaaagggggcttcctggccaaggat
ggtggaagctctgagaggagaaaaccagaagtggccccttcctgtgcctctgtgtgttcc
tttgcaagccgttgtgtgccttttctgcaggcgcttatggagagtgaggtggctgcagtc
actggggtgctggatgagtccagtgaccatacatcctgccctacatcctgcctggacagc
ctgggaagtgttttgatgactgatcaacccagagagcacgaggggagagcaagaggagga
gaaaggatgctgctgctcctgactgaacgtgattga
>gi568815595r:125132344_125413640|GENSCAN_predicted_peptide_2|279_aa
MLASSLPVQNTLTTVPFHFTSQQLRKLHFQCHALHGTYVSCLFSNSVSEEQAAPAAGNNV
SLMVKDMDPEPLPSLNQPLVANASLDEAGDAYNTLSLFTLRIKTWIPKSGKLSLLYQSHQ
QGPLDHVTRDKANGGWKQQERGADPPAPTPPPGFHPSLSTSEDFQIQIVFGLKNFLVTAK
WQAGHVLTRHERAQRRLAGSQTPLIRKEKGSSKEFTNFLQGHPRASGCQGAPGNHALESD
TVPGQLRGPESSQNIHVFPASRVEQPPGEILGVFLGKEY
>gi568815595r:125132344_125413640|GENSCAN_predicted_CDS_2|840_bp
atgctggcttcaagtctcccagtgcaaaatactttaactaccgtcccctttcacttcact
agtcagcagcttcgcaaactgcattttcaatgccatgccctacatgggacttatgtgtcc
tgcttgttttctaattcagtcagtgaagagcaggcggccccagcagctggaaacaatgtg
agcttgatggttaaggacatggaccctgagcctttgccatctctcaatcagccactggtt
gcaaatgcgagtttagatgaagcgggtgacgcttataacaccttaagtcttttcacactc
aggattaagacatggatccctaaaagtggcaaacttagcctgctgtatcaatcccatcag
cagggtcccttagatcacgtcaccagagacaaagccaacggagggtggaagcaacaggag
cgtggtgcagatccaccagccccaactccacctcctgggtttcatccctcccttagcacc
agcgaggacttccagattcagattgtttttggtcttaagaactttcttgtcacagccaag
tggcaggcgggacatgttttaaccaggcatgaacgggcccagcggagactggcaggttct
cagacccccttgatccgaaaggagaaaggaagcagtaaggagttcaccaacttcctccaa
gggcaccccagagccagcgggtgccagggggcaccagggaatcacgctttggagtcagac
acagtgccaggccagctgcgaggcccagagagcagccagaacatccatgtgttccctgca
tcccgagtggaacagccacctggggagattctgggtgtgtttctgggtaaagaatattga
>gi568815595r:125132344_125413640|GENSCAN_predicted_peptide_3|488_aa
MYDPPKVNKKNDSADHSDESVEIQSALHICGFDQGRWILSYRRSPARRHTGLAPVLGVRV
RVCVGCQGARVPAWPKREGRGSAEEPVERKDGQQGERRGEASLTKDAVTFQSSLPVQITR
IPACWRMTQMSQVQELFHEAAQQDALAQPQPWWKTQLFMWEPVLFGTWDGVFTSCMINIF
GVVLFLRTGWLVGNTGVLLGMFLVSFVILVALVTVLSGIGVGERSSIGSGGVYSMISSVL
GGQTGGTIGLLYVFGQGSQRWAVLPMGPWLSVLLTLLPGIACQCVAGAMYITGFAESISD
LLGLGNIWAVRGISVAVLLALLGINLAGVKWIIRLQLLLLFLLAVSTLDFVVGSFTHLDP
GFTYNWCELFLDQMLPAFSKCLVVAVHPLLVMRHHRADREFHPQPKWVAVVTGGHHSRVI
GGKLRCFIGSPPNVRFCRSFSGIVRILQRRILSSPARDMPGCISSGSRVELSYRQLLSDL
YLAVWAPS
>gi568815595r:125132344_125413640|GENSCAN_predicted_CDS_3|1467_bp
atgtatgacccaccaaaagtgaataagaaaaatgactctgctgatcattctgatgaatct
gtggaaatacagtcagcccttcatatctgtggatttgaccaaggcaggtggatcctctcc
taccgccggtcacccgcgcgccggcacacgggcctggccccagtacttggggtgcgggtc
cgcgtctgcgtggggtgccaaggggcccgggtcccagcgtggcccaaacgtgagggtagg
ggctccgcggaggagccagtagagagaaaggatggacaacaaggggagaggagaggagaa
gccagcctgaccaaggatgctgttaccttccagagcagtctccctgtccagatcaccagg
atccctgcttgctggagaatgacccagatgtcccaggtgcaggagctcttccatgaggca
gcccagcaggatgccctggcccagccccagccctggtggaagacccagctgttcatgtgg
gagcctgtgctgtttgggacctgggatggtgtgttcacatcctgcatgatcaacatcttt
ggggttgtgctcttcctgaggactggctggctggtgggaaacacaggagtgctcctgggc
atgttcctggtgtccttcgtcatcctggtggccctcgtcacggtgctgtctggcattggc
gtcggggagcgcagcagcatcggcagcggtggcgtctactccatgatctcctcggtcctg
ggtgggcagacgggaggcaccatcgggctgctctatgtgtttggacagggttctcagcgc
tgggcggttctgcccatgggcccatggctttctgtcctgctgactcttctacctggaatc
gcatgtcagtgtgttgcaggtgccatgtatatcaccggctttgctgaatccatctcggat
ttgctgggcctcgggaatatctgggctgtgcgaggaatttcagttgcggtgcttctggcc
ttgctgggcattaacctcgcaggtgtcaaatggataatccgcctccagctgctgttgctg
ttcctgctggccgtgtccacactggactttgtggtgggttctttcacccacctggaccca
ggctttacctacaattggtgtgagctgttcctggatcagatgctgccagctttctccaaa
tgtctggtagtcgctgtccatccattgttggtaatgaggcaccacagagctgatcgggag
ttccatccacaacccaagtgggtggcagttgtgacaggtgggcatcacagtagggtgatt
ggtggcaaactgcgttgtttcattggaagtcccccaaatgttcgtttctgcaggtctttc
tctggtatagttcgtattctccagagaagaattctctcatctcctgccagggacatgccc
ggctgcatcagttcggggagcagagtggagctctcatatcgccagctcctgtcggatctc
tacctggctgtctgggcaccatcttga
>gi568815595r:125132344_125413640|GENSCAN_predicted_peptide_4|868_aa
MNIDDKLEGLFLKCGGIDEMQSSRTMVVMGGVSGQSTVSGELQDSVLQDRSMPHQEILAA
DEVLQESEMRQQDMISHDELMVHEETVKNDEEQMETHERLPQGLQYALNVPSFLFYSPDF
KHKELRTTDVDLLKLLQNTGTYYWKDGSRRTAMKKRRELKLGDFMELGRLEFPSRIGMVS
TVSGKISVKQEITFTDVSEQLMRDKKQIREPVDLQKKKKRKQRSPAKILTINEDGSLGLK
TPKSHVCEHCNAAFRTNYHLQRHVFIHTGEKPFQCSQCDMRFIQKYLLQRHEKIHTGEKP
FRCDECGMRFIQKYHMERHKRTHSGEKPYQCEYCLQYFSRTDRVLKHKRMCHENHDKKLN
RCAIKGGLLTSEEDSGFSTSPKDNSLPKKKRQKTEKKSSGMDKESALDKSDLKKDKNDYL
PLYSSSTKVKDEYMVAEYAVEMPHSSVGGSHLEDASGEIHPPKLVLKKINSKRSLKQPLE
QNQTISPLSTYEESKVSKYAFELVDKQALLDSEGNADIDQVDNLQEGPSKPVHSSTNYDD
AMQFLKKKRYLQAASNNSREYALNVGTIASQPSVTQAAVASVIDESTTASILESQALNVE
IKSNHDKNVIPDEVLQTLLDHYSHKANGQHEISFSVADTEVTSSISINSSEVPEVTPSEN
VGSSSQASSSDKANMLQEYSKFLQQALDRTSQNDAYLNSPSLNFVTDNQTLPNQPAFSSI
DKQVYATMPINSFRSGMNSPLRTTPDKSHFGLIVGDSQHSFPFSGDETNHASATSTQDFL
DQVTSQKKAEAQPVHQAYQMSSFEQPFRAPYHGSRAGIATQFSTANGQVNLRGPGTSAEF
SEFPLVNVNDNRAGMTSSPDATTGQTFG
>gi568815595r:125132344_125413640|GENSCAN_predicted_CDS_4|2607_bp
atgaacattgacgacaaactggaaggattgtttcttaaatgtggcggcatagacgaaatg
cagtcttccaggacaatggttgtaatgggtggagtgtctggccagtctactgtgtctgga
gagctacaggattcagtacttcaagatcgaagtatgcctcaccaggagatccttgctgca
gatgaagtgttacaagaaagtgaaatgagacaacaggatatgatatcacatgatgaactc
atggtccatgaggagacagtgaaaaatgatgaagagcagatggaaacacatgaaagactt
cctcaaggactacagtatgcacttaatgtcccttctttccttttttattctcccgacttt
aaacataaagagttaagaaccactgatgtagatttactgaaactgttacaaaatacaggc
acatattactggaaggatggatccagaagaactgctatgaagaaaaggagagaacttaaa
ttaggtgattttatggaacttggaagactagaatttccaagtaggatagggatggtcagc
actgtttcaggtaagataagcgtaaagcaggaaattacttttactgatgtatctgagcaa
ctgatgagagacaaaaaacaaatcagagagccagtagacttacagaaaaagaagaagcgg
aaacaacgttctcccgcaaaaatccttacaataaatgaggatggatcacttggtttgaaa
acccctaaatctcacgtttgtgagcactgcaatgctgcctttagaacgaactatcactta
cagagacatgtcttcattcatacaggtgaaaaaccatttcaatgtagtcaatgtgacatg
cgtttcatacagaagtacctgcttcagagacatgagaagattcatactggtgaaaaacca
tttcgctgtgatgaatgtggtatgagattcatacaaaaatatcatatggaaaggcataag
agaactcatagtggagaaaaaccttaccagtgtgaatactgtttacagtatttttccaga
acagatcgtgtattgaaacataaacgtatgtgccatgaaaatcatgacaaaaaactaaat
agatgtgccatcaaaggtggccttctgacatctgaggaagattctggcttttctacatca
ccaaaagacaactcactgccaaaaaagaaaaggcagaaaacggagaaaaaatcatctgga
atggacaaagagagtgctttggacaaatctgacctgaaaaaagacaaaaatgattacttg
cctctttattcttcaagtactaaagtaaaagatgagtatatggttgcagaatatgctgtt
gaaatgccacattcgtcagttgggggctcgcatttagaagatgcgtcaggagaaatacac
ccacctaagttagttctcaaaaaaattaatagtaagagaagtctgaaacagccactggag
caaaatcaaacaatttcacctttatccacatatgaagagagcaaagtttcaaagtatgct
tttgaacttgtggataaacaggctttactggactcagaaggcaatgctgacattgatcag
gttgataatttgcaggaggggcccagtaaacctgtgcatagtagtactaattatgatgat
gccatgcagtttttgaagaagaagcggtatcttcaagcagcaagtaacaacagcagggaa
tatgcgctgaatgtgggtaccatagcttctcagccttctgtaacacaagcagctgtggca
agtgtcattgatgaaagtaccacggcatccatattagagtcacaggcactgaatgtggag
attaagagtaatcatgacaaaaatgttattccagatgaggtactgcagactctgttggat
cattattcccacaaagctaatggacagcatgagatatccttcagtgttgcagatactgaa
gtgacttctagcatatcaataaattcttcagaagtaccagaggtcaccccgtcagagaat
gttggatcaagctcccaagcatcctcatcagataaagccaacatgttgcaggaatactcc
aagtttctgcagcaggctttggacagaactagccaaaatgatgcctatttgaatagcccg
agccttaactttgtgactgataaccagaccctcccaaatcagccagcattctcttccata
gacaagcaggtctatgccaccatgcccatcaatagctttcgatcaggaatgaattctcca
ctaagaacaactccagataagtcccactttggactaatagttggtgattcacagcactca
tttcccttttcaggtgatgagacaaaccatgcttctgccacatcaacacaggactttctg
gatcaagtgacttctcagaagaaagctgaggcccagcctgtccaccaagcttaccaaatg
agctcctttgaacagcccttccgtgctccctatcatggatcaagagctggaatagctact
caatttagcactgccaatggacaggtgaaccttcggggaccagggacaagtgctgaattt
tcagaatttcccttggtgaatgtaaatgataatagagctgggatgacatcttcacctgat
gccacaactggccagacttttggctaa
>gi568815595r:125132344_125413640|GENSCAN_predicted_peptide_5|206_aa
MRKDVEPVIELPKETKKEQNEAKARRKELLKEEIKEENRENQLSKKFRGARAQTHARTHA
LTHKRKSWKILIIGLTHTIHPNIQNMLSLNKVQSEAHPKNNSRLNQAGLLHDGTAKFHEL
SKSSRDADMPQLERRRGRRHCRPGALWEAPHTGSAPRRPPSSGAETWLERHSVASIPAPA
CHQPRVAVEAGPEPQAPQAGPGRKAV
>gi568815595r:125132344_125413640|GENSCAN_predicted_CDS_5|621_bp
atgaggaaagatgtcgaaccagtaattgaattaccaaaagaaactaaaaaagagcaaaat
gaagccaaagcaagaaggaaggaattattaaaagaagaaataaaagaggaaaatagagaa
aatcaattaagcaaaaaatttaggggcgcacgcgcacagacacacgcacgcacgcacgca
cttacacacaaaaggaagtcatggaagatactcatcattggcctaacccacacaatccac
ccaaatattcagaatatgctctccctcaacaaggtccagtctgaggcacatcctaaaaac
aactctagactgaatcaggcagggctgctacatgatggtacagccaagtttcatgagttg
agcaaaagcagtagagacgcagacatgccgcagctagagcgccgccgcggtcgccgccac
tgccggcccggtgcattgtgggaagccccgcacacgggaagcgctccgcggcgcccgccc
tcgtctggcgcagagacctggctggaaagacacagcgtggcttcgattccggcgcctgcg
tgtcaccagcccagggtggccgtggaagctggacccgagccgcaggccccccaggctggg
cctgggaggaaagcggtttga
>gi568815595r:125132344_125413640|GENSCAN_predicted_peptide_6|168_aa
XSMCEDVCKAFRWYLSSSSKLWGNQNVDINLLSQGFRERSAEYRVWKETSSERFPRVADQ
RQLCPSEGEGYLESWHLKSNPYLQHKLLIACCNDHFLGPPVPAALDTVDSLPLPAALLIL
HNPIPFSGYPSAIPPLSMALFELISTTPGPVQRPEQYGLMAKSTDRPA
>gi568815595r:125132344_125413640|GENSCAN_predicted_CDS_6|507_bp
nggtcaatgtgtgaagatgtctgcaaggccttccgctggtatctgagttcttcttcgaag
ctctggggaaaccagaatgtggacataaatcttttaagtcagggctttagagaacgctct
gcagaataccgtgtctggaaagagacaagttcagagagatttccaagggtagcagaccag
aggcagctttgcccttcagaaggtgaaggctacttggagagctggcatctgaagagtaac
ccatatcttcaacataagcttctgattgcctgttgcaatgaccatttcttaggccctcct
gtccctgcagcactggacactgtggattcccttcctcttcctgctgctctcctgatattg
cataatccgattcccttcagtggctaccccagtgccatccctcctttatccatggccctc
tttgaacttattagcactactcctggaccagtgcaaagaccagagcagtatggcctgatg
gcaaagagcacagaccgccctgcttga