GENSCAN 1.0	Date run: 3-Nov-116	Time: 21:16:25

Sequence gi568815589r:125049676_125289718 : 240043 bp : 43.57% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   7191   7230   40                              -3.16
 1.01 Init +   9666   9720   55  2  1   56   95    47 0.494   3.65
 1.02 Intr +  52747  52854  108  2  0   99  100    81 0.934  10.66
 1.03 Term +  91921  91967   47  2  2   98   42    79 0.498   1.67
 1.04 PlyA +  92245  92250    6                               1.05

 2.09 PlyA -  92261  92256    6                               1.05
 2.08 Term - 100246  99998  249  1  0   50   54   185 0.856   7.00
 2.07 Intr - 104067 103858  210  0  0   73   88   134 0.902  11.01
 2.06 Intr - 104310 104231   80  1  2  133   87    39 0.999   7.57
 2.05 Intr - 108707 108566  142  2  1   69   95    34 0.871   2.13
 2.04 Intr - 121505 121410   96  2  0   93   98    60 0.954   7.71
 2.03 Intr - 139634 139515  120  2  0   94   17    78 0.185   1.99
 2.02 Intr - 140159 139969  191  0  2   12  105   332 0.640  26.60
 2.01 Init - 146048 145685  364  2  1   33   39   178 0.028   4.61
 2.00 Prom - 150473 150434   40                              -4.56

 3.00 Prom + 153256 153295   40                              -2.96
 3.01 Init + 153339 153391   53  2  2  100   26    34 0.804  -0.97
 3.02 Intr + 157889 158046  158  2  2  117   86   134 0.985  15.75
 3.03 Intr + 163695 163847  153  1  0   26  119   120 0.925   8.94
 3.04 Intr + 170864 171025  162  0  0   60   27   141 0.660   5.15
 3.05 Intr + 178199 178384  186  0  0   72   71    91 0.630   5.46
 3.06 Intr + 182921 183070  150  0  0  114   99   103 0.986  14.13
 3.07 Term + 184013 184305  293  0  2   62   38   145 0.636   2.41
 3.08 PlyA + 184869 184874    6                              -0.45

 4.08 PlyA - 185839 185834    6                               1.05
 4.07 Term - 187479 186917  563  1  2   80   42   702 0.999  59.34
 4.06 Intr - 188633 188466  168  0  0   68   86   101 0.994   7.82
 4.05 Intr - 189152 188915  238  2  1   83   96   283 0.921  25.79
 4.04 Intr - 189656 189266  391  1  1   54   80   549 0.999  45.43
 4.03 Intr - 189858 189746  113  0  2   96   97    87 0.999   9.58
 4.02 Intr - 191232 191001  232  2  1   57   58   574 0.526  49.08
 4.01 Init - 191451 191330  122  0  2   95  119   395 0.592  40.96
 4.00 Prom - 213053 213014   40                              -4.76

 5.02 PlyA - 213123 213118    6                               1.05
 5.01 Sngl - 214478 214140  339  2  0   31   37   367 0.893  21.93
 5.00 Prom - 235562 235523   40                              -2.36

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:125049676_125289718|GENSCAN_predicted_peptide_1|69_aa
MPDSPLDPSAVKDVIETTVLLFHKDQLLLQPPPVHWIPVPLASDHNFTISYSLPSSTIRV
NYTVSNDKR

>gi568815589r:125049676_125289718|GENSCAN_predicted_CDS_1|210_bp
atgcctgattctcccctggatccttctgctgtaaaggatgttattgagacaactgtcctc
ctctttcataaagatcagctgctcctccagccaccacctgtgcactggattccagtccca
cttgcctctgaccacaacttcaccatcagctactccctgcccagttcaaccattcgagtt
aactacacagtgtccaacgacaaaagataa

>gi568815589r:125049676_125289718|GENSCAN_predicted_peptide_2|483_aa
MEQYIYKRKSDGIYIINLKRTWEKLLLAAHAIVAIENPADASVISPRNTGQRVVLKFAAA
TRATPVAGHFTPGTFTNQIQAPSGSHSFLWLLTPGLTTSLKQRHLMLTYLPLLCVIQILL
CSATHGPDVTQGKFRLRPPPLPPPLLQPPPPLLPRLVILKMAPLDLDKYVEIARLCKYLP
ENDLKSEPSPRGSALGMGSPPPEVTGARSGRRLTGDATGYRNAASRLCDYVCDLLLEESN
VQPVSTPVTVCGDIHGQGDFVDRGYYSLETFTYLLALKAKWPDRITLLRGNHESRQITQV
YGFYDECQTKYGNANAWRYCTKVFDMLTVAALIDEQILCVHGGLSPDIKTLDQIRTIERN
QEIPHKGAFCDLVWSDPEDVDTWAISPRGAGWLFGAKVTNEFVHINNLKLICRAHQLVHE
GYKFMFDEKLVTVWSAPNYCYRCGNIASIMVFKDVNTREPKLFRAVPDSERVIPPRTTTP
YFL

>gi568815589r:125049676_125289718|GENSCAN_predicted_CDS_2|1452_bp
atggaacagtacatctataaaaggaaaagtgatggcatctacatcataaatctgaagagg
acctgggagaagcttctgctggcagctcatgccattgttgccattgaaaaccctgctgat
gccagtgttatatcccctaggaatactggccagagggttgtgctgaagtttgctgctgcc
accagagccactccagttgctggccacttcactcctggaaccttcactaaccagatccag
gctccttccgggagccacagctttttgtggttactgaccccagggctgaccaccagcctc
aaacagaggcatcttatgttaacctacctaccattgctctgtgtaatacagattctcctc
tgcagcgccactcacgggccggacgtgacgcagggaaagttccggcttcggcctccgccg
ctgccgccgccgctgctacagccgccgccgccgctgttgccgcggcttgttattcttaaa
atggcgccgctagacctggacaagtatgtggaaatagcgcggctgtgcaagtacctgcca
gagaacgacctgaagtcggagccgagcccccgcggcagtgccctcgggatggggtcgcct
ccaccggaggtgacaggagcgcggagtggccggcgtctgacaggagatgccaccggctac
cggaacgcggcctcacggctatgtgactacgtttgtgacctcctcttagaagagtcaaat
gttcagccagtatcaacaccagtaacagtgtgtggagatatccatggacagggtgatttt
gtagacagaggttactatagtttggagaccttcacttaccttcttgcattaaaggctaaa
tggcctgatcgtattacacttttgcgaggaaatcatgagagtagacagataacacaggtc
tatggattttatgatgagtgccaaaccaaatatggaaatgctaatgcctggagatactgt
accaaagtttttgacatgctcacagtagcagctttaatagatgagcagattttgtgtgtc
catggtggtttatctcctgatatcaaaacactggatcaaattcgaaccatcgaacggaat
caggaaattcctcataaaggagcattttgtgatctggtttggtcagatcctgaagatgtg
gatacctgggctatcagtccccgaggagcaggttggctttttggagcaaaggtcacaaat
gagtttgttcatatcaacaacttaaaactcatctgcagagcacatcaactagtgcacgaa
ggctataaatttatgtttgatgagaagctggtgacagtatggtctgctcctaattactgc
tatcgttgtggaaatattgcttcgatcatggtcttcaaagatgtaaatacaagagaacca
aagttattccgggcagttccagattcagaacgtgttattcctcccagaacgacaacgcca
tatttcctttga

>gi568815589r:125049676_125289718|GENSCAN_predicted_peptide_3|384_aa
MKQLPVLEPGDKPRKATWYTLTVPGDSPCARVGHSCSYLPPVGNAKRGKVFIVGGANPNR
SFSDVHTMDLGKHQWDLDTCKGLLPRYEHASFIPSCTPDRIWVFGGANQSGNRNCLQVLN
PETRTWTTPEVTSPPPSPRTFHTSSAAIGNQLYVFGGGERGAQPVQDTKLHVFDAILPFD
VIEFTFLDTLTWSQPETLGNPPSPRHGHVMVAAGTKLFIHGGLAGDRFYDDLHCIDISDM
KWQKLNPTGAAPAGCAAHSAVAMGKHVYIFGGMTPAGALDTMYQYHTEEQHWTLLKFDTL
LPPGRLDHSMCIIPWPVTCASEKEDSNSLTLNHEAEKEDSADKVMSHSGDSHEESQTATL
LCLVFGGMNTEGEIYDDCIVTVVD

>gi568815589r:125049676_125289718|GENSCAN_predicted_CDS_3|1155_bp
atgaagcaactgccagtcttggaacctggagacaagcccaggaaagcaacatggtacacc
ttgactgtccctggagacagcccctgtgctcgagttggccacagctgttcatatttaccc
ccagttggtaatgccaagagagggaaggtcttcattgttgggggagcaaatccaaacaga
agcttctcagacgtgcacaccatggatctgggaaaacaccagtgggacttagatacctgc
aagggcctcttgccccggtatgaacatgctagcttcattccctcctgcacacctgaccgt
atctgggtatttggaggtgccaaccaatcaggaaatcgaaattgtctacaagtcctgaat
cctgaaaccaggacgtggaccacgccagaagtgaccagccccccaccatccccaagaaca
ttccacacatcatcggcagccattggaaaccagctatatgtctttgggggcggagagaga
ggtgcccagcccgtgcaggacacgaagctgcatgtgtttgacgcaattttgccatttgat
gttattgaatttacttttctagacactctgacctggtcacagccagagacacttggaaat
cctccatctccccggcatggtcatgtgatggtggcagcagggacaaagctcttcatccac
ggaggcttggcgggggacagattctatgatgacctccactgcattgatataagtgacatg
aaatggcagaagctaaatcccactggggctgctccagcaggctgtgctgcccactcagct
gtggccatgggaaaacatgtgtacatctttggtggaatgactcctgcaggagcactggac
acaatgtaccagtatcacacagaagagcagcattggaccttgcttaaatttgatactctt
ctaccccctggacgattggaccattccatgtgtatcattccatggccagtgacgtgtgct
tctgagaaagaagattccaactctctcactctgaaccatgaagctgagaaagaggattca
gctgacaaagtaatgagccacagtggtgactcacatgaggaaagccagactgctacactg
ctctgtttggtgtttggtgggatgaatacagaaggggaaatctatgacgattgtattgtg
actgtagtggactaa

>gi568815589r:125049676_125289718|GENSCAN_predicted_peptide_4|608_aa
MKLSLVAAMLLLLSAARAEEEDKKEDVGTVVGIDLGTTYSCVGVFKNGRVEIIANDQGNR
ITPSYVAFTPEGERLIGDAAKNQLTSNPENTVFDAKRLIGRTWNDPSVQQDIKFLPFKVT
HAVVTVPAYFNDAQRQATKDAGTIAGLNVMRIINEPTAAAIAYGLDKREGEKNILVFDLG
GGTFDVSLLTIDNGVFEVVATNGDTHLGGEDFDQRVMEHFIKLYKKKTGKDVRKDNRAVQ
KLRREVEKAKRALSSQHQARIEIESFYEGEDFSETLTRAKFEELNMDLFRSTMKPVQKVL
EDSDLKKSDIDEIVLVGGSTRIPKIQQLVKEFFNGKEPSRGINPDEAVAYGAAVQAGVLS
GDQDTGDLVLLDVCPLTLGIETVGGVMTKLIPRNTVVPTKKSQIFSTASDNQPTVTIKVY
EGERPLTKDNHLLGTFDLTGIPPAPRGVPQIEVTFEIDVNGILRVTAEDKGTGNKNKITI
TNDQNRLTPEEIERMVNDAEKFAEEDKKLKERIDTRNELESYAYSLKNQIGDKEKLGGKL
SSEDKETMEKAVEEKIEWLESHQDADIEDFKAKKKELEEIVQPIISKLYGSAGPPPTGEE
DTAEKDEL

>gi568815589r:125049676_125289718|GENSCAN_predicted_CDS_4|1827_bp
atgaagctctccctggtggccgcgatgctgctgctgctcagcgcggcgcgggccgaggag
gaggacaagaaggaggacgtgggcacggtggtcggcatcgacctggggaccacctactcc
tgcgtcggcgtgttcaagaacggccgcgtggagatcatcgccaacgatcagggcaaccgc
atcacgccgtcctatgtcgccttcactcctgaaggggaacgtctgattggcgatgccgcc
aagaaccagctcacctccaaccccgagaacacggtctttgacgccaagcggctcatcggc
cgcacgtggaatgacccgtctgtgcagcaggacatcaagttcttgccgttcaaggttacc
catgcagttgttactgtaccagcctattttaatgatgcccaacgccaagcaaccaaagac
gctggaactattgctggcctaaatgttatgaggatcatcaacgagcctacggcagctgct
attgcttatggcctggataagagggagggggagaagaacatcctggtgtttgacctgggt
ggcggaaccttcgatgtgtctcttctcaccattgacaatggtgtcttcgaagttgtggcc
actaatggagatactcatctgggtggagaagactttgaccagcgtgtcatggaacacttc
atcaaactgtacaaaaagaagacgggcaaagatgtcaggaaagacaatagagctgtgcag
aaactccggcgcgaggtagaaaaggccaaacgggccctgtcttctcagcatcaagcaaga
attgaaattgagtccttctatgaaggagaagacttttctgagaccctgactcgggccaaa
tttgaagagctcaacatggatctgttccggtctactatgaagcccgtccagaaagtgttg
gaagattctgatttgaagaagtctgatattgatgaaattgttcttgttggtggctcgact
cgaattccaaagattcagcaactggttaaagagttcttcaatggcaaggaaccatcccgt
ggcataaacccagatgaagctgtagcgtatggtgctgctgtccaggctggtgtgctctct
ggtgatcaagatacaggtgacctggtactgcttgatgtatgtccccttacacttggtatt
gaaactgtgggaggtgtcatgaccaaactgattccaaggaacacagtggtgcctaccaag
aagtctcagatcttttctacagcttctgataatcaaccaactgttacaatcaaggtctat
gaaggtgaaagacccctgacaaaagacaatcatcttctgggtacatttgatctgactgga
attcctcctgctcctcgtggggtcccacagattgaagtcacctttgagatagatgtgaat
ggtattcttcgagtgacagctgaagacaagggtacagggaacaaaaataagatcacaatc
accaatgaccagaatcgcctgacacctgaagaaatcgaaaggatggttaatgatgctgag
aagtttgctgaggaagacaaaaagctcaaggagcgcattgatactagaaatgagttggaa
agctatgcctattctctaaagaatcagattggagataaagaaaagctgggaggtaaactt
tcctctgaagataaggagaccatggaaaaagctgtagaagaaaagattgaatggctggaa
agccaccaagatgctgacattgaagacttcaaagctaagaagaaggaactggaagaaatt
gttcaaccaattatcagcaaactctatggaagtgcaggccctcccccaactggtgaagag
gatacagcagaaaaagatgagttgtag

>gi568815589r:125049676_125289718|GENSCAN_predicted_peptide_5|112_aa
MKFSFVYTVILSAFNDSNVLLKNQIAIYELLFKEGVMVAKKDVHMPKHPELADKNVPNLH
VMKAMQSLKSQGYMKEQFAWRHFYWYLTNEGIQYLRDYLHLPPGDCTCYPTP

>gi568815589r:125049676_125289718|GENSCAN_predicted_CDS_5|339_bp
atgaaattctctttcgtctacactgtcatcctgtctgccttcaatgattcaaatgttctt
ctaaagaaccagattgccatttatgaactcctttttaaggagggagtcatggtggccaag
aaggatgtccacatgcctaagcacccagagctggcagacaagaacgtgcccaaccttcat
gtcatgaaggccatgcagtctctcaagtctcaaggctacatgaaggaacagtttgcctgg
agacatttctactggtaccttaccaatgagggtatccagtatctccgggattaccttcat
ctgccccccggagactgtacctgctaccctacgccatag