GENSCAN 1.0	Date run: 5-Nov-116	Time: 00:07:41

Sequence gi568815589r:125136595_125341126 : 204532 bp : 43.87% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 PlyA -   1015   1010    6                               1.05
 1.08 Term -  13327  13079  249  1  0   50   54   185 0.862   7.00
 1.07 Intr -  17148  16939  210  0  0   73   88   134 0.903  11.01
 1.06 Intr -  17391  17312   80  1  2  133   87    39 0.999   7.57
 1.05 Intr -  21788  21647  142  2  1   69   95    34 0.871   2.13
 1.04 Intr -  34586  34491   96  2  0   93   98    60 0.954   7.71
 1.03 Intr -  52715  52596  120  2  0   94   17    78 0.185   1.99
 1.02 Intr -  53240  53050  191  0  2   12  105   332 0.640  26.60
 1.01 Init -  59129  58766  364  2  1   33   39   178 0.028   4.61
 1.00 Prom -  63554  63515   40                              -4.56

 2.00 Prom +  66337  66376   40                              -2.96
 2.01 Init +  66420  66472   53  2  2  100   26    34 0.804  -0.97
 2.02 Intr +  70970  71127  158  2  2  117   86   134 0.985  15.75
 2.03 Intr +  76776  76928  153  1  0   26  119   120 0.925   8.94
 2.04 Intr +  83945  84106  162  0  0   60   27   141 0.660   5.15
 2.05 Intr +  91280  91465  186  0  0   72   71    91 0.630   5.46
 2.06 Intr +  96002  96151  150  0  0  114   99   103 0.986  14.13
 2.07 Term +  97094  97386  293  0  2   62   38   145 0.636   2.41
 2.08 PlyA +  97950  97955    6                              -0.45

 3.08 PlyA -  98920  98915    6                               1.05
 3.07 Term - 100560  99998  563  1  2   80   42   702 0.999  59.34
 3.06 Intr - 101714 101547  168  0  0   68   86   101 0.994   7.82
 3.05 Intr - 102233 101996  238  2  1   83   96   283 0.921  25.79
 3.04 Intr - 102737 102347  391  1  1   54   80   549 0.999  45.43
 3.03 Intr - 102939 102827  113  0  2   96   97    87 0.999   9.58
 3.02 Intr - 104313 104082  232  2  1   57   58   574 0.526  49.08
 3.01 Init - 104532 104411  122  0  2   95  119   395 0.592  40.96
 3.00 Prom - 126134 126095   40                              -4.76

 4.02 PlyA - 126204 126199    6                               1.05
 4.01 Sngl - 127559 127221  339  2  0   31   37   367 0.893  21.93
 4.00 Prom - 148643 148604   40                              -2.36

 5.00 Prom + 159427 159466   40                              -6.66
 5.01 Init + 162328 162512  185  0  2   68  103   114 0.980   9.49
 5.02 Intr + 165389 166232  844  2  1   92   85   294 0.928  20.88
 5.03 Intr + 168469 168555   87  0  0   88   69    99 0.952   8.17
 5.04 Intr + 170819 170953  135  1  0   90   55    38 0.445   1.46
 5.05 Intr + 184839 184968  130  2  1  100   67   120 0.767  11.37
 5.06 Intr + 187204 187329  126  2  0   55   98    97 0.583   8.05
 5.07 Intr + 189822 189995  174  1  0   54   93   166 0.999  13.61
 5.08 Intr + 195332 195466  135  0  0   63   96    58 0.957   4.64
 5.09 Intr + 195916 196035  120  2  0   67  107    91 0.995   9.37
 5.10 Intr + 200424 200501   78  1  0  102   92    14 0.451   2.72
 5.11 Intr + 200627 200997  371  0  2  106   64    97 0.222   3.92

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:125136595_125341126|GENSCAN_predicted_peptide_1|483_aa
MEQYIYKRKSDGIYIINLKRTWEKLLLAAHAIVAIENPADASVISPRNTGQRVVLKFAAA
TRATPVAGHFTPGTFTNQIQAPSGSHSFLWLLTPGLTTSLKQRHLMLTYLPLLCVIQILL
CSATHGPDVTQGKFRLRPPPLPPPLLQPPPPLLPRLVILKMAPLDLDKYVEIARLCKYLP
ENDLKSEPSPRGSALGMGSPPPEVTGARSGRRLTGDATGYRNAASRLCDYVCDLLLEESN
VQPVSTPVTVCGDIHGQGDFVDRGYYSLETFTYLLALKAKWPDRITLLRGNHESRQITQV
YGFYDECQTKYGNANAWRYCTKVFDMLTVAALIDEQILCVHGGLSPDIKTLDQIRTIERN
QEIPHKGAFCDLVWSDPEDVDTWAISPRGAGWLFGAKVTNEFVHINNLKLICRAHQLVHE
GYKFMFDEKLVTVWSAPNYCYRCGNIASIMVFKDVNTREPKLFRAVPDSERVIPPRTTTP
YFL

>gi568815589r:125136595_125341126|GENSCAN_predicted_CDS_1|1452_bp
atggaacagtacatctataaaaggaaaagtgatggcatctacatcataaatctgaagagg
acctgggagaagcttctgctggcagctcatgccattgttgccattgaaaaccctgctgat
gccagtgttatatcccctaggaatactggccagagggttgtgctgaagtttgctgctgcc
accagagccactccagttgctggccacttcactcctggaaccttcactaaccagatccag
gctccttccgggagccacagctttttgtggttactgaccccagggctgaccaccagcctc
aaacagaggcatcttatgttaacctacctaccattgctctgtgtaatacagattctcctc
tgcagcgccactcacgggccggacgtgacgcagggaaagttccggcttcggcctccgccg
ctgccgccgccgctgctacagccgccgccgccgctgttgccgcggcttgttattcttaaa
atggcgccgctagacctggacaagtatgtggaaatagcgcggctgtgcaagtacctgcca
gagaacgacctgaagtcggagccgagcccccgcggcagtgccctcgggatggggtcgcct
ccaccggaggtgacaggagcgcggagtggccggcgtctgacaggagatgccaccggctac
cggaacgcggcctcacggctatgtgactacgtttgtgacctcctcttagaagagtcaaat
gttcagccagtatcaacaccagtaacagtgtgtggagatatccatggacagggtgatttt
gtagacagaggttactatagtttggagaccttcacttaccttcttgcattaaaggctaaa
tggcctgatcgtattacacttttgcgaggaaatcatgagagtagacagataacacaggtc
tatggattttatgatgagtgccaaaccaaatatggaaatgctaatgcctggagatactgt
accaaagtttttgacatgctcacagtagcagctttaatagatgagcagattttgtgtgtc
catggtggtttatctcctgatatcaaaacactggatcaaattcgaaccatcgaacggaat
caggaaattcctcataaaggagcattttgtgatctggtttggtcagatcctgaagatgtg
gatacctgggctatcagtccccgaggagcaggttggctttttggagcaaaggtcacaaat
gagtttgttcatatcaacaacttaaaactcatctgcagagcacatcaactagtgcacgaa
ggctataaatttatgtttgatgagaagctggtgacagtatggtctgctcctaattactgc
tatcgttgtggaaatattgcttcgatcatggtcttcaaagatgtaaatacaagagaacca
aagttattccgggcagttccagattcagaacgtgttattcctcccagaacgacaacgcca
tatttcctttga

>gi568815589r:125136595_125341126|GENSCAN_predicted_peptide_2|384_aa
MKQLPVLEPGDKPRKATWYTLTVPGDSPCARVGHSCSYLPPVGNAKRGKVFIVGGANPNR
SFSDVHTMDLGKHQWDLDTCKGLLPRYEHASFIPSCTPDRIWVFGGANQSGNRNCLQVLN
PETRTWTTPEVTSPPPSPRTFHTSSAAIGNQLYVFGGGERGAQPVQDTKLHVFDAILPFD
VIEFTFLDTLTWSQPETLGNPPSPRHGHVMVAAGTKLFIHGGLAGDRFYDDLHCIDISDM
KWQKLNPTGAAPAGCAAHSAVAMGKHVYIFGGMTPAGALDTMYQYHTEEQHWTLLKFDTL
LPPGRLDHSMCIIPWPVTCASEKEDSNSLTLNHEAEKEDSADKVMSHSGDSHEESQTATL
LCLVFGGMNTEGEIYDDCIVTVVD

>gi568815589r:125136595_125341126|GENSCAN_predicted_CDS_2|1155_bp
atgaagcaactgccagtcttggaacctggagacaagcccaggaaagcaacatggtacacc
ttgactgtccctggagacagcccctgtgctcgagttggccacagctgttcatatttaccc
ccagttggtaatgccaagagagggaaggtcttcattgttgggggagcaaatccaaacaga
agcttctcagacgtgcacaccatggatctgggaaaacaccagtgggacttagatacctgc
aagggcctcttgccccggtatgaacatgctagcttcattccctcctgcacacctgaccgt
atctgggtatttggaggtgccaaccaatcaggaaatcgaaattgtctacaagtcctgaat
cctgaaaccaggacgtggaccacgccagaagtgaccagccccccaccatccccaagaaca
ttccacacatcatcggcagccattggaaaccagctatatgtctttgggggcggagagaga
ggtgcccagcccgtgcaggacacgaagctgcatgtgtttgacgcaattttgccatttgat
gttattgaatttacttttctagacactctgacctggtcacagccagagacacttggaaat
cctccatctccccggcatggtcatgtgatggtggcagcagggacaaagctcttcatccac
ggaggcttggcgggggacagattctatgatgacctccactgcattgatataagtgacatg
aaatggcagaagctaaatcccactggggctgctccagcaggctgtgctgcccactcagct
gtggccatgggaaaacatgtgtacatctttggtggaatgactcctgcaggagcactggac
acaatgtaccagtatcacacagaagagcagcattggaccttgcttaaatttgatactctt
ctaccccctggacgattggaccattccatgtgtatcattccatggccagtgacgtgtgct
tctgagaaagaagattccaactctctcactctgaaccatgaagctgagaaagaggattca
gctgacaaagtaatgagccacagtggtgactcacatgaggaaagccagactgctacactg
ctctgtttggtgtttggtgggatgaatacagaaggggaaatctatgacgattgtattgtg
actgtagtggactaa

>gi568815589r:125136595_125341126|GENSCAN_predicted_peptide_3|608_aa
MKLSLVAAMLLLLSAARAEEEDKKEDVGTVVGIDLGTTYSCVGVFKNGRVEIIANDQGNR
ITPSYVAFTPEGERLIGDAAKNQLTSNPENTVFDAKRLIGRTWNDPSVQQDIKFLPFKVT
HAVVTVPAYFNDAQRQATKDAGTIAGLNVMRIINEPTAAAIAYGLDKREGEKNILVFDLG
GGTFDVSLLTIDNGVFEVVATNGDTHLGGEDFDQRVMEHFIKLYKKKTGKDVRKDNRAVQ
KLRREVEKAKRALSSQHQARIEIESFYEGEDFSETLTRAKFEELNMDLFRSTMKPVQKVL
EDSDLKKSDIDEIVLVGGSTRIPKIQQLVKEFFNGKEPSRGINPDEAVAYGAAVQAGVLS
GDQDTGDLVLLDVCPLTLGIETVGGVMTKLIPRNTVVPTKKSQIFSTASDNQPTVTIKVY
EGERPLTKDNHLLGTFDLTGIPPAPRGVPQIEVTFEIDVNGILRVTAEDKGTGNKNKITI
TNDQNRLTPEEIERMVNDAEKFAEEDKKLKERIDTRNELESYAYSLKNQIGDKEKLGGKL
SSEDKETMEKAVEEKIEWLESHQDADIEDFKAKKKELEEIVQPIISKLYGSAGPPPTGEE
DTAEKDEL

>gi568815589r:125136595_125341126|GENSCAN_predicted_CDS_3|1827_bp
atgaagctctccctggtggccgcgatgctgctgctgctcagcgcggcgcgggccgaggag
gaggacaagaaggaggacgtgggcacggtggtcggcatcgacctggggaccacctactcc
tgcgtcggcgtgttcaagaacggccgcgtggagatcatcgccaacgatcagggcaaccgc
atcacgccgtcctatgtcgccttcactcctgaaggggaacgtctgattggcgatgccgcc
aagaaccagctcacctccaaccccgagaacacggtctttgacgccaagcggctcatcggc
cgcacgtggaatgacccgtctgtgcagcaggacatcaagttcttgccgttcaaggttacc
catgcagttgttactgtaccagcctattttaatgatgcccaacgccaagcaaccaaagac
gctggaactattgctggcctaaatgttatgaggatcatcaacgagcctacggcagctgct
attgcttatggcctggataagagggagggggagaagaacatcctggtgtttgacctgggt
ggcggaaccttcgatgtgtctcttctcaccattgacaatggtgtcttcgaagttgtggcc
actaatggagatactcatctgggtggagaagactttgaccagcgtgtcatggaacacttc
atcaaactgtacaaaaagaagacgggcaaagatgtcaggaaagacaatagagctgtgcag
aaactccggcgcgaggtagaaaaggccaaacgggccctgtcttctcagcatcaagcaaga
attgaaattgagtccttctatgaaggagaagacttttctgagaccctgactcgggccaaa
tttgaagagctcaacatggatctgttccggtctactatgaagcccgtccagaaagtgttg
gaagattctgatttgaagaagtctgatattgatgaaattgttcttgttggtggctcgact
cgaattccaaagattcagcaactggttaaagagttcttcaatggcaaggaaccatcccgt
ggcataaacccagatgaagctgtagcgtatggtgctgctgtccaggctggtgtgctctct
ggtgatcaagatacaggtgacctggtactgcttgatgtatgtccccttacacttggtatt
gaaactgtgggaggtgtcatgaccaaactgattccaaggaacacagtggtgcctaccaag
aagtctcagatcttttctacagcttctgataatcaaccaactgttacaatcaaggtctat
gaaggtgaaagacccctgacaaaagacaatcatcttctgggtacatttgatctgactgga
attcctcctgctcctcgtggggtcccacagattgaagtcacctttgagatagatgtgaat
ggtattcttcgagtgacagctgaagacaagggtacagggaacaaaaataagatcacaatc
accaatgaccagaatcgcctgacacctgaagaaatcgaaaggatggttaatgatgctgag
aagtttgctgaggaagacaaaaagctcaaggagcgcattgatactagaaatgagttggaa
agctatgcctattctctaaagaatcagattggagataaagaaaagctgggaggtaaactt
tcctctgaagataaggagaccatggaaaaagctgtagaagaaaagattgaatggctggaa
agccaccaagatgctgacattgaagacttcaaagctaagaagaaggaactggaagaaatt
gttcaaccaattatcagcaaactctatggaagtgcaggccctcccccaactggtgaagag
gatacagcagaaaaagatgagttgtag

>gi568815589r:125136595_125341126|GENSCAN_predicted_peptide_4|112_aa
MKFSFVYTVILSAFNDSNVLLKNQIAIYELLFKEGVMVAKKDVHMPKHPELADKNVPNLH
VMKAMQSLKSQGYMKEQFAWRHFYWYLTNEGIQYLRDYLHLPPGDCTCYPTP

>gi568815589r:125136595_125341126|GENSCAN_predicted_CDS_4|339_bp
atgaaattctctttcgtctacactgtcatcctgtctgccttcaatgattcaaatgttctt
ctaaagaaccagattgccatttatgaactcctttttaaggagggagtcatggtggccaag
aaggatgtccacatgcctaagcacccagagctggcagacaagaacgtgcccaaccttcat
gtcatgaaggccatgcagtctctcaagtctcaaggctacatgaaggaacagtttgcctgg
agacatttctactggtaccttaccaatgagggtatccagtatctccgggattaccttcat
ctgccccccggagactgtacctgctaccctacgccatag

>gi568815589r:125136595_125341126|GENSCAN_predicted_peptide_5|795_aa
MVKLDIHTLAHHLKQERLYVNSEKQLIQRLNADVLKTAEKLYRTAWIAKQQRINLDRLII
TSAEASPAECCQHAKILEDTQFVDGYKQLGFQETAYGEFLSRLRENPRLIASSLVAGEKL
NQENTQSVIYTVFTSLYGNCIMQEDESYLLQVLRYLIEFELKESDNPRRLLRRGTCAFSI
LFKLFSEGLFSAKLFLTATLHEPIMQLLVEDEDHLETDPNKLIERFSPSQQEKLFGEKGS
DRFRQKVQEMVESNEAKLVALVNKFIGYLKQNTYCFPHSLRWIVSQMYKTLSCVDRLEVG
EVRAMCTDLLLACFICPAVVNPEQYGIISDAPINEVARFNLMQVGRLLQQLAMTGSEEGD
PRTKSSLGKFDKSCVAAFLDVVIGGRAVETPPLSSVNLLEGLSRTVVYITYSQLITLVLN
MQLSDGGQGDVPVDENKLHGKPDKTLRFSLCSDNLEGISEGPSNRSNSVSSLDLEGESVS
ELGAGPSGSNGVEALQLLEHEQATTQDNLDDKLRKFEIRDMMGLTDDRDISETVSETWST
DVLGSDFDPNIDEDRLQEIAEAPDLKQEERLQELESCSGLGSTSDDTDVREVSSRPSTPG
LSVVSGISATSEDIPNKIEDLRSECSSDFGGKDSVTSPDMDEITHGAHQLTSPPSQSESL
LAMFDPLSSHEGASAVVRPKVHYARPSHPPPDPPILEGAVGGNEARLPNFGSHVLTPAEM
EAFKQRHSYPERLVRSRSSDIVSSVRRPMSDPSWNRRPGNEERELPPAAAIGATSLVAAP
HSSSSSPSKDSSRGE

>gi568815589r:125136595_125341126|GENSCAN_predicted_CDS_5|2385_bp
atggtgaaactagatattcatactctggctcatcacctcaagcaggaacgcttatatgta
aactctgagaaacagctcattcagaggctcaatgcagatgtacttaagacagctgaaaag
ttgtatcgtacagcatggattgcgaagcaacagagaatcaatttggatcggcttatcata
accagtgctgaagcttcccctgctgaatgttgccaacatgccaaaattttggaagataca
caatttgttgatgggtataagcaattgggatttcaggagactgcttatggagaattcttg
agtcgattgagggaaaatcctcgtcttattgcctcctctttggttgctggagagaaactt
aatcaggagaacacacaaagtgttatttacacagtttttacctccctgtatggcaattgc
atcatgcaagaagatgaaagctacctccttcaggttttgcgatacttgattgaatttgaa
cttaaagaaagtgacaaccctaggcgacttttgaggagaggaacttgtgccttcagcatc
ttatttaaacttttttctgaaggactgttttctgccaaacttttcctcacagccacttta
catgagccaattatgcaactgcttgttgaagatgaagatcacctggaaacagatccaaac
aagctaattgagaggttctctccatctcagcaggaaaaactctttggagagaagggctca
gatagattcaggcaaaaagttcaagaaatggtggagtccaatgaagcaaagctagtggct
ttggtgaacaaatttattggttatctcaaacagaacacatattgttttccacatagttta
aggtggatcgtgtctcagatgtacaaaaccctctcctgtgtagataggctggaagttggg
gaggtcagggcaatgtgtactgatctcctgttggcctgcttcatttgtcctgcagttgtc
aatccagaacaatatggaataatttccgatgctcctattaatgaagtagcacgatttaat
ctgatgcaggtaggccgccttttgcagcagttagcaatgactggctctgaagagggagat
ccccgaacaaagagcagccttggaaagtttgacaaaagctgtgttgccgctttccttgat
gttgtgattgggggccgtgcagtggagacccctccattgtcttccgtcaatcttctggaa
ggattgagcagaactgtggtttatataacctacagtcagcttattactctggtcctaaac
atgcagctttcggatggaggacaaggagatgtccctgttgatgaaaacaaactccatggt
aaacctgataaaaccttgcgcttttccctctgcagtgataatctggaaggaatatctgaa
ggtccttcaaatcgctccaattcagtgtcctccctagacctagaaggagagtctgtgtca
gaacttggagcaggaccttctggcagtaatggagttgaagctctacagctgttagaacat
gagcaagctacaacacaggataaccttgatgataagctaaggaagtttgaaattcgtgac
atgatgggattaacagatgatagggacatatcagaaacagtgagtgagacctggagtaca
gacgtcttgggaagtgactttgaccctaatattgatgaagatcgcttgcaagaaattgca
gaggccccagacctaaagcaggaggagcgtctgcaagaactggagagctgttctggactg
ggtagcacatctgatgatacggatgtcagggaggtcagttcccgccccagcacaccaggc
ctcagtgttgtgtccggcataagtgcaacctctgaggatattcccaataagattgaagac
ctgagatctgagtgcagctctgattttgggggtaaagattctgtcactagtccagacatg
gatgaaataactcacggtgcccaccagctgacctctcctccttctcagtcagagtctctg
ctggccatgtttgatccactgtcttcacatgaaggggcttctgctgtggtaaggccaaag
gttcactatgctaggccatcgcatccaccaccagatcccccaatcctggaaggagctgtg
ggaggaaatgaggccaggttgccaaactttggttcccatgttttaactccagctgaaatg
gaggcattcaagcaaaggcattcttaccctgagagactagttcgaagcaggagctctgat
atagtatcttctgtccggagacccatgagtgaccccagctggaaccggcgtccaggaaat
gaagagcgagaactccctccagctgcagccattggtgctacttctttggtggctgcacct
cattcatcatcttcatccccgagtaaggactcctcaagaggagag