GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:25:19

Sequence gi568815587r:34782720_35016284 : 233565 bp : 38.86% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   4844   4883   40                              -1.95
 1.01 Sngl +  14987  15268  282  1  0   37   41   192 0.665   4.64
 1.02 PlyA +  18870  18875    6                               1.05

 2.02 PlyA -  19136  19131    6                               1.05
 2.01 Sngl -  36964  36584  381  1  0   53   55   243 0.965  13.42
 2.00 Prom -  37508  37469   40                              -8.95

 3.05 PlyA -  38027  38022    6                               1.05
 3.04 Term -  42013  41889  125  2  2   99   41    55 0.388  -0.53
 3.03 Intr -  42448  42394   55  1  1   79   98    30 0.494   0.73
 3.02 Intr -  43093  42916  178  0  1   86   69    85 0.506   5.40
 3.01 Init -  54928  54834   95  1  2   80   81   125 0.709  10.90
 3.00 Prom -  56357  56318   40                              -4.25

 4.03 PlyA -  56838  56833    6                               1.05
 4.02 Term -  59571  59336  236  1  2  116   42   144 0.072   8.10
 4.01 Init -  64090  64036   55  1  1   69   78    53 0.065   3.96
 4.00 Prom -  66124  66085   40                              -5.25

 5.00 Prom +  66261  66300   40                              -7.45
 5.01 Init +  66499  66627  129  0  0   48   75    63 0.109   1.23
 5.02 Intr +  76678  76817  140  0  2   46   44   139 0.012   3.64
 5.03 Intr +  83254  83441  188  1  2   63  101    70 0.234   4.21
 5.04 Intr +  87485  87597  113  0  2   88   83    53 0.412   3.98
 5.05 Intr +  90059  90154   96  1  0   58   19   184 0.363   7.79
 5.06 Term +  92201  92467  267  1  0   22   43   195 0.312   2.91
 5.07 PlyA +  93231  93236    6                               1.05

 6.00 Prom +  94013  94052   40                              -3.15
 6.01 Init + 109161 109292  132  2  0   62  119    64 0.626   7.09
 6.02 Intr + 117843 118028  186  2  0   29   34   203 0.350   7.96
 6.03 Intr + 119610 120153  544  2  1   -6   69   328 0.072  13.04
 6.04 Term + 124223 124356  134  0  2   80   42   168 0.958   8.67
 6.05 PlyA + 125351 125356    6                               1.05

 7.00 Prom + 125463 125502   40                              -9.25
 7.01 Init + 126691 126819  129  0  0   78   30   114 0.429   4.80
 7.02 Intr + 127241 127375  135  1  0   64   92    55 0.640   3.34
 7.03 Intr + 129896 130086  191  1  2    4   94   154 0.617   5.06
 7.04 Intr + 130541 130635   95  2  2   53  101    65 0.689   2.99
 7.05 Term + 130736 130878  143  0  2   38   43   130 0.648   0.61
 7.06 PlyA + 130931 130936    6                              -0.45

 8.00 Prom + 131580 131619   40                              -8.25
 8.01 Init + 133937 134096  160  1  1  105   84    91 0.360   8.45
 8.02 Intr + 148685 148765   81  0  0  115  100    30 0.513   5.59
 8.03 Intr + 159032 159148  117  0  0   41   51   165 0.756   7.62
 8.04 Intr + 164787 164887  101  1  2   65  103   125 0.972  10.61
 8.05 Intr + 174665 174864  200  1  2   57  113   203 0.993  16.93
 8.06 Intr + 183921 184095  175  0  1   88   17   137 0.976   5.62
 8.07 Intr + 187420 187567  148  0  1   91   96   172 0.842  17.19
 8.08 Intr + 195405 195463   59  1  2   99  106    54 0.452   5.98
 8.09 Intr + 199782 201134 1353  2  0  -25   86   562 0.065  32.40
 8.10 Intr + 201851 202009  159  1  0   90   95   180 0.989  18.16
 8.11 Intr + 209596 209660   65  0  2   77   66    45 0.934  -2.20
 8.12 Intr + 212195 212401  207  2  0   91    9   315 0.040  21.17
 8.13 Intr + 225696 225767   72  0  0   89   59    85 0.225   3.30
 8.14 Term + 230008 230065   58  1  1  105   42   108 0.921   4.08
 8.15 PlyA + 230148 230153    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  59617  59336  282  1  0  103   42   171 0.882   9.24
S.002 Sngl -  64090  63806  285  1  0   69   44   189 0.876   7.90
S.003 Term +  77534  77748  215  0  2  100   48   154 0.977   9.01
S.004 Term - 100097  99998  100  1  1  131   46    99 0.971   6.62
S.005 Term + 212195 212453  259  2  1   91   37   333 0.942  22.54
S.006 Init + 229098 229165   68  2  2   91   95    26 0.815   4.20

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:34782720_35016284|GENSCAN_predicted_peptide_1|93_aa
MSRQRFAAEAEPSWTTSTRAVQKGNVGLEPLQTLPSGAMRRGPPFSRPQNGSSSDSLHCV
PGKAADTQQQPMKEAGWGAVPCKATGAELPKAM

>gi568815587r:34782720_35016284|GENSCAN_predicted_CDS_1|282_bp
atgtccaggcagaggtttgctgcagaggcagagccctcatggacaacatctactagggca
gtgcagaagggaaatgtggggttggagcccttacagactctgcctagtggagctatgaga
agagggccaccattctccagaccccagaatggtagctcctctgacagcttgcactgtgtg
cctggaaaagctgcagacactcaacagcagcccatgaaagaagctggctggggagctgta
ccctgcaaggccacaggggcagagctacccaaggccatgtga

>gi568815587r:34782720_35016284|GENSCAN_predicted_peptide_2|126_aa
MWESLEHPRDLLNGLAQNADSDMDNKFQAEVVSDGNEELLGNWSKGDPCYVLAKRLEAFC
PCPTDVQNIELERDDLGYLAEEISKQQSIQEVTWVQLKAFSFKREAEHKSSENLQPDKAI
EKKIIF

>gi568815587r:34782720_35016284|GENSCAN_predicted_CDS_2|381_bp
atgtgggaaagtttggaacatcctagagacttgctaaatggcttagcccaaaatgctgat
agtgatatggacaataaattccaggctgaggtggtctcagatggaaatgaggaacttctt
gggaactggagcaaaggtgacccttgttatgttttagcaaagagactggaggcattttgc
ccctgccctacagatgtacagaacattgaactcgagagagatgatttagggtatctggca
gaagaaatttctaagcagcaaagcattcaagaggtgacttgggtgcagttaaaggcattc
agcttcaaaagggaagcagagcataaaagttcagaaaatttgcagcctgacaaagctata
gaaaagaaaatcattttctga

>gi568815587r:34782720_35016284|GENSCAN_predicted_peptide_3|150_aa
MPADIARCPLQGSITPSGDSLNWEKRLENGESKHNVLTLKQVRPVAARPSKTQCSTVELA
IHNGPIESGSRHFPLWAAQIEELGLDPRDWWLLGWRRRSSGYGKLGIGEDPGFSDPSDLP
MATEELVADLLPICSSFHCSHGHLCSAAGD

>gi568815587r:34782720_35016284|GENSCAN_predicted_CDS_3|453_bp
atgcctgcagacattgccagatgtcccctgcagggcagcatcacccccagtggagactca
ctgaactgggagaagaggctggagaatggagaaagcaaacacaatgtactgactctaaaa
caggtacgaccagttgctgccaggccctcaaagacacagtgttctactgtggaattagcc
atacataatgggcccattgaatcaggcagcaggcactttcctctctgggctgcacagata
gaggagttgggcctagaccccagggactggtggttacttggatggaggaggagaagcagt
ggctatggcaaacttggaataggagaagaccctggattttcagacccaagtgacttgccc
atggccacagaagaactggtggcagatctgcttcctatctgtagctctttccactgctca
catggccacttgtgttcagcagctggtgactag

>gi568815587r:34782720_35016284|GENSCAN_predicted_peptide_4|96_aa
MSAFCVPVSVIGYWLLSGGVPATKIWEPMGGKAPQWARLTPVLLGNQVWEGWKQQMGSVL
LKPEVSMKTSLTQDAGFQPPNISAELQEVQHSFGFQ

>gi568815587r:34782720_35016284|GENSCAN_predicted_CDS_4|291_bp
atgtcagccttttgtgtccctgtgtcagtcattggctactggctgctctcggggggtgtc
ccggcaactaaaatctgggagccaatgggaggcaaagctcctcaatgggctagactgaca
ccagtgctgcttgggaaccaggtttgggaaggatggaagcagcagatgggctcagtctta
cttaaacctgaagtctctatgaagacctcactcacccaagacgctggctttcagcccccc
aacatctcagcagaactccaagaagtacaacattcatttggcttccaatga

>gi568815587r:34782720_35016284|GENSCAN_predicted_peptide_5|310_aa
MTAFRVGAYASQVTSGILTQVCLTVDKQRPWICPVIMTPAPKCDSGKREESEELDHDTLE
LQVKDGSGGYCFFPPAGVEPRHQPPCRTGRSAAALGSHRSVNPIVNCACKGSRLHVPYEN
LMPDDVSLSPITPRWDHLVASSSSGLLPILHYVCYPTSPLVKVSIIGPSPYARGYLPHPP
LDGVLSVNLVLDIVEIGSEDTQQMVSAAELVAGGEKSPTSILLHEQRNRRVAERQSSRVA
WHRRREEKERLNINRSSAGDSQRGDRLWGSQTLGEYHLPIPSSFQLPIHPTESYFYHPIK
SLHSPSFKSE

>gi568815587r:34782720_35016284|GENSCAN_predicted_CDS_5|933_bp
atgaccgcttttagggtaggggcttatgcaagtcaggtaacaagtggaattctgacccag
gtttgtctcacagttgataaacagaggccatggatctgcccagtgatcatgaccccagct
cccaaatgtgattctgggaagagggaggagagtgaagaacttgaccatgatacgctggag
ctgcaggtcaaagatggctcaggaggttactgcttcttcccacctgctggtgtggagcca
aggcaccagcctccctgcaggacaggcagatcagcagcggcattaggttctcataggagt
gtgaaccctattgtgaactgtgcatgcaagggatctaggttgcatgttccttatgagaat
ctaatgcctgatgatgtgtcactgtctcccatcacccccagatgggaccatctagttgca
tctagtagctcagggctcctgccaattctacattatgtgtgctatcccacttcaccactg
gtgaaagtttcaataattggaccatcaccttatgccaggggctatctaccccacccacct
ctagatggggtcctcagtgtcaacttggtgttagatattgttgaaattggatcggaggac
acccagcagatggtgtctgctgcagaattggttgctggtggggaaaaatccccgacatcc
atcttgctccacgagcagaggaacagaagagtagcagaacgacagagcagcagagtggca
tggcatagaagaagagaagagaaggagcgtctgaacatcaacaggagttcagctggggac
agtcagagaggagatcggctgtggggcagccaaactctgggggaatatcaccttcccatt
ccatcctctttccagctccccatccatcccactgagagctacttctatcacccaataaaa
tccctgcactcaccatccttcaagtctgagtga

>gi568815587r:34782720_35016284|GENSCAN_predicted_peptide_6|331_aa
MEEALGWYMVHSPPQISGLVGLLMWGEIYAVLGIDSNKVQNGLLRDVTLLLRQTLTEAEK
QKVLQAAEKYGDEQHASCSKPRRQRGDREGEEEVETPFLLGKEAVLGLALVDDILLCAPT
EEASQEGTEALLNFSANRGYKVSKSKAQLCKTSVKYLGLVLSKGTRVLGEERIKPISSFP
HPQTLKQLRGFWGITGFCRLWIPGYGEIARPLYNLIKETQGAKTHLLTWEPEAQKAFNQL
KQALLKVPALSLPVGKAFNLYVSEKKGMSLGVLTQARGPAQQSMGYLKAINPNGAANRTT
RGRAILPRTLRSTPGGALAVVPHLMSLFGRK

>gi568815587r:34782720_35016284|GENSCAN_predicted_CDS_6|996_bp
atggaagaagctctaggatggtacatggtccactcacctccacagataagtggacttgtg
ggtcttcttatgtggggagaaatttatgcagtcctgggaatagactcaaataaagtacaa
aatggtctgctgagggatgttacgttgctgctacgtcagaccctcactgaagctgaaaag
cagaaagttctgcaggcagcagaaaaatatggagatgagcaacatgcctcctgtagcaaa
ccgaggagacaaagaggagatagggaaggtgaggaagaagtggaaactccattcctacta
ggaaaggaagcagttctgggtcttgcattagttgatgatattctgctctgtgccccaact
gaggaagcttctcaggaaggcaccgaagctcttctcaacttctcagctaacagaggatat
aaggtttcaaaatccaaggcccagctctgcaaaacctcagtgaagtatctaggattagta
ctgtccaaggggaccagagtattaggggaagagaggattaagcctatttcctccttccct
cacccccaaaccctcaagcaactaagaggattttggggcattacaggattttgtagacta
tggatacctgggtatggtgaaatagcccgtccattatataacctcataaaagaaactcag
ggagctaaaactcatcttttaacctgggaacctgaagctcaaaaggccttcaaccagcta
aagcaagccttgctcaaggtgccagccctcagccttcctgtagggaaggccttcaatctg
tatgtgtcagaaaagaagggaatgtccctgggagttttaacacaggcccgaggaccagct
caacagtcaatgggttacctaaaagcaatcaatccaaatggtgctgcaaaccgaaccaca
cgtggacgtgccattcttccgaggacccttagatcaaccccaggaggagccctagctgtt
gttccccacttgatgtcccttttcggcaggaagtag

>gi568815587r:34782720_35016284|GENSCAN_predicted_peptide_7|230_aa
MGLGEKRQGSTLINAVKEGLSEQVAFNLGPKRHELSQCVMNEYYKQSHQLSMATGYERLD
KRIKGIKQSSLGVEHRTTRETKMGCWAALPAWLEYKQAEKCEKGDWPSLHLSPVLDASRL
EHWTPSSSVLELRLALLAPQPANGLLRDLGFTPFRSGPSLSRIGSFPWVLGLTDFKNEAA
DPCGVKLQTFTVSVTALKGGTQVVRSSRWVHGLADFRNEAADLRRECYSS

>gi568815587r:34782720_35016284|GENSCAN_predicted_CDS_7|693_bp
atgggacttggagaaaaacggcagggatcaactttgattaacgcagtcaaggaaggcctc
tctgagcaggtggcatttaatctaggacctaaaagacatgaactaagccagtgtgtgatg
aatgagtactataaacaaagtcatcagctgagtatggcaacagggtatgagaggcttgac
aagagaataaaaggtataaaacagtcatctttgggagtggaacaccgaactactagagaa
acaaagatgggctgttgggcagcactgccagcatggctagaatacaagcaggcagaaaaa
tgtgaaaagggagactggcctagcctacatctttctcccgtgctggatgcttcccgactt
gaacactggactccaagttcttcagttttggaactcagactggctctcctcgctcctcag
cctgcaaacggcctattgcgggatcttgggttcacgccctttaggtctggtcctagtttg
tccagaattggttccttcccgtgggttcttggtctcactgacttcaagaatgaagccgcg
gacccttgcggagtgaagctgcagactttcacagtgagtgttacagctcttaaaggtggc
acgcaggttgttcgttcctcccggtgggttcatggtctcgctgacttcaggaatgaagcc
gcagaccttcgcagggagtgttacagctcataa

>gi568815587r:34782720_35016284|GENSCAN_predicted_peptide_8|984_aa
MAASWRLGCDPRLLRYLVGFPGRRSVGLVKGALGWSVSRGANWRWFHSTQWLRGDPIKIL
MPSLSPTMEEGNIVKWLKKEEFKKRKGSLRILQSSALAAAHRTKEQDDDAGVGAGERPPG
EAVSAGDALCEIETDKAVVTLDASDDGILAKIVVEEGSKNIRLGSLIGLIVEEGEDWKHV
EIPKDVGPPPPVSKPSEPRPSPEPQISIPVKKEHIPGTLRDALKLVQLKQTGKITESRPT
PAPTATPTAPSPLQATAGPSYPRPVIPPVSTPGQPNAVGTFTEIPASNIRRVIAKRLTES
KSTVPHAYATADCDLGAVLKVRQDLVKDDIKVSVNDFIIKAAAVTLKINKIDRLLARLIK
KKREKNQIDAIKNDKGDITTDPTEMQTTVREYYKHLYVNKPENLEEMDKFLDTYTLPRLN
QEELESLNKPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELLPFLLKLFQSIEKE
GILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDANILNNILANQIQQHIKKLIHHD
EVGFIPGMQGWFNIRKSINLIPHVNRTKDKNHMVISVDAEKAFDKIQQHFMLKTLNKLGI
DGTYLKIIRPIFDKPTADIILNRQKLEALPLKTGTRQRCSLSPLLFNIVLEVLARAIRQE
KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLIGNFSKVSEYTINVQKSQSFL
YTNNRQTESQIMSELPFTIASKRIKYLGIQLTREVKDFFKENYKPLLDEIKENTNKWKNI
PWSWVGRIDIVKMAILPKQMPDVNVSWDGEGPKQLPFIDISVAVATDKGLLTPIIKDAAA
KGIQEIADSVKALSKKARDGKLLPEEYQGGSFSISNLGMFGIDEFTAVINPPQACILAVG
RFRPVLKLTEDEEGNAKLQQRQLITVTMSSDSRVVDDELATRPEYATSTAILESLEKVMA
LSVLERMLAAEDEAWYENEQLVKA

>gi568815587r:34782720_35016284|GENSCAN_predicted_CDS_8|2955_bp
atggcggcctcctggaggctgggctgtgatccgcggctgctgcgttatcttgtgggcttc
cccggccgccgaagcgtagggctggtgaagggggctcttgggtggtctgtaagccgcgga
gctaattggagatggtttcacagcacgcagtggcttcggggtgatcccattaagatacta
atgccatcactgtctcctacaatggaagaaggaaacattgtgaaatggctgaaaaaggaa
gaatttaaaaagagaaagggaagcctccgtatacttcagtcctcagctctggcagctgcc
catcggacaaaggaacaggacgatgatgctggcgtcggtgctggggagcggcccccgggt
gaagcggtgagtgctggagatgcattatgtgaaattgagactgacaaagctgtggttacc
ttagatgcaagtgatgatggaatcttggccaaaatcgtggttgaagaaggaagtaaaaat
atacggctaggttcactaattggtttgatagtagaagaaggagaagattggaaacatgtt
gaaattcccaaagacgtaggtcctccaccaccagtttcaaaaccttcagagcctcgcccc
tcaccagaaccacagatttccatccctgtcaagaaggaacacatacccgggacactacgg
gatgctctcaaacttgtccagttgaaacaaacgggcaagattaccgagtccagaccaact
ccagcccccacagccactcccacagcaccttcgcccctacaggccacagctggaccatct
tatccccggcctgtgatcccaccagtatcaactcctggacaacccaatgcagtgggcaca
ttcactgaaatccccgccagcaatattcgaagagttattgccaagagattaactgaatct
aaaagtactgtacctcatgcatatgctactgctgactgtgaccttggagctgttttaaaa
gttaggcaagatctggtcaaagatgacattaaagtatcagtaaatgattttatcatcaag
gcagcagctgttacccttaaaatcaacaaaattgatagactgctagcaagactaataaag
aagaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggcgatatcaccacc
gatcccacagaaatgcaaactaccgtcagagaatactataaacacctctacgtaaataaa
ccagaaaatctagaagaaatggataaattcctcgacacatacaccctcccaagactaaac
caggaagaacttgaatctctgaataaaccaataacaggctctgaaattgaggcaataatt
aatagcttaccaactaaaaaaagtccaggaccagatggattcacagccgaattctaccag
aggtacaaggaggagctgttaccattccttctgaaactattccaatcaatagaaaaagag
ggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagccgggcaga
gacacaacaaaaaaagagaattttagaccaatatccctgatgaacatcgatgcaaacatc
ctcaataacatactggcaaaccaaatccagcagcacatcaaaaagcttatccaccacgac
gaagttggcttcatccctgggatgcaagggtggttcaacatacgaaaatcaataaactta
atcccgcatgtaaacagaaccaaagacaaaaaccacatggttatctcagtagatgcagaa
aaggcctttgacaaaattcaacaacacttcatgctaaaaactctcaataaattaggtatt
gatgggacgtatctcaaaataataagacctatttttgacaaacccacagctgatatcata
ctgaataggcaaaaactggaagcattgcctttgaaaactggcacaaggcagagatgctct
ctctcaccactcctattcaacatagtgttggaagttctggcgagggcaatcaggcaggag
aaagaaataaagggtattcaattaggaaaagaagaagtcaaattgtccctgtttgcagat
gacatgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgata
ggcaacttcagcaaagtctcagaatacacaatcaatgtgcaaaaatcacaatcattctta
tacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct
tcaaagagaataaaatacctaggaatccaacttacaagggaggtgaaggacttcttcaag
gagaactacaaaccactgctcgatgaaataaaagagaatacaaacaaatggaagaacatt
ccatggtcatgggtaggaagaatcgatattgtgaaaatggctatactgcccaagcaaatg
ccagatgttaatgtaagctgggatggagagggcccaaagcaactgccatttattgacatt
tcagtggctgtggcaacagataaaggcttacttactccaatcataaaagatgctgctgct
aaaggtatccaggaaattgctgactctgtaaaggctctatcaaagaaagcaagagatgga
aaattgttgcctgaagaataccaaggaggatcttttagtatttccaacttggggatgttt
ggcatcgacgaatttactgcagtgattaaccctcctcaggcctgcattttggcggttggg
aggttccgacctgtgctgaagctcactgaggatgaagagggaaatgccaaactgcagcag
cgccagctcataacagtcacaatgtcaagtgacagtcgagtggttgatgacgaactggca
accagaccagaatatgcaacaagcacagcgatcttggagagcttggagaaggtcatggct
ctaagtgtcctggagaggatgttggcagcagaagatgaggcttggtatgaaaatgaacag
cttgtgaaagcgtaa