GENSCAN 1.0 Date run: 3-Nov-116 Time: 07:51:08 Sequence gi568815589f:111427396_111643167 : 215772 bp : 42.86% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 766 647 120 1 0 35 91 67 0.231 1.47 1.07 Intr - 3233 3152 82 1 1 64 92 85 0.855 5.22 1.06 Intr - 5977 5838 140 1 2 107 78 155 0.953 14.74 1.05 Intr - 9713 9545 169 1 1 40 69 124 0.677 4.73 1.04 Intr - 15029 14911 119 2 2 67 52 130 0.938 5.64 1.03 Intr - 17099 16983 117 2 0 107 79 80 0.968 8.74 1.02 Intr - 24160 24030 131 2 2 95 121 116 0.842 15.09 1.01 Init - 56148 56109 40 0 1 102 80 54 0.002 4.58 1.00 Prom - 59965 59926 40 -2.75 2.04 PlyA - 61783 61778 6 1.05 2.03 Term - 74330 73726 605 0 2 72 36 274 0.573 14.29 2.02 Intr - 84150 84036 115 0 1 88 17 44 0.285 -3.60 2.01 Init - 84613 84536 78 1 0 83 83 70 0.467 7.11 2.00 Prom - 85992 85953 40 -5.15 3.03 PlyA - 89392 89387 6 1.05 3.02 Term - 97930 97696 235 0 1 48 44 314 0.923 17.81 3.01 Init - 98046 97964 83 0 2 83 62 76 0.790 5.01 3.00 Prom - 99297 99258 40 -10.35 4.00 Prom + 99838 99877 40 -14.62 4.01 Init + 100001 100412 412 1 1 67 100 283 0.408 24.12 4.02 Intr + 105015 105281 267 1 0 44 85 143 0.721 6.18 4.03 Term + 114262 115775 1514 2 2 69 36 1557 0.900 138.28 4.04 PlyA + 119238 119243 6 1.05 5.02 PlyA - 119254 119249 6 1.05 5.01 Sngl - 129449 128646 804 2 0 94 32 279 0.911 18.46 5.00 Prom - 133573 133534 40 -7.05 6.13 PlyA - 135205 135200 6 1.05 6.12 Term - 135836 135726 111 2 0 66 48 146 0.997 5.98 6.11 Intr - 142814 142696 119 0 2 118 94 165 0.999 19.36 6.10 Intr - 147447 147339 109 0 1 81 95 67 0.989 5.64 6.09 Intr - 151556 151401 156 2 0 100 101 183 0.999 20.09 6.08 Intr - 156194 156077 118 1 1 78 115 149 0.999 16.15 6.07 Intr - 158770 158603 168 0 0 93 96 98 0.969 9.24 6.06 Intr - 165587 165531 57 1 0 108 30 76 0.516 0.88 6.05 Intr - 166914 166827 88 1 1 77 52 67 0.781 0.11 6.04 Intr - 170037 169922 116 2 2 94 107 71 0.764 8.77 6.03 Intr - 173113 173085 29 1 2 72 115 13 0.074 -1.60 6.02 Intr - 179577 179499 79 2 1 106 99 6 0.859 2.13 6.01 Init - 182070 181619 452 0 2 41 78 384 0.582 28.13 6.00 Prom - 195678 195639 40 -7.25 7.00 Prom + 197812 197851 40 -6.75 7.01 Sngl + 204013 204552 540 0 0 65 38 557 0.827 42.23 7.02 PlyA + 205170 205175 6 -0.45 8.03 PlyA - 205445 205440 6 1.05 8.02 Term - 206689 206493 197 2 2 88 39 76 0.306 -0.71 8.01 Init - 210469 210397 73 1 1 73 49 81 0.385 3.98 8.00 Prom - 212222 212183 40 -0.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 122277 121984 294 0 0 37 54 208 0.968 7.27 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:111427396_111643167|GENSCAN_predicted_peptide_1|306_aa MAAAAASASQDELNQLERVFLRLGHAETDEQLQNIISKFLPPVLLKLSSTQEGVRKKVME LLVHLNKRIKSRPKIQLPVETLLVQYQDPAAVSFVTNFTIIYVKMGYPRLPVEKQCELAP TLLTAMEGKPQPQQDSYVLNESQSRQNSSSAQGSSSNSGGGSGIPQPPPGMSFYAAKRVI GDNPWTPEQLEQCKLGIVKFIEAEQVPELEAVLHLVIASSDTRHSVATAADLELKSKQSL IDWNNPAIINKMYKVYLGDIPLKTKEGAVLKPELKRDPVSTRVKLKIVPHLLRSRQAAET FPANIQ >gi568815589f:111427396_111643167|GENSCAN_predicted_CDS_1|918_bp atggctgcggccgccgcctccgcctcccaggatgagctgaatcagcttgaacgggtcttt ttacgacttggccatgctgaaacagatgaacaattacagaatattatatctaaattcctt cctcctgttttgctcaaactctctagcacccaagaaggagtacgtaaaaaggtaatggaa ctgctggtccatctgaataaacgtataaaaagccgccccaaaatacaacttccagtagag acactgttggttcagtaccaggaccctgctgcagtttcctttgtcacaaattttactata atttatgttaaaatgggctatcctcgcctaccagtggaaaaacaatgtgaactggcccct acgcttcttactgccatggaagggaagcctcagccacagcaggatagttacgtgttaaat gaatcccagagtcgccaaaattcatcttcagcacagggttcttcttcaaacagtggcgga ggttctggaatcccacagcctcctccgggaatgagcttttatgcagccaaacgagttatt ggtgataacccatggacacctgaacaattggaacagtgcaaattgggaatcgtgaaattc atagaagctgaacaggtgcctgaacttgaagctgttctccacttggtgattgcctctagt gatacacgccacagtgtggcaacggcagcagacctggaattgaaaagcaaacagagctta attgactggaataatcctgccatcattaataagatgtacaaggtgtaccttggagatata ccactgaagacaaaagagggtgcagttctgaagccagagttgaaaagggaccctgtcagt acaagagtcaagttaaagattgtcccccatctcctccgctctagacaagctgctgaaacg ttcccagccaacattcag >gi568815589f:111427396_111643167|GENSCAN_predicted_peptide_2|265_aa MMATLSCKTGKFLLGILKGEGVYEQGAFCKKKNMAVFCPTPQAVRPYGYLPSFPENCCYS VLFQGAYLLPTAINLPSTAPTVPRMFMQWDVCRPTLSCPKFSLSLPPMGAQSLEGVKAAG GWHVSIASTCTRGQFATVPGLGLNFAPKSEQALGAQRGQAAEAGTSEPVGTGGFLDPQSA GMPWSTVGLGGGRGTWEGATNSEGGGAPACSQYLCLLPASSTEDAAPTAPPSAAARRLHS GRSGWAAAAITYKRAYEAMHSGSHV >gi568815589f:111427396_111643167|GENSCAN_predicted_CDS_2|798_bp atgatggcaaccctgagctgcaaaaccggcaagtttttattagggattttaaaaggagag ggggtgtacgaacagggagctttctgcaagaagaaaaatatggctgtattctgcccaacc ccacaggcagtcagaccttatggttatcttccctcgttccctgaaaattgctgttattct gttctttttcagggagcctatctgcttcctactgccatcaacttgccatccacagcaccc actgtgcccaggatgttcatgcagtgggatgtctgcaggcccacgctgagctgccccaag ttctccctcagcctccctcctatgggtgctcaaagtctggagggggtcaaggctgcagga ggctggcatgtcagcattgcaagtacctgcacacgtggccagtttgcgacagtgcctggg cttggcctcaactttgctccgaaatcggagcaggcgctgggagcacagagaggccaggca gcggaagcaggtacttctgagcctgtggggacagggggcttcttggacccccagagtgca gggatgccctggtccacagtggggctgggcggtggcagaggcacctgggaaggcgccacc aactcagaagggggtggggctcccgcgtgttcccagtacctctgcctgctcccagccagc tccacagaggatgcagcccccaccgcacctccctccgctgcagccaggcgtcttcacagt ggccgctctgggtgggctgccgctgccatcacttataaaagagcctatgaggccatgcac agtggctcacacgtgtaa >gi568815589f:111427396_111643167|GENSCAN_predicted_peptide_3|105_aa MEGSQQPHPAAECAPPTEAAAAQRTARCAPSEPATRPRPRNSGPGTTSTCISIRKQHQRH RKELPEACPAASARASLREPLRLRRGAEPDSPNALVQLSESPRFP >gi568815589f:111427396_111643167|GENSCAN_predicted_CDS_3|318_bp atggagggaagccaacagccccatcccgcagcggagtgcgcacctcccacagaagcagca gcagcgcagcgcacagcccgttgtgcgccctcggagccggcgacccggccacggccccga aactcggggcccgggacgaccagtacctgcatcagcatccgcaaacagcaccaacgccac cgcaaagagcttccagaagcttgtcccgcagcgagtgcgcgcgcaagcctgcgggaacca ctgcgcctgcgcagaggcgctgaaccggactccccgaacgcgctcgtgcagctttccgaa agtccgcgctttccatag >gi568815589f:111427396_111643167|GENSCAN_predicted_peptide_4|730_aa MQAVVPLNKMTAISPEPQTLASTEQNEVPRVVTSGEQEAILRGNAADAESFRQRFRWFCY SEVAGPRKALSQLWELCNQWLRPDIHTKEQILELLVFEQFLTILPGEIRIWVKSQHPESS EEVVTLIEDLTQMLEEKERQEEIENTKRRCCEELGRVWSDRAVRQGTVAATRSRKRQGTD YPPEPLKGAQPCQHLDLRLLASRTVREQIPILSHQGYGNLLQLPEEDESALDKIIERCLR DDDHGLMEESQQYCGSSEEDHGNQGNSKGRVAQNKTLGSGSRGKKFDPDKSPFGHNFKET SDLIKHLRVYLRKKSRRYNESKKPFSFHSDLVLNRKEKTAGEKSRKSNDGGKVLSHSSAL TEHQKRQKIHLGDRSQKCSKCGIIFIRRSTLSRRKTPMCEKCRKDSCQEAALNKDEGNES GEKTHKCSKCGKAFGYSASLTKHRRIHTGEKPYMCNECGKAFSDSSSLTPHHRTHSGEKP FKCDDCGKGFTLSAHLIKHQRIHTGEKPYKCKDCGRPFSDSSSLIQHQRIHTGEKPYTCS NCGKSFSHSSSLSKHQRIHTGEKPYKCGECGKAFRQNSCLTRHQRIHTGEKPYLCNDCGM TFSHFTSVIYHQRLHSGEKPYKCNQCEKAFPTHSLLSRHQRIHTGVKPYKCKECGKSFSQ SSSLNEHHRIHTGEKPYECNYCGATFSRSSILVEHLKIHTGRREYECNECEKTFKSNSGL IRHRGFHSAE >gi568815589f:111427396_111643167|GENSCAN_predicted_CDS_4|2193_bp atgcaagctgtagtgcccttgaacaagatgacagccatctcaccagaacctcaaactctg gcctcgactgaacaaaatgaggtcccaagagtggttacttctggggaacaagaagctatt ttaagaggaaatgctgctgatgcagagtctttcagacagaggtttaggtggttttgttac tcagaagtagctggacccaggaaagctctgagtcaactctgggagctctgcaatcagtgg ctgagaccagacattcacacgaaagaacagattttagagcttctggtgtttgagcagttc ctgaccattttgcctggggagatcaggatttgggtaaagtcacaacatcctgagagtagt gaggaagtggtgaccctaatagaagatttgacccagatgcttgaagaaaaagagaggcag gaggagattgaaaacacaaagagaaggtgttgtgaagagttaggcagagtttggagtgat agggccgtaagacaaggaactgtggcagccaccagaagcaggaagaggcaaggaacagat tatcccccagagcctctgaagggagcacagccctgccaacacctagatctcagacttctg gcctccagaactgtgagagaacaaattcctattttgagtcaccaaggttatgggaatttg ttacagctcccagaggaagatgaatcagctttagataaaataatagaaaggtgcctcagg gatgatgatcatggcttgatggaagaatcccagcaatattgtggcagctcagaggaggat cacggtaatcagggaaattcaaaaggaagagtcgcacaaaacaaaactcttgggagtggc agtaggggtaagaaatttgacccagataaaagcccctttggacataatttcaaagaaact tcagacttaattaaacatctgagagtctacttgaggaagaaatctcggaggtataatgaa agcaagaaacccttcagttttcattcagaccttgttctgaaccgcaaggagaaaaccgcc ggagaaaagtcacggaaatctaatgatggtgggaaagtcctgagtcactcttcagctctt actgaacatcagaaacgtcagaagattcatttgggggataggtcccaaaaatgcagtaag tgtgggataatctttattagaagatcaactctttctaggagaaaaacccctatgtgtgag aaatgtcggaaagattcatgtcaagaagcagccttaaataaagatgagggaaatgagagt ggagaaaaaactcataaatgtagtaagtgtggaaaagcctttggctatagcgcctcactc accaaacatcggagaattcacactggagaaaaaccctatatgtgtaatgaatgtggaaaa gcttttagtgatagttcatcgctcacaccacatcatagaactcatagtggagagaaaccc ttcaaatgtgatgactgtgggaaaggtttcaccctaagtgctcacctcattaaacatcag agaattcatactggagaaaaaccttataaatgtaaagactgtgggagaccctttagtgac agttcatctcttattcaacatcagcgaattcatactggagaaaaaccctatacatgtagc aattgtggaaaatccttcagtcatagctcatccctttccaaacatcagagaattcatact ggagagaaaccctataaatgtggcgaatgtggaaaagcctttaggcagaattcatgcctt acccggcatcagagaattcacactggagaaaaaccatatttgtgtaatgattgcggaatg acttttagccattttacgtctgtgatttatcatcaaagacttcattcaggagaaaaaccc tacaaatgtaaccagtgtgagaaagccttcccaacccattcactgctaagtcgtcatcag agaattcatactggtgtaaaaccttataaatgtaaagaatgtgggaagtccttcagtcag agttcatctcttaatgagcaccaccgaattcatacaggagagaaaccctatgagtgtaac tattgtggtgcaacctttagtcgaagctcaatccttgtagaacacctaaaaattcatacc ggaaggagagaatatgaatgtaacgaatgtgagaagacatttaaaagtaattcaggcctc attagacatcggggatttcactctgcagagtaa >gi568815589f:111427396_111643167|GENSCAN_predicted_peptide_5|267_aa MAGGPQETYNHGRRAKGKQAHLHMAAGESKKGEMLHTSKQPDLMRTHYHENNRREICHHD PITSHQVPPAALDTNPKPWQFLCGVGLQVQRRQELRFGSLCLDFRECMEMPGCQGRSLLQ EHSLCGEPLLRQCRGITWGWSPLHSGAVRRWPPSSIPQNGRSTNSLCHAPGKVTGTQQAL RAAMGIVPCRVTGAEVPKTLEAHPLHQHALDVRHGVKDYYGAIRFNDCPTRFWTCMGPVA PLSWSISPPWNGSIYPVSVPLFYLGSN >gi568815589f:111427396_111643167|GENSCAN_predicted_CDS_5|804_bp atggctggggggcctcaggaaacttacaatcatggcagaagggcaaaggggaagcaagca catcttcacatggcagcaggggagagcaaaaagggggaaatgctacacacttctaaacaa ccagatctcatgagaactcactatcatgagaacaacaggagggaaatctgccaccatgat cccatcacctcccaccaggtccctcccgcagcattggatacaaaccctaaaccttggcag tttctatgtggtgttggcctgcaggtgcaaagaagacaagagttgaggtttgggagtctc tgcctagatttcagagaatgtatggaaatgcctggatgtcaaggcagaagtctgctgcag gagcacagcctttgtggagaacctctactacggcagtgcagagggataacatggggatgg agccccctacacagtggagctgtgagaagatggccaccatcctccataccccagaatggt agatccaccaacagcttgtgccatgcacctggaaaagtcacaggcactcaacaggccttg agagcagccatgggaattgtaccctgcagagtaacaggggcagaggtgcccaagacctta gaagcccaccctttgcatcagcatgccctagatgtgagacatggagtcaaagattattat ggagctataagatttaatgactgccccaccaggttttggacttgcatggggcctgtagcc cctttgtcttggtcaatctctcctccttggaatgggagcatttacccagtgtctgtaccc ctgttctatcttggaagtaactaa >gi568815589f:111427396_111643167|GENSCAN_predicted_peptide_6|533_aa MLLCQKAPSLKTTYNHPPAADSAGTALNLETTVKQTRETQLEYNNVGTDLSPEPKSFNYP LLSSSGDQFEIQLNQQLWSLIPNNDVRRLVSHVIRTLKTDCTETHLQLACAKLISRTGLL MKLLSEQQELRTVSMTAWKPRMNRKSRSRMRQSHFASHAGRWWHNHSTLQPQSPKLQMAE LSEARRRSFRMVRTKTWTLKKHFVGYPTNSDFELKTAELPPLKNGGLEFLIAYGMLYFVE VLLEALFLTVDPYMRVAAKRLKEGDTMMGQQVAKVVESKNVALPKGTIVLASPGWTTHSI SDGKDLEKLLTEWPDTIPLSLALGTVGMPGLTAYFGLLEICGVKGGETVMVNAAAGAVGS VVGQIAKLKGCKVVGAVGSDEKVAYLQKLGFDVVFNYKTVESLEETLKKASPDGYDCYFD NVGGEFSNTVIGQMKKFGRIAICGAISTYNRTGPLPPGPPPEIVIYQELRMEAFVVYRWQ GDARQKALKDLLKWVLEGKIQYKEYIIEGFENMPAAFMGMLKGDNLGKTIVKA >gi568815589f:111427396_111643167|GENSCAN_predicted_CDS_6|1602_bp atgctactatgccagaaggcaccatctctgaaaacaacctacaatcatcctcctgcggca gattccgctgggactgcattaaacttagagacgactgttaaacaaaccagggaaacacag ttggaatacaacaacgtgggcactgacctgtcccccgaacccaaaagcttcaattaccca ttgctctcatcctcaggtgaccagtttgaaattcagctaaaccagcagctatggtccctc atccccaacaacgatgtgagaaggcttgtttctcatgttatccggaccttgaagacggac tgcactgagacccatttgcaactggcctgtgccaagctcatctctaggacaggcctccta atgaagcttctcagtgagcagcaagaattgagaactgtatcaatgacagcatggaagccc agaatgaacagaaagagcagaagtcgaatgagacagtctcactttgccagccatgctgga aggtggtggcacaatcatagcacactgcagccacaatctcccaagcttcagatggctgaa ctgagtgaggcaaggagaaggagcttcaggatggttcgtactaagacatggaccctgaag aagcactttgttggctatcctactaatagtgactttgagttgaagacagctgagctccca cccttaaaaaatggaggccttgagtttctaattgcctatggaatgctttattttgtagag gtcctgcttgaagctttgttcctcaccgtggatccctacatgagagtggcagccaaaaga ttgaaggaaggtgatacaatgatggggcagcaagtggccaaagttgtggaaagtaaaaat gtagccctaccaaaaggaactattgtactggcttctccaggctggacaacgcactccatt tctgatgggaaagatctggaaaagctgctgacagagtggccagacacaataccactgtct ttggctctggggacagttggcatgccaggcctgactgcctactttggcctacttgaaatc tgtggtgtgaagggtggagaaacagtgatggttaatgcagcagctggagctgtgggctca gtcgtggggcagattgcaaagctcaagggctgcaaagttgttggagcagtagggtctgat gaaaaggttgcctaccttcaaaagcttggatttgatgtcgtctttaactacaagacggta gagtctttggaagaaaccttgaagaaagcgtctcctgatggttatgattgttattttgat aatgtaggtggagagttttcaaacactgttatcggccagatgaagaaatttggaaggatt gccatatgtggagccatctctacatataacagaaccggcccacttcccccaggcccaccc ccagagattgttatctatcaggagcttcgcatggaagcttttgtcgtctaccgctggcaa ggagatgcccgccaaaaagctctgaaggacttgctgaaatgggtcttagagggtaaaatc cagtacaaggaatatatcattgaaggatttgaaaacatgccagctgcatttatgggaatg ctgaaaggagataatttggggaagacaatagtgaaagcatga >gi568815589f:111427396_111643167|GENSCAN_predicted_peptide_7|179_aa MGAPLLSPGWGAGAAGRRWWMLLAPLLPALLLVRPAGALVEGLYCGTRDCYEVLGVSRSA GKAEIARAYRQLARRYHPDRYRPQPGDEGPGRTPQSAEEAFLLVATAYETLKVRPAGVEG LRRLAAGSPRRLPTPVRGAWASVTPKLSTATTATFKILTFPTWPCERAESAPRGLGGHS >gi568815589f:111427396_111643167|GENSCAN_predicted_CDS_7|540_bp atgggggcgccgctgctctctcccggctggggagccggggctgccggccggcgctggtgg atgctgctggcgcccctgctgccggcgctgctgctggtgcggcccgcgggggccctggtg gaggggctctactgcggcacgcgggactgctacgaggtgctgggcgtgagccgctcggcg ggcaaggcggagatcgcgcgggcctaccgccagctggcccggcgctaccaccctgaccgc taccggccccagcccggagacgagggccccgggcggacgccgcagagcgccgaggaggct ttcctgctggtggcaaccgcctacgagacactcaaggtgaggcctgcgggcgtggagggg cttcgaagactggccgcgggaagcccacggcgccttccgaccccggtccgcggagcgtgg gcctctgtgaccccgaaactgagcacagccaccaccgcgacctttaagatactcacgttt cccacgtggccctgtgaaagagccgagtcggccccacggggccttggggggcacagctag >gi568815589f:111427396_111643167|GENSCAN_predicted_peptide_8|89_aa MARTGETDKDHVKEVREEHLKERNDNFHSKTVGVLGELQNTPLIPSSTSTTPLPLPCSKI FVAITNPSESRVQVLPTPDILVFNCLNVP >gi568815589f:111427396_111643167|GENSCAN_predicted_CDS_8|270_bp atggcgaggacaggagaaacagacaaagatcacgtcaaggaagtcagggaagaacacctt aaagaacgaaatgataatttccacagtaagacagtgggcgttctgggggagctacagaac actcccctcattccctcctccaccagtacaactccacttccactcccctgttccaagata tttgtagctatcacaaacccgtcagagtcaagagtacaagtgttacctactccagacata ctggtctttaactgcctaaacgttccttga