GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:58:17 Sequence gi568815586r:4270346_4479582 : 209237 bp : 43.98% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1947 2029 83 0 2 119 85 4 0.509 2.48 1.02 Intr + 2501 2660 160 0 1 73 -63 176 0.533 0.05 1.03 Intr + 2804 2908 105 2 0 80 59 83 0.577 3.93 1.04 Intr + 3656 3890 235 2 1 37 92 457 0.629 38.69 1.05 Intr + 5660 5875 216 1 0 98 55 222 0.999 18.60 1.06 Intr + 8415 8574 160 2 1 111 111 289 0.999 33.06 1.07 Intr + 9398 9479 82 0 1 82 66 25 0.534 -1.60 1.08 Intr + 13866 13978 113 0 2 50 93 65 0.317 3.22 1.09 Intr + 18497 18645 149 0 2 137 66 187 0.864 21.35 1.10 Intr + 19886 20104 219 1 0 50 76 81 0.639 1.60 1.11 Term + 29515 29664 150 0 0 81 55 278 0.909 21.71 1.12 PlyA + 29841 29846 6 1.05 2.03 PlyA - 30311 30306 6 1.05 2.02 Term - 54408 54027 382 2 1 28 40 838 0.981 67.31 2.01 Init - 57558 57554 5 0 2 76 55 0 0.203 -5.03 2.00 Prom - 61648 61609 40 -1.06 3.00 Prom + 66415 66454 40 -4.66 3.01 Init + 73301 73966 666 1 0 66 41 305 0.111 18.83 3.02 Term + 74675 75607 933 1 0 12 47 280 0.120 8.43 3.03 PlyA + 75655 75660 6 -0.45 4.00 Prom + 75938 75977 40 -3.26 4.01 Init + 79477 79551 75 0 0 71 96 52 0.619 5.39 4.02 Intr + 80922 81032 111 2 0 40 91 105 0.814 6.58 4.03 Intr + 81915 82118 204 2 0 133 62 22 0.809 3.50 4.04 Term + 84093 84110 18 2 0 99 47 -9 0.101 -5.58 4.05 PlyA + 84214 84219 6 1.05 5.03 PlyA - 84244 84239 6 1.05 5.02 Term - 85528 85402 127 0 1 112 47 114 0.981 7.46 5.01 Init - 97322 97261 62 2 2 61 113 30 0.287 3.52 5.00 Prom - 97546 97507 40 -7.06 6.06 PlyA - 98054 98049 6 1.05 6.05 Term - 100438 99998 441 1 0 109 46 584 0.998 51.56 6.04 Intr - 102352 102249 104 2 2 121 70 36 0.945 4.99 6.03 Intr - 109193 109027 167 1 2 81 79 122 0.029 10.20 6.02 Intr - 121574 121454 121 0 1 79 52 59 0.081 0.95 6.01 Init - 154885 154804 82 1 1 89 98 43 0.506 4.68 6.00 Prom - 157693 157654 40 -4.86 7.10 PlyA - 157832 157827 6 1.05 7.09 Term - 158608 158454 155 2 2 134 44 49 0.858 3.18 7.08 Intr - 159043 158863 181 1 1 92 42 48 0.782 0.04 7.07 Intr - 164046 163933 114 0 0 102 70 173 0.738 17.54 7.06 Intr - 173891 173788 104 0 2 87 113 52 0.955 7.49 7.05 Intr - 174417 174373 45 1 0 108 49 39 0.562 0.38 7.04 Intr - 175140 174880 261 1 0 94 79 311 0.591 28.26 7.03 Intr - 178228 178118 111 2 0 72 47 87 0.726 3.35 7.02 Intr - 181242 181102 141 1 0 59 110 95 0.928 9.12 7.01 Init - 197277 197253 25 0 1 76 97 29 0.063 0.57 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:4270346_4479582|GENSCAN_predicted_peptide_1|557_aa XEPSVRLTQPAPHPALLALRAPARAPSLESCIGVATLSADTSGGLSADAGARKRVFPAWP LAAGEPLGALPPACGGPRRCTAGGQKGRCSGPFNRGFRNSFEVIRNTDFRDMTFISGPGK AGGRGAAGLAMELLCHEVDPVRRAVRDRNLLRDDRVLQNLLTIEERYLPQCSYFKCVQKD IQPYMRRMVATWMLEVCEEQKCEEEVFPLAMNYLDRFLAGVPTPKSHLQLLGAVCMFLAS KLKETSPLTAEKLCIYTDNSIKPQELLEWELVVLGKLKWNLAAVTPHDFIEHILRKLPQQ REKLSLIRKHAQTFIALCATVAMAETLEPYSSKGEMSGVHVLTIRNLRHCAFTRIILSES SEQHYNVGIAVVIICRSENENSERSDFKFAMYPPSMIATGSVGAAICGLQQDEEVSSLTC DALTELLAKITNTDVLREMAQGDADGANPGSSSEIVRRYGFKMRAPNIRFLNEDSRTQTE NLPPVQHEECFSNPPEENRGRDLTFIEMDCLKACQEQIEAVLLNSLQQYRQDQRDGSKSE DELDQASTPTDVRDIDL >gi568815586r:4270346_4479582|GENSCAN_predicted_CDS_1|1674_bp nnggaacctagtgtacggctcacccagcccgcgccccaccccgccttgctggctctccgc gcccctgcccgggccccctctctcgaaagctgcatcggtgtggccacgctcagcgcagac acctcgggcggcttgtcagcagatgcaggggcgaggaagcgggtttttcctgcgtggccg ctggccgcgggggaaccgctgggagccctgcccccggcctgcggcggccctagacgctgc accgcgggggggcagaagggacgttgttctggtccctttaatcggggctttcgaaacagc ttcgaagttatcaggaacacagacttcagggacatgacctttatctctgggccggggaaa gcaggagggagaggggccgccgggctggccatggagctgctgtgccacgaggtggacccg gtccgcagggccgtgcgggaccgcaacctgctccgagacgaccgcgtcctgcagaacctg ctcaccatcgaggagcgctaccttccgcagtgctcctacttcaagtgcgtgcagaaggac atccaaccctacatgcgcagaatggtggccacctggatgctggaggtctgtgaggaacag aagtgcgaagaagaggtcttccctctggccatgaattacctggaccgtttcttggctggg gtcccgactccgaagtcccatctgcaactcctgggtgctgtctgcatgttcctggcctcc aaactcaaagagaccagcccgctgaccgcggagaagctgtgcatttacaccgacaactcc atcaagcctcaggagctgctggagtgggaactggtggtgctggggaagttgaagtggaac ctggcagctgtcactcctcatgacttcattgagcacatcttgcgcaagctgccccagcag cgggagaagctgtctctgatccgcaagcatgctcagaccttcattgctctgtgtgccacc gtggccatggcagagaccctggaaccttactcatccaagggagagatgtcaggagttcat gttttgacaatcagaaaccttaggcactgtgcttttacaaggattattttaagtgaatcc tcagaacagcactacaacgtgggtattgctgttgtcattatttgcagatcagaaaatgaa aactcagagaggtcagactttaagtttgccatgtacccaccgtcgatgatcgcaactgga agtgtgggagcagccatctgtgggctccagcaggatgaggaagtgagctcgctcacttgt gatgccctgactgagctgctggctaagatcaccaacacagacgtgctcagagaaatggcg cagggagatgctgacggagcaaatccggggtcctcctctgagatagttcgtagatatggt tttaaaatgcgggctccaaacatacgctttttaaacgaggactccagaacacagactgaa aacctccctccagtgcagcatgaagaatgcttttctaatcctccagaggaaaatcgtgga cgggacttaacgtttatagaaatggattgtctcaaagcttgccaggagcagattgaggcg gtgctcctcaatagcctgcagcagtaccgtcaggaccaacgtgacggatccaagtcggag gatgaactggaccaagccagcacccctacagacgtgcgggatatcgacctgtga >gi568815586r:4270346_4479582|GENSCAN_predicted_peptide_2|128_aa MRLGPAAGRDVSCEQLTQLYSACQRPQVNPGLRRKQNSLLKRLRKAKKEAPPMEKPEVVK THLRDMIILPKMVGSMVGVYNGKTFNQVEIKPEMISHYLGEFSITYKLVKHCRPGIGATH SSRFIPLK >gi568815586r:4270346_4479582|GENSCAN_predicted_CDS_2|387_bp atgagacttggaccagctgcgggacgggacgtgtcctgcgagcagctgacgcagctgtac agtgcgtgccagcggccgcaagtgaacccgggcctgcggcggaaacagaactcgctgctg aagcgcctgcgcaaggccaagaaggaggcaccgcccatggagaagccggaagtggtgaag acgcacctgcgggacatgatcatcctgcccaagatggtgggcagcatggtgggcgtctac aacggcaagaccttcaaccaggtggagatcaagccggagatgatcagtcactacctgggc gagttctccatcacctataagctggtgaagcactgccggcccggcatcggggccacccac tcctcccgcttcatccccctcaagtag >gi568815586r:4270346_4479582|GENSCAN_predicted_peptide_3|532_aa MKAEIKMFFETNENKETTYQNLWDTFKAVCRGKFIALNAHRRKQERSKVGTLTSQLKELE KQEQTNSKASRRQEITKIRAELEETETQKTLPKINESRSWFFEKINKIDRPLARPIKKRE KNQMEAIKNDKGDITTDPTEIRTTIREYYKHLYTNKLESLEEMDKFLDTYTLPRLKQEEV ESLIRPVTGSEIEAIINSLPTKKSPGPDGFTAEFFQRYKEELRIKYLGIQLTRDMKDLFK EHYKPLLNEIKEDTNKWKNIPCSWIGRINIMKMAILPKVIYRFNAIPFELPMTFFTDLEK TTLKFIWNQKRAHIAKSILSQKNKAGGITLPDFKLCYKATVTKTAWYWYQNRDIDQWNRT EPSEIIPHIYNYLNFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSR WSKDLHVRPKTIKTLEENLGNTIQDIGMGKDFMTKTPKAMATKAKVDEWDLIKLKSFCTA KETTIRVNRQPTEWEKIFPIYSSDKGLISRIYKELKQICKKKTTPSTSGQRI >gi568815586r:4270346_4479582|GENSCAN_predicted_CDS_3|1599_bp atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagagacaacataccag aatctctgggacacatttaaagcagtgtgtagagggaaatttatagcactaaatgctcac aggagaaagcaggaaagatctaaagttggcaccctaacatcacaattgaaagaactagag aagcaagagcaaacaaattcaaaagctagcagaaggcaagaaataactaagatcagagca gaactggaggagacagagacacaaaaaacccttccaaaaatcaatgaatccaggagctgg ttttttgaaaagatcaacaaaattgatagaccgctagcaagaccaataaagaaaagagag aagaatcaaatggaagcaataaaaaatgataaaggggatatcaccacagatcccacagaa atacgaactaccatcagagaatactataaacacctctacacaaataaactagaaagtcta gaagaaatggataaattcctcgacacctacaccctcccaagactaaaacaggaagaagtt gaatctctgattagaccagtaacaggctctgaaattgaggcaataattaatagcttacca accaaaaaaagtccaggaccagacggattcacagccgaattcttccagaggtacaaggag gagctgaggataaaatacctaggaatccaacttacaagggatatgaaggacctcttcaag gagcactataaaccactgctcaatgaaataaaagaggacacaaacaaatggaagaacatt ccatgctcatggataggaagaatcaatatcatgaaaatggccatactgcccaaggtaatt tatagattcaatgccatccccttcgagctaccaatgactttcttcacagacttggaaaaa actactttaaagttcatatggaaccaaaaaagagcccacattgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacgctacctgacttcaaactatgctacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaaca gagccctcagaaataataccacacatctacaactatctgaactttgacaaacctgacaaa aacaagaaatggggaaaggattccctattcaacaaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaattcaaga tggagtaaagacttacatgttagacctaaaaccataaaaaccctagaagaaaacctaggc aataccattcaggacataggcatgggcaaggacttcatgactaaaacaccaaaagcaatg gcaacaaaagccaaagttgacgaatgggatctaattaaactaaagagcttctgcacagca aaagaaactaccatcagagtgaacaggcaacctacagaatgggagaaaatttttccaatc tactcatctgacaaagggcttatatccagaatctacaaagaactcaaacaaatttgcaag aaaaaaacaaccccatcaacaagtgggcaaaggatatga >gi568815586r:4270346_4479582|GENSCAN_predicted_peptide_4|135_aa MHGILERSKFCKDMTVKYDSRLRERKYGVVEGKALSELRAMAKAAREECPVFTPPGGETL DQVKMRGIDFFEFLCQLILKEADQKEQFSQGSPSNCLETSLAEIFPLGKNHSSKVNSDSG IPGLAASVLVRQPLQ >gi568815586r:4270346_4479582|GENSCAN_predicted_CDS_4|408_bp atgcatggaattttggagagaagcaaattttgcaaagatatgacggtaaagtatgactca agacttcgggaaaggaaatacggggttgtagaaggcaaagcgctaagtgagctgagggcc atggccaaagcagccagggaagagtgccctgtgtttacaccgcccggaggagagacgctg gaccaggtgaaaatgcgtggaatagacttttttgaatttctttgtcaactaatcctgaaa gaagcggatcaaaaagaacagttttcccaaggatctccaagcaactgtctggaaacttct ttggcagagatatttcctttaggaaaaaatcacagctctaaagttaattcagacagcggt attccaggattagcagccagtgtcttagttaggcagccactgcagtga >gi568815586r:4270346_4479582|GENSCAN_predicted_peptide_5|62_aa MSIHYRSFAKDWVAHARGKIRAADLTSFSADTIESSLPILDKAAFLPPVSIPLTLTYFAQ QH >gi568815586r:4270346_4479582|GENSCAN_predicted_CDS_5|189_bp atgagcattcactacaggtccttcgccaaagactgggtagcacatgcacgaggtaagata cgagcagcagatcttacatccttcagtgctgacaccatcgagtcctccctgcccatcctg gataaagcagcattcctcccaccagtctccatcccccttactctgacctattttgctcaa cagcactga >gi568815586r:4270346_4479582|GENSCAN_predicted_peptide_6|304_aa MPGRKEVHVCGPSAVAHACKLSTLADQVPALGPPRAHPGGFSPEDTFLETGTFIEEGCGV TDKDLNSDVCSMSVLRAYPNASPLLGSSWGGLIHLYTATARNSYHLQIHKNGHVDGAPHQ TIYSALMIRSEDAGFVVITGVMSRRYLCMDFRGNIFGSHYFDPENCRFQHQTLENGYDVY HSPQYHFLVSLGRAKRAFLPGMNPPPYSQFLSRRNEIPLIHFNTPIPRRHTRSAEDDSER DPLNVLKPRARMTPAPASCSQELPSAEDNSPMASDPLGVVRGGRVNTHAGGTGPEGCRPF AKFI >gi568815586r:4270346_4479582|GENSCAN_predicted_CDS_6|915_bp atgcctggacggaaagaagtccacgtttgtgggccgagcgcggtggctcatgcctgtaaa ctcagcactttggcagaccaagtccctgcccttggcccgccacgtgcccatcctggtggc ttcagtcctgaagatacgttcttggaaacagggacattcatagaagagggctgtggggtc actgacaaagatctgaattcagacgtctgcagcatgagcgtcctcagagcctatcccaat gcctccccactgctcggctccagctggggtggcctgatccacctgtacacagccacagcc aggaacagctaccacctgcagatccacaagaatggccatgtggatggcgcaccccatcag accatctacagtgccctgatgatcagatcagaggatgctggctttgtggtgattacaggt gtgatgagcagaagatacctctgcatggatttcagaggcaacatttttggatcacactat ttcgacccggagaactgcaggttccaacaccagacgctggaaaacgggtacgacgtctac cactctcctcagtatcacttcctggtcagtctgggccgggcgaagagagccttcctgcca ggcatgaacccacccccgtactcccagttcctgtcccggaggaacgagatccccctaatt cacttcaacacccccataccacggcggcacacccggagcgccgaggacgactcggagcgg gaccccctgaacgtgctgaagccccgggcccggatgaccccggccccggcctcctgttca caggagctcccgagcgccgaggacaacagcccgatggccagtgacccattaggggtggtc aggggcggtcgagtgaacacgcacgctgggggaacgggcccggaaggctgccgccccttc gccaagttcatctag >gi568815586r:4270346_4479582|GENSCAN_predicted_peptide_7|378_aa MDILGGGIEGWPLFENNASSEIHGFKAFNSTTLIQTFHKLYCMAGTVLRSEDTETEHLRN LGCLLLHMLAAITTYAVADGNEELEAHVSLPHRILVGMVVPSPAGTRANNTLLDSRGWGT LLSRSRAGLAGEIAGVNWESGYLVGIKRQRRLYCNVGIGFHLQVLPDGRISGTHEENPYS SYPASCLQAASLLYGLLEISTVERGVVSLFGVRSALFVAMNSKGRLYATPSFQEECKFRE TLLPNNYNAYESDLYQGTYIALSKYGRNRWRMRLCPKQVLNLVHHADSQRRTMFVYLIER LKFRHQRAVSCVVSSFCKSLSTSGWSQGRQAKRTRKHRFMVLLIIKAGRSPVAYSVQLPC SLPMGKLAQEGEDSDGRL >gi568815586r:4270346_4479582|GENSCAN_predicted_CDS_7|1137_bp atggacatcttggggggggggatcgagggctggcccttgtttgagaataacgcgtctagt gaaatccatggatttaaagcctttaattcaacaacacttattcaaacatttcataaactg tattgcatggcgggcacggtgctccgttctgaagatacagagacagaacatttgaggaat ctgggctgcctgcttctgcatatgttagcagccatcacaacatatgctgtggcggatggc aatgaagaactggaagcccacgtgtctctcccacaccgcatcctagtgggcatggtggtg ccctcgcctgcaggcacccgtgccaacaacacgctgctggactcgaggggctggggcacc ctgctgtccaggtctcgcgccgggctagctggagagattgccggggtgaactgggaaagt ggctatttggtggggatcaagcggcagcggaggctctactgcaacgtgggcatcggcttt cacctccaggtgctccccgacggccggatcagcgggacccacgaggagaacccctacagt tcttatcctgcaagctgcctgcaagctgccagcttgctgtatggcctgctggaaatttcc actgtggagcgaggcgtggtgagtctctttggagtgagaagtgccctcttcgttgccatg aacagtaaaggaagattgtacgcaacgcccagcttccaagaagaatgcaagttcagagaa accctcctgcccaacaattacaatgcctacgagtcagacttgtaccaagggacctacatt gccctgagcaaatacggacggaacaggtggaggatgcgattatgccccaagcaggtcctg aatctggtccatcatgcagatagccaacgcagaaccatgtttgtatacctcattgaacga ctgaaattccggcaccagagagctgtgtcctgcgtggtttcctctttctgtaaatcgctg agtacctcaggctggtcccagggaaggcaggcaaagagaacccgaaagcaccgattcatg gtgctcttgattattaaggcaggaagaagtcctgtggcgtactcagtccaactaccctgt agcttgccaatggggaaattggcccaggagggtgaagacagcgatggaagactatag