GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:25:37 Sequence gi568815586f:4174041_4400006 : 225966 bp : 43.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 13810 13849 40 -1.76 1.01 Init + 27753 27861 109 2 1 94 115 64 0.916 10.16 1.02 Intr + 41840 41952 113 0 2 63 30 99 0.040 1.70 1.03 Intr + 42413 42497 85 1 1 87 87 14 0.043 0.59 1.04 Intr + 46052 46180 129 0 0 63 52 56 0.016 0.17 1.05 Term + 49394 49464 71 0 2 90 47 78 0.037 2.00 1.06 PlyA + 50190 50195 6 1.05 2.00 Prom + 54181 54220 40 -3.96 2.01 Init + 54283 54355 73 0 1 85 101 82 0.670 10.43 2.02 Term + 79724 79788 65 0 2 102 44 79 0.586 2.95 2.03 PlyA + 83105 83110 6 1.05 3.00 Prom + 89051 89090 40 -0.86 3.01 Init + 95157 95209 53 2 2 85 5 118 0.565 1.83 3.02 Intr + 98252 98334 83 2 2 119 85 4 0.783 2.48 3.03 Intr + 98806 98965 160 2 1 73 -63 176 0.701 0.05 3.04 Intr + 99109 99213 105 1 0 80 59 83 0.770 3.93 3.05 Intr + 99961 100195 235 1 1 37 92 457 0.839 38.69 3.06 Intr + 101965 102180 216 0 0 98 55 222 0.999 18.60 3.07 Intr + 104720 104879 160 1 1 111 111 289 0.999 33.06 3.08 Intr + 105703 105784 82 2 1 82 66 25 0.534 -1.60 3.09 Intr + 110171 110283 113 2 2 50 93 65 0.317 3.22 3.10 Intr + 114802 114950 149 2 2 137 66 187 0.864 21.35 3.11 Intr + 116191 116409 219 0 0 50 76 81 0.639 1.60 3.12 Term + 125820 125969 150 2 0 81 55 278 0.909 21.71 3.13 PlyA + 126146 126151 6 1.05 4.03 PlyA - 126616 126611 6 1.05 4.02 Term - 150713 150332 382 1 1 28 40 838 0.981 67.31 4.01 Init - 153863 153859 5 2 2 76 55 0 0.203 -5.03 4.00 Prom - 157953 157914 40 -1.06 5.00 Prom + 162720 162759 40 -4.66 5.01 Init + 169606 170271 666 0 0 66 41 305 0.111 18.83 5.02 Term + 170980 171912 933 0 0 12 47 280 0.120 8.43 5.03 PlyA + 171960 171965 6 -0.45 6.00 Prom + 172243 172282 40 -3.26 6.01 Init + 175782 175856 75 2 0 71 96 52 0.619 5.39 6.02 Intr + 177227 177337 111 1 0 40 91 105 0.814 6.58 6.03 Intr + 178220 178423 204 1 0 133 62 22 0.809 3.50 6.04 Term + 180398 180415 18 1 0 99 47 -9 0.101 -5.58 6.05 PlyA + 180519 180524 6 1.05 7.03 PlyA - 180549 180544 6 1.05 7.02 Term - 181833 181707 127 2 1 112 47 114 0.981 7.46 7.01 Init - 193627 193566 62 1 2 61 113 30 0.287 3.52 7.00 Prom - 193851 193812 40 -7.06 8.04 PlyA - 194359 194354 6 1.05 8.03 Term - 196743 196303 441 0 0 109 46 584 0.998 51.56 8.02 Intr - 198657 198554 104 1 2 121 70 36 0.945 4.99 8.01 Init - 205542 205332 211 0 1 79 79 180 0.736 13.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 52381 52256 126 2 0 74 72 65 0.816 4.15 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:4174041_4400006|GENSCAN_predicted_peptide_1|168_aa MSLRRSLSGLSPWGHILLGFRFPGHQVSFPPTDNLQAHICAELVDPQSDQPVNLQSCSPS SATESLNADIDDDLPIADGISILTQDNLRKGWISQDKPSYAAARDSQSLEKSLQYNSLVL VMLCREKKRLCLLDKREMRKNSYPHASALEGKMEEQAANNEKAFREHG >gi568815586f:4174041_4400006|GENSCAN_predicted_CDS_1|507_bp atgagccttcgaaggagcctcagtggtctctcaccctggggccacatcctgctaggcttc aggttccctggtcaccaagtctccttcccccccaccgacaacctgcaagcccacatctgt gcagagcttgtggatcctcagtcagatcaaccagtcaatctccaaagctgctcaccatct agtgccactgagagccttaatgcagatatagatgatgacctgcctatagctgatggaatc agcattctaacccaggataatctgagaaaaggctggataagtcaggataagccaagctat gctgcagccagggattctcaatccttggaaaagtctcttcagtacaatagccttgtgtta gtgatgctttgcagggaaaagaagcggctctgccttctggacaagagagaaatgaggaag aacagctacccccacgcctcagccctagagggcaaaatggaagagcaggcagcaaataat gagaaagccttcagagaacatggctga >gi568815586f:4174041_4400006|GENSCAN_predicted_peptide_2|45_aa MSGSSLKLSPEAVLVPNFLHGLQNDCKAVEDGDLKMLILFLGPLN >gi568815586f:4174041_4400006|GENSCAN_predicted_CDS_2|138_bp atgagtggaagcagcctgaagctctcaccagaagcggtgttggtgccaaacttcttgcac ggcctgcagaacgactgtaaggctgttgaagatggagacctcaagatgctcattctcttt cttggaccacttaattga >gi568815586f:4174041_4400006|GENSCAN_predicted_peptide_3|574_aa MARGGGRGGRRVAALAPGEPSVRLTQPAPHPALLALRAPARAPSLESCIGVATLSADTSG GLSADAGARKRVFPAWPLAAGEPLGALPPACGGPRRCTAGGQKGRCSGPFNRGFRNSFEV IRNTDFRDMTFISGPGKAGGRGAAGLAMELLCHEVDPVRRAVRDRNLLRDDRVLQNLLTI EERYLPQCSYFKCVQKDIQPYMRRMVATWMLEVCEEQKCEEEVFPLAMNYLDRFLAGVPT PKSHLQLLGAVCMFLASKLKETSPLTAEKLCIYTDNSIKPQELLEWELVVLGKLKWNLAA VTPHDFIEHILRKLPQQREKLSLIRKHAQTFIALCATVAMAETLEPYSSKGEMSGVHVLT IRNLRHCAFTRIILSESSEQHYNVGIAVVIICRSENENSERSDFKFAMYPPSMIATGSVG AAICGLQQDEEVSSLTCDALTELLAKITNTDVLREMAQGDADGANPGSSSEIVRRYGFKM RAPNIRFLNEDSRTQTENLPPVQHEECFSNPPEENRGRDLTFIEMDCLKACQEQIEAVLL NSLQQYRQDQRDGSKSEDELDQASTPTDVRDIDL >gi568815586f:4174041_4400006|GENSCAN_predicted_CDS_3|1725_bp atggcgaggggcggggggcggggagggaggcgggtcgcggcgctggctccgggggaacct agtgtacggctcacccagcccgcgccccaccccgccttgctggctctccgcgcccctgcc cgggccccctctctcgaaagctgcatcggtgtggccacgctcagcgcagacacctcgggc ggcttgtcagcagatgcaggggcgaggaagcgggtttttcctgcgtggccgctggccgcg ggggaaccgctgggagccctgcccccggcctgcggcggccctagacgctgcaccgcgggg gggcagaagggacgttgttctggtccctttaatcggggctttcgaaacagcttcgaagtt atcaggaacacagacttcagggacatgacctttatctctgggccggggaaagcaggaggg agaggggccgccgggctggccatggagctgctgtgccacgaggtggacccggtccgcagg gccgtgcgggaccgcaacctgctccgagacgaccgcgtcctgcagaacctgctcaccatc gaggagcgctaccttccgcagtgctcctacttcaagtgcgtgcagaaggacatccaaccc tacatgcgcagaatggtggccacctggatgctggaggtctgtgaggaacagaagtgcgaa gaagaggtcttccctctggccatgaattacctggaccgtttcttggctggggtcccgact ccgaagtcccatctgcaactcctgggtgctgtctgcatgttcctggcctccaaactcaaa gagaccagcccgctgaccgcggagaagctgtgcatttacaccgacaactccatcaagcct caggagctgctggagtgggaactggtggtgctggggaagttgaagtggaacctggcagct gtcactcctcatgacttcattgagcacatcttgcgcaagctgccccagcagcgggagaag ctgtctctgatccgcaagcatgctcagaccttcattgctctgtgtgccaccgtggccatg gcagagaccctggaaccttactcatccaagggagagatgtcaggagttcatgttttgaca atcagaaaccttaggcactgtgcttttacaaggattattttaagtgaatcctcagaacag cactacaacgtgggtattgctgttgtcattatttgcagatcagaaaatgaaaactcagag aggtcagactttaagtttgccatgtacccaccgtcgatgatcgcaactggaagtgtggga gcagccatctgtgggctccagcaggatgaggaagtgagctcgctcacttgtgatgccctg actgagctgctggctaagatcaccaacacagacgtgctcagagaaatggcgcagggagat gctgacggagcaaatccggggtcctcctctgagatagttcgtagatatggttttaaaatg cgggctccaaacatacgctttttaaacgaggactccagaacacagactgaaaacctccct ccagtgcagcatgaagaatgcttttctaatcctccagaggaaaatcgtggacgggactta acgtttatagaaatggattgtctcaaagcttgccaggagcagattgaggcggtgctcctc aatagcctgcagcagtaccgtcaggaccaacgtgacggatccaagtcggaggatgaactg gaccaagccagcacccctacagacgtgcgggatatcgacctgtga >gi568815586f:4174041_4400006|GENSCAN_predicted_peptide_4|128_aa MRLGPAAGRDVSCEQLTQLYSACQRPQVNPGLRRKQNSLLKRLRKAKKEAPPMEKPEVVK THLRDMIILPKMVGSMVGVYNGKTFNQVEIKPEMISHYLGEFSITYKLVKHCRPGIGATH SSRFIPLK >gi568815586f:4174041_4400006|GENSCAN_predicted_CDS_4|387_bp atgagacttggaccagctgcgggacgggacgtgtcctgcgagcagctgacgcagctgtac agtgcgtgccagcggccgcaagtgaacccgggcctgcggcggaaacagaactcgctgctg aagcgcctgcgcaaggccaagaaggaggcaccgcccatggagaagccggaagtggtgaag acgcacctgcgggacatgatcatcctgcccaagatggtgggcagcatggtgggcgtctac aacggcaagaccttcaaccaggtggagatcaagccggagatgatcagtcactacctgggc gagttctccatcacctataagctggtgaagcactgccggcccggcatcggggccacccac tcctcccgcttcatccccctcaagtag >gi568815586f:4174041_4400006|GENSCAN_predicted_peptide_5|532_aa MKAEIKMFFETNENKETTYQNLWDTFKAVCRGKFIALNAHRRKQERSKVGTLTSQLKELE KQEQTNSKASRRQEITKIRAELEETETQKTLPKINESRSWFFEKINKIDRPLARPIKKRE KNQMEAIKNDKGDITTDPTEIRTTIREYYKHLYTNKLESLEEMDKFLDTYTLPRLKQEEV ESLIRPVTGSEIEAIINSLPTKKSPGPDGFTAEFFQRYKEELRIKYLGIQLTRDMKDLFK EHYKPLLNEIKEDTNKWKNIPCSWIGRINIMKMAILPKVIYRFNAIPFELPMTFFTDLEK TTLKFIWNQKRAHIAKSILSQKNKAGGITLPDFKLCYKATVTKTAWYWYQNRDIDQWNRT EPSEIIPHIYNYLNFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSR WSKDLHVRPKTIKTLEENLGNTIQDIGMGKDFMTKTPKAMATKAKVDEWDLIKLKSFCTA KETTIRVNRQPTEWEKIFPIYSSDKGLISRIYKELKQICKKKTTPSTSGQRI >gi568815586f:4174041_4400006|GENSCAN_predicted_CDS_5|1599_bp atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagagacaacataccag aatctctgggacacatttaaagcagtgtgtagagggaaatttatagcactaaatgctcac aggagaaagcaggaaagatctaaagttggcaccctaacatcacaattgaaagaactagag aagcaagagcaaacaaattcaaaagctagcagaaggcaagaaataactaagatcagagca gaactggaggagacagagacacaaaaaacccttccaaaaatcaatgaatccaggagctgg ttttttgaaaagatcaacaaaattgatagaccgctagcaagaccaataaagaaaagagag aagaatcaaatggaagcaataaaaaatgataaaggggatatcaccacagatcccacagaa atacgaactaccatcagagaatactataaacacctctacacaaataaactagaaagtcta gaagaaatggataaattcctcgacacctacaccctcccaagactaaaacaggaagaagtt gaatctctgattagaccagtaacaggctctgaaattgaggcaataattaatagcttacca accaaaaaaagtccaggaccagacggattcacagccgaattcttccagaggtacaaggag gagctgaggataaaatacctaggaatccaacttacaagggatatgaaggacctcttcaag gagcactataaaccactgctcaatgaaataaaagaggacacaaacaaatggaagaacatt ccatgctcatggataggaagaatcaatatcatgaaaatggccatactgcccaaggtaatt tatagattcaatgccatccccttcgagctaccaatgactttcttcacagacttggaaaaa actactttaaagttcatatggaaccaaaaaagagcccacattgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacgctacctgacttcaaactatgctacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaaca gagccctcagaaataataccacacatctacaactatctgaactttgacaaacctgacaaa aacaagaaatggggaaaggattccctattcaacaaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaattcaaga tggagtaaagacttacatgttagacctaaaaccataaaaaccctagaagaaaacctaggc aataccattcaggacataggcatgggcaaggacttcatgactaaaacaccaaaagcaatg gcaacaaaagccaaagttgacgaatgggatctaattaaactaaagagcttctgcacagca aaagaaactaccatcagagtgaacaggcaacctacagaatgggagaaaatttttccaatc tactcatctgacaaagggcttatatccagaatctacaaagaactcaaacaaatttgcaag aaaaaaacaaccccatcaacaagtgggcaaaggatatga >gi568815586f:4174041_4400006|GENSCAN_predicted_peptide_6|135_aa MHGILERSKFCKDMTVKYDSRLRERKYGVVEGKALSELRAMAKAAREECPVFTPPGGETL DQVKMRGIDFFEFLCQLILKEADQKEQFSQGSPSNCLETSLAEIFPLGKNHSSKVNSDSG IPGLAASVLVRQPLQ >gi568815586f:4174041_4400006|GENSCAN_predicted_CDS_6|408_bp atgcatggaattttggagagaagcaaattttgcaaagatatgacggtaaagtatgactca agacttcgggaaaggaaatacggggttgtagaaggcaaagcgctaagtgagctgagggcc atggccaaagcagccagggaagagtgccctgtgtttacaccgcccggaggagagacgctg gaccaggtgaaaatgcgtggaatagacttttttgaatttctttgtcaactaatcctgaaa gaagcggatcaaaaagaacagttttcccaaggatctccaagcaactgtctggaaacttct ttggcagagatatttcctttaggaaaaaatcacagctctaaagttaattcagacagcggt attccaggattagcagccagtgtcttagttaggcagccactgcagtga >gi568815586f:4174041_4400006|GENSCAN_predicted_peptide_7|62_aa MSIHYRSFAKDWVAHARGKIRAADLTSFSADTIESSLPILDKAAFLPPVSIPLTLTYFAQ QH >gi568815586f:4174041_4400006|GENSCAN_predicted_CDS_7|189_bp atgagcattcactacaggtccttcgccaaagactgggtagcacatgcacgaggtaagata cgagcagcagatcttacatccttcagtgctgacaccatcgagtcctccctgcccatcctg gataaagcagcattcctcccaccagtctccatcccccttactctgacctattttgctcaa cagcactga >gi568815586f:4174041_4400006|GENSCAN_predicted_peptide_8|251_aa MLGARLRLWVCALCSVCSMSVLRAYPNASPLLGSSWGGLIHLYTATARNSYHLQIHKNGH VDGAPHQTIYSALMIRSEDAGFVVITGVMSRRYLCMDFRGNIFGSHYFDPENCRFQHQTL ENGYDVYHSPQYHFLVSLGRAKRAFLPGMNPPPYSQFLSRRNEIPLIHFNTPIPRRHTRS AEDDSERDPLNVLKPRARMTPAPASCSQELPSAEDNSPMASDPLGVVRGGRVNTHAGGTG PEGCRPFAKFI >gi568815586f:4174041_4400006|GENSCAN_predicted_CDS_8|756_bp atgttgggggcccgcctcaggctctgggtctgtgccttgtgcagcgtctgcagcatgagc gtcctcagagcctatcccaatgcctccccactgctcggctccagctggggtggcctgatc cacctgtacacagccacagccaggaacagctaccacctgcagatccacaagaatggccat gtggatggcgcaccccatcagaccatctacagtgccctgatgatcagatcagaggatgct ggctttgtggtgattacaggtgtgatgagcagaagatacctctgcatggatttcagaggc aacatttttggatcacactatttcgacccggagaactgcaggttccaacaccagacgctg gaaaacgggtacgacgtctaccactctcctcagtatcacttcctggtcagtctgggccgg gcgaagagagccttcctgccaggcatgaacccacccccgtactcccagttcctgtcccgg aggaacgagatccccctaattcacttcaacacccccataccacggcggcacacccggagc gccgaggacgactcggagcgggaccccctgaacgtgctgaagccccgggcccggatgacc ccggccccggcctcctgttcacaggagctcccgagcgccgaggacaacagcccgatggcc agtgacccattaggggtggtcaggggcggtcgagtgaacacgcacgctgggggaacgggc ccggaaggctgccgccccttcgccaagttcatctag