GENSCAN 1.0    Date run: 3-Nov-116    Time: 18:27:01

Sequence gi568815590f:66329132_66530226 : 201095 bp : 42.23% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   2407   2446   40                              -2.75
 1.01 Init +   7882   8079  198  0  0   73   47   136 0.510   6.95
 1.02 Intr +   9781   9885  105  0  0   45  100    77 0.686   4.09
 1.03 Term +  16231  16431  201  0  0   21   34   196 0.717   3.91
 1.04 PlyA +  16486  16491    6                               1.05

 2.09 PlyA -  17430  17425    6                               1.05
 2.08 Term -  22677  22519  159  0  0   69   42   111 0.297   1.56
 2.07 Intr -  39470  39361  110  0  2   79   62   122 0.210   7.78
 2.06 Intr -  49637  49463  175  2  1   96   28   186 0.825  11.99
 2.05 Intr -  52500  52320  181  2  1   -3   54   104 0.431  -3.15
 2.04 Intr -  53798  53661  138  1  0   48   93    64 0.661   1.56
 2.03 Intr -  56577  56446  132  2  0   13  101   155 0.855   8.14
 2.02 Intr -  57732  57707   26  0  2  112   77    12 0.515  -1.59
 2.01 Init -  62443  62378   66  1  0   85  119    27 0.761   6.62
 2.00 Prom -  69553  69514   40                              -6.45

 3.00 Prom +  70100  70139   40                              -8.85
 3.01 Init +  71786  71907  122  1  2   36   56   152 0.175   6.51
 3.02 Intr +  84470  84574  105  2  0   88   70    34 0.005   0.01
 3.03 Term +  99772 101098 1327  1  1   57   38  1426 0.767 124.53
 3.04 PlyA + 101571 101576    6                               1.05

 4.00 Prom + 102581 102620   40                              -6.65
 4.01 Init + 103386 103444   59  2  2  106   75   132 0.966  12.55
 4.02 Intr + 116087 116283  197  2  2   50  103   216 0.993  17.44
 4.03 Intr + 119734 119839  106  2  1   59  111   100 0.489   7.75
 4.04 Intr + 122822 122974  153  2  0   68   87   130 0.714   9.17
 4.05 Intr + 124928 125026   99  2  0   55   77   101 0.625   4.01
 4.06 Intr + 127686 127764   79  0  1   87   43    33 0.377  -2.67
 4.07 Intr + 127939 128035   97  0  1   83   51   113 0.909   5.76
 4.08 Intr + 131177 131334  158  0  2   80  110   276 0.476  27.81
 4.09 Term + 131465 131542   78  1  0   58   51    68 0.613  -3.12
 4.10 PlyA + 132835 132840    6                              -0.45

 5.05 PlyA - 133146 133141    6                              -0.45
 5.04 Term - 134803 134640  164  2  2  101   42    97 0.525   3.52
 5.03 Intr - 138109 137970  140  0  2   35   66   150 0.722   6.69
 5.02 Intr - 138938 138843   96  1  0  111   68    66 0.816   5.11
 5.01 Init - 143260 143139  122  1  2   70   51   143 0.466   8.52
 5.00 Prom - 147254 147215   40                              -3.95

 6.04 PlyA - 147302 147297    6                               1.05
 6.03 Term - 151084 150925  160  0  1   59   49   127 0.183   2.33
 6.02 Intr - 155468 155411   58  0  1   89   97    46 0.169   2.62
 6.01 Init - 163219 163153   67  1  1   60   96    23 0.172   1.59
 6.00 Prom - 163788 163749   40                              -6.85

 7.00 Prom + 169300 169339   40                              -5.75
 7.01 Init + 173147 173284  138  1  0   27   61   123 0.447   3.69
 7.02 Intr + 176244 176397  154  2  1   93  104   182 0.943  19.02
 7.03 Intr + 181999 182124  126  2  0  131   78    94 0.383  12.43
 7.04 Intr + 182996 183291  296  0  2  -14   44   181 0.135  -0.70
 7.05 Intr + 184339 184506  168  0  0   62  110   113 0.501  10.12
 7.06 Term + 186839 187015  177  1  0   21   47   188 0.528   4.70
 7.07 PlyA + 188376 188381    6                               1.05

 8.03 PlyA - 189201 189196    6                               1.05
 8.02 Term - 192590 192447  144  2  0   72   41   142 0.375   4.83
 8.01 Init - 196115 196053   63  2  0   66   96    50 0.724   4.80

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 115475 115617  143  1  2   76   61   117 0.851   7.45

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:66329132_66530226|GENSCAN_predicted_peptide_1|167_aa
MAKATGQNENEHPMLQQELFNDYRRQRQGDKSRNSADGGWNKISLFLLTPIIFTQETNWG
KSGRHMGHNNSMIDTTSNPVLQMGKLEGTEQLNNLPKVTQPTLKSRDPFQLRREKDRMTE
GGAERGNKRKALPATRRKGPWAKDTGKGIETDSPPGPSELKVVLPTA

>gi568815590f:66329132_66530226|GENSCAN_predicted_CDS_1|504_bp
atggctaaggccacaggtcagaatgaaaatgagcacccaatgttacagcaagagctcttt
aatgactaccgaagacaaagacagggagacaagtctagaaattctgctgatggaggctgg
aacaaaatatcgctgtttttactcacacccatcatcttcacccaggaaactaactgggga
aagtcagggagacacatgggacataacaactctatgatagatacaacttctaatcccgtt
ttacagatgggcaagctagaaggcactgagcagttaaataatttgcccaaggtcacacag
ccaacccttaaaagtagagaccctttccaactgcggagagagaaagacaggatgacagaa
ggaggtgcagagagaggcaacaagaggaaggctctgccagccacacggaggaaggggcca
tgggccaaggacactggaaaagggatagaaacagattctcccccggggccttcagaactg
aaagtagttctgccaacagcttga

>gi568815590f:66329132_66530226|GENSCAN_predicted_peptide_2|328_aa
MATDLLALCGPIHPEQGDVFYLRWSFAMLPRETSRGGGALEDLRVVMEVPTAMKLTTSCG
SNLDFELLIEQFFLWSRHEALPCFSSKNLCGCELLMANRASCGLLDREETEIRSDLWKYS
WRGKKQIWTERGAEDSAKLQVTPWGDLDCVKPILTVLQRAPKVRPLYLCCNQPSVWAVLG
MCILSCGCLREALSKTLYGITSLKKQVPAGAGAQAIAGGAPATATSATRTACIHYTDVTV
CAEEVQHSSVRELKIKDEEARGTVARSLDEVIPMPKDCFNSKIMKWAVFLLVTKTLHPVT
GKGSRSRPQERVLGSRARKNSGRVLSAK

>gi568815590f:66329132_66530226|GENSCAN_predicted_CDS_2|987_bp
atggctactgatctcttagctctgtgtgggccaatacatcctgaacagggggatgtcttc
tatctgagatggagttttgccatgttgcccagagagacttcacggggtggaggagcattg
gaggatctcagggtagttatggaagtccccactgctatgaaactcaccacttcctgtggc
agcaacttagattttgagctcctcattgaacaattcttcctatggtcaagacatgaagca
ttaccctgtttctcatcaaagaatctctgtgggtgtgaactcctgatggcgaacagagcc
agctgtgggctccttgacagagaggagacagaaataagaagtgacttgtggaaatactcc
tggagaggaaagaagcagatttggacagagagaggagcagaagattcagcaaagcttcag
gtcactccgtggggagacctggattgtgtaaaacccattttaactgttctccagagagcc
ccaaaagtcagacctttatatctctgctgtaatcagccatctgtttgggctgtcctggga
atgtgcatcttgtcttgtggctgtttgagagaagccttgtcaaagaccctttatggaatc
acttccctgaagaaacaagttcctgcgggggcaggagctcaggccattgctggcggtgct
cctgccactgcgacatctgctacccgcacagcttgtattcattatactgatgtgacagtc
tgtgctgaagaagttcagcatagcagtgtaagagaattgaagattaaggatgaggaagca
agggggactgttgccaggtccctggatgaagtcatccctatgccaaaggattgctttaat
tcaaaaattatgaagtgggctgtgtttcttctggtgactaaaactcttcatcctgttacc
ggaaaggggtcccgatccagaccccaagagagggttcttggatctcgtgcaagaaagaat
tcagggcgagtcctcagtgcgaagtaa

>gi568815590f:66329132_66530226|GENSCAN_predicted_peptide_3|517_aa
MKKERIKDDSKFFGLAPWKNDVAIDKTGKTTDEADSEGGLWSKVVLHDRVLLDNNFGVFI
KTKYGVPFDPTILLLSLKVPLLIPSSLHTALCRPFLWSPGLRAASGRSGSEPFFSGLGIP
ASARGYAAGVWAATVGKVTSAAAPKRVSRAGAMEGQSVEELLAKAEQDEAEKLQRITVHK
ELELQFDLGNLLASDRNPPTGLRCAGPTPEAELQALARDNTQLLINQLWQLPTERVEEAI
VARLPEPTTRLPREKPLPRPRPLTRWQQFARLKGIRPKKKTNLVWDEVSGQWRRRWGYQR
ARDDTKEWLIEVPGNADPLEDQFAKRIQAKKERVAKNELNRLRNLARAHKMQLPSAAGLH
PTGHQSKEELGRAMQVAKVSTASVGRFQERLPKEKVPRGSGKKRKFQPLFGDFAAEKKNQ
LELLRVMNSKKPQLDVTRATNKQMREEDQEEAAKRRKMSQKGKRKGGRQGPGGKRKGGPP
SQGGKRKGGLGGKMNSGPPGLGGKRKGGQRPGGKRRK

>gi568815590f:66329132_66530226|GENSCAN_predicted_CDS_3|1554_bp
atgaaaaaagaaagaatcaaggatgactccaagttttttggcttggctccctggaaaaat
gatgttgccattgacaaaacagggaaaaccacagatgaagcggactcggagggaggatta
tggagtaaggtggtattgcatgatagggtccttttggataataattttggagtgttcatc
aaaactaaatatggagtaccctttgatcccactattctattactcagcctaaaggtcccg
cttctaatcccatcatctctccataccgccctctgccggcctttcctctggtcgcccgga
ttgcgagccgcttccggacgctccggaagcgaacctttcttttccggattgggcatcccg
gcatctgcacgtggttatgctgccggagtttgggccgccactgtaggaaaagtaacttca
gctgcagccccaaagcgagtgagccgagccggagccatggagggccagagcgtggaggag
ctgctcgcaaaggcagagcaggacgaggcagagaagttgcaacgcatcacggtgcacaag
gagctggagctgcagtttgacctgggcaacctgctggcgtcggaccggaaccccccgacc
gggctgcggtgcgccggacccacgccggaggccgagctacaggccctggcgcgggacaac
acgcaactgctcatcaaccagctgtggcagctgcccacggagcgcgtggaagaggcgata
gtggcgcggctgccggagcccaccacacgcctgccgcgagagaagcctctgccccgaccg
cggccacttacacgctggcagcagttcgcgcgcctcaagggcatccgtcccaagaagaag
accaacctggtgtgggacgaggtgagtggccagtggcggcggcgctggggctaccagcgc
gcccgggacgacaccaaagaatggctgattgaggtgcccggcaatgccgaccccttggag
gaccagttcgccaagcggattcaggccaagaaggaaagggtggccaagaacgagctgaac
cggctgcgtaacctggcccgcgcgcacaagatgcagctgcccagcgcggccggcttgcac
cctaccggacaccagagtaaggaggagctgggccgcgccatgcaagtggccaaggtctcc
accgcctctgtggggcgctttcaggagcgcctccccaaggagaaggtgccccggggctcc
ggcaagaaaaggaagtttcaaccccttttcggggactttgcagccgagaaaaagaaccag
ttggagctgcttcgtgtcatgaacagcaagaagcctcagctggatgtgactagggccacc
aataagcagatgagggaggaggaccaggaggaggccgccaagaggaggaaaatgagccag
aagggcaagagaaagggaggccggcaggggcctgggggcaagaggaaagggggcccgccc
agccagggagggaagaggaaagggggcttgggaggcaagatgaattctgggccgcctggc
ttgggtggcaagagaaaaggaggacagcgcccaggaggaaagaggaggaagtaa

>gi568815590f:66329132_66530226|GENSCAN_predicted_peptide_4|341_aa
MAAAARARVAYLLRQLQRAAFMEAIEFAQKGAFDAYVAVGGGSTMDTCKAANLYASSPHS
DFLDYVSAPIGKGKPVSVPLKPLIAGITSRAIKPTLGLIDPLHTLHMPARVVANSGFDVL
CHALESYTTLPYHLRSPCPSNPITRPAYQGSNPISDIWAIHALRIVAKYLKRAVRNPDDL
EARSHMHLASAFAGIGFGNAGVHLCHGMSYPISGLVKMYKAKDYNVDHPLVPHGLSVVLT
SPAVFTFTAQMFPERHLEMAEILGADTRTARIQDAGLVLADTLRKFLFDLDVDDGLAAVG
YSKADIPALVKGTLPQQTAVLSFPGATGPGPRLSVHPYSCS

>gi568815590f:66329132_66530226|GENSCAN_predicted_CDS_4|1026_bp
atggccgctgccgcccgagcccgggtcgcgtacttgctgaggcaactgcaacgcgcagcc
ttcatggaagctattgagtttgcccaaaagggagcttttgatgcctatgttgctgtcggt
ggtggctctaccatggacacctgtaaggctgctaatctgtatgcatccagccctcattct
gatttcctagattatgtcagtgcccccattggcaagggaaagcctgtgtctgtgcctctt
aagcctctgattgcaggcatcacttcgagagccatcaaacccacactgggactgattgat
cctctgcacaccctccacatgcctgcccgagtggtcgccaacagtggctttgatgtgctt
tgccatgccctggagtcatacaccaccctgccctaccacctgcggagcccctgcccttca
aatcccatcacacggcctgcgtaccagggcagcaacccaatcagtgacatttgggctatc
cacgcgctgcggatcgtggctaagtatctgaagagggctgtcagaaatcccgatgatctt
gaagcaaggtctcatatgcacttggcaagtgcttttgctggcatcggctttggaaatgct
ggtgttcatctgtgccatggaatgtcttacccaatttcaggtttagtgaagatgtataaa
gcaaaggattacaatgtggatcacccactggtgccccatggcctttctgtggtgctcacg
tccccagcggtgttcactttcacggcccagatgtttccagagcgacacctggagatggca
gaaatactgggagccgacacccgcactgccaggatccaagatgcagggctggtgttggca
gacacgctccggaaattcttattcgatctggatgttgatgatggcctagcagctgttggt
tactccaaagctgatatccccgcactagtgaaaggaacgctgccccagcagacagcagtt
ctgtcctttcctggtgcaacagggccaggccctcgcctttctgttcatccatacagctgt
agctga

>gi568815590f:66329132_66530226|GENSCAN_predicted_peptide_5|173_aa
MAWSPQQLPEDQALTGRARAAVWSGVAGRQRLAACATMTWRAFESHCPLPYALATAFCRN
QSWGKHPYHDKTSSTHQQECQSYFQNEAPVRALLTSCGTTTGIQDTAITRLSSLRSFLKA
WLWQLKASVHRAAISCHQYGLCLVKNTGLVVSSRENQSFTAEQKRTFIKCQLH

>gi568815590f:66329132_66530226|GENSCAN_predicted_CDS_5|522_bp
atggcctggtcccctcaacagctcccggaggaccaggcacttactggcagagcccgggca
gctgtctggagtggtgttgcgggtcgtcagagactcgcagcctgtgcaacgatgacctgg
agagctttcgaaagccactgccccttgccttacgccctggcaacagccttctgccgaaac
caatcttggggaaaacacccttatcatgacaaaactagttcaactcatcagcaggagtgt
cagtcctacttccaaaatgaagccccagtcagagcgcttctcacctcctgtggtaccaca
accggaatccaggacactgctatcactcgcctgtccagcttgagaagcttcctgaaagcc
tggctatggcagttgaaagcctctgtacacagggctgctattagctgccaccagtatggg
ctgtgtttggtcaagaatactggcctggttgtttcttccagagaaaaccaatccttcact
gctgaacagaaaagaaccttcatcaaatgtcaactacattga

>gi568815590f:66329132_66530226|GENSCAN_predicted_peptide_6|94_aa
MHQIYSCSDENIEVFTTVIPSKGCVAASLAFILRETGDKQDNPERTQQMGNFMATNGIVY
PLIVPVSAIKLNPLPTNKKISLLPVQLWKGPKLC

>gi568815590f:66329132_66530226|GENSCAN_predicted_CDS_6|285_bp
atgcatcagatttacagctgcagtgacgagaacatagaagttttcaccaccgtgattcct
tccaagggatgtgtagcagcatccctggcctttatcttgagggagacaggtgataaacaa
gataatcctgaacgcactcaacagatgggaaatttcatggccacaaatggaattgtttat
cctctcattgttcccgtgtcagcaatcaaactaaatccattacccaccaataaaaagatc
agcctgctgccagttcagctgtggaaggggccaaagctatgttga

>gi568815590f:66329132_66530226|GENSCAN_predicted_peptide_7|352_aa
MSQLSLLAWSHDNEDSWPLQNKQAVSHQKRWINEIAKVTFFSDFKLVVIESDLYTHQPLE
LLPHRGDRRDPGDRRRFGRLQTARPPTAHPAKASARPAGQLSFLEGDGVAHFQAYQYISL
ISGQECRVPSWLAESAVAGGLEIVGLGSHSLSEICDTSMGEQSRAHTAKSKSVQSVLGEI
GLRKGEKSGNKSFHPEIRGCKCSPAHGIMGRKRLHGPQGGTAALPTVGQLGMGGGVKKTA
SVRARVGRQMHPHTPSRFLMTDTASAPDLSENFSLCLIGGDSHGHREGSCSDERAVLPAA
SPASSGKCGQGTRRSLNMWEPPTAPLRPTKGDPLQVPCVGQGSPDKRKNLQD

>gi568815590f:66329132_66530226|GENSCAN_predicted_CDS_7|1059_bp
atgagtcagttaagccttctggcatggagccacgacaatgaagattcgtggccccttcag
aataagcaggcggtcagccatcagaaaagatggataaatgagattgcaaaggtcacattc
ttctcagatttcaagcttgtggtgatcgagtcggacctgtacacgcaccagcccctggag
ctgctgccccaccgcggagaccgcagggaccctggcgaccgccgcaggtttgggcggctc
cagaccgcgcggccgcccacagcccacccggccaaagcctctgccagacccgctgggcag
ctgtccttccttgaaggggatggggtggcacattttcaggcctatcagtacatctccctc
atctctgggcaggaatgtcgggttccctcttggttagcagagtcagcagtggctggaggc
ctggagatagtggggctagggtctcacagcttatcagaaatctgtgacaccagcatggga
gagcaatccagggcccacacagctaaatccaagtccgtgcaaagtgtgctgggtgagatt
ggcctaaggaaaggggaaaagagtgggaacaagtcctttcacccagagataagggggtgc
aagtgttcaccggcccatggcataatgggaaggaagcgtttgcacggaccacaaggtggc
acagcagctctgcccacagtcggccagcttgggatgggaggaggggtgaagaagacagcc
tcagtcagggccagagtgggaaggcagatgcatcctcacactccctcccgcttcctcatg
acagataccgccagtgccccggatctcagtgaaaacttcagcctctgcctcattggaggc
gacagccatgggcacagagaagggagctgttctgatgagagggcagtgcttcctgcggcg
tccccggcatcctccggaaaatgtggacaaggcacaagaagaagtctgaatatgtgggag
ccaccaacagcgcctttgaggccgactaaaggtgaccctcttcaagtgccctgtgttggc
caaggttccccggacaagaggaaaaaccttcaggattga

>gi568815590f:66329132_66530226|GENSCAN_predicted_peptide_8|68_aa
MGKIFVAFVLSQERPKTIQEQGPLSHPSTLSWVSVVSVAGEWDSAAPPVGPISLLLHGGH
LLPGSIAP

>gi568815590f:66329132_66530226|GENSCAN_predicted_CDS_8|207_bp
atggggaaaatatttgtggcctttgtcctgagtcaagagagacctaaaacaattcaagag
cagggtccattgtcccacccgtctacactgtcctgggtgtccgtggtgtccgtggctggc
gagtgggactctgcagcaccacctgtggggcccatttcccttcttcttcatggaggacat
ctgcttccagggagtattgcaccttga