GENSCAN 1.0    Date run: 5-Nov-116    Time: 23:26:51

Sequence gi568815596f:69688053_69925512 : 237460 bp : 41.30% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5491   5623  133  0  1   83   56    86 0.462   5.45
 1.02 Term +   7505   7593   89  0  2   67   48    99 0.472   0.64
 1.03 PlyA +   7704   7709    6                               1.05

 2.04 PlyA -   8029   8024    6                               1.05
 2.03 Term -   9288   9094  195  0  0   -4   33   195 0.325   1.23
 2.02 Intr -  11680  11651   30  1  0  120   91    31 0.453   4.01
 2.01 Init -  15136  15056   81  1  0   61   94    38 0.413   2.72
 2.00 Prom -  17566  17527   40                              -8.65

 3.03 PlyA -  17961  17956    6                               1.05
 3.02 Term -  20729  20567  163  1  1   53   49   154 0.157   4.43
 3.01 Init -  26833  26445  389  1  2   26   49   226 0.042   8.82
 3.00 Prom -  51964  51925   40                              -2.05

 4.04 PlyA -  52126  52121    6                               1.05
 4.03 Term -  53853  53543  311  1  2   93   49   207 0.154  11.54
 4.02 Intr -  77373  77290   84  1  0   55   99    69 0.551   3.57
 4.01 Init -  79370  79268  103  2  1   83   66    63 0.543   4.05
 4.00 Prom -  87581  87542   40                              -4.25

 5.10 PlyA -  87900  87895    6                               1.05
 5.09 Term - 101337 101262   76  2  1   76   36   139 0.668   3.83
 5.08 Intr - 101874 101737  138  2  0   89   29   117 0.855   4.56
 5.07 Intr - 102384 102253  132  2  0   66   -8   188 0.714   5.84
 5.06 Intr - 102728 102628  101  2  2   65   97    32 0.725  -0.21
 5.05 Intr - 106169 106021  149  0  2   49  111   119 0.948   9.33
 5.04 Intr - 107072 106933  140  1  2   29   94    66 0.788   0.49
 5.03 Intr - 108308 108199  110  2  2   60   46    71 0.363  -1.74
 5.02 Intr - 109132 108587  546  1  0  100   50   172 0.568   6.75
 5.01 Init - 110852 110736  117  2  0   34  101    91 0.833   5.25
 5.00 Prom - 110982 110943   40                              -5.25

 6.00 Prom + 112188 112227   40                             -10.45
 6.01 Init + 113632 113704   73  0  1   72   65    -7 0.537  -3.16
 6.02 Intr + 116481 116575   95  1  2  122   61   152 0.941  14.66
 6.03 Intr + 118333 118446  114  0  0   57   66   186 0.805  13.02
 6.04 Intr + 119854 119944   91  0  1   84   56   105 0.840   5.55
 6.05 Intr + 122542 122621   80  2  2   43   68   103 0.677   2.15
 6.06 Intr + 124601 124657   57  1  0  111   77    53 0.897   4.66
 6.07 Intr + 128049 128142   94  2  1   80   91    48 0.993   2.92
 6.08 Intr + 130547 130642   96  0  0   89   87    77 0.982   6.76
 6.09 Intr + 131228 131286   59  0  2   91   94    34 0.991   1.98
 6.10 Intr + 132647 132769  123  1  0   96  100   176 0.939  19.46
 6.11 Intr + 140261 140442  182  1  2   19   56   160 0.161   3.64
 6.12 Intr + 141987 142100  114  0  0   33  102   198 0.270  14.34
 6.13 Intr + 142190 142451  262  2  1   17   89   231 0.738  12.67
 6.14 Intr + 151405 151501   97  0  1   63   88    67 0.366   2.86
 6.15 Intr + 152890 152987   98  2  2   64  111    83 0.605   7.01
 6.16 Intr + 155097 155209  113  2  2   74   87    93 0.497   6.06
 6.17 Intr + 157112 157295  184  2  1    9   49   164 0.110   3.47
 6.18 Intr + 161600 161690   91  1  1   53  102    23 0.327  -1.15
 6.19 Intr + 166771 166908  138  2  0   63  110    61 0.713   5.31
 6.20 Intr + 173226 173295   70  1  1   69   98    61 0.890   2.52
 6.21 Intr + 176848 176923   76  1  1   68  100    56 0.920   3.40
 6.22 Term + 181667 181816  150  1  0   95   45   113 0.921   4.63
 6.23 PlyA + 181866 181871    6                               1.05

 7.06 PlyA - 183669 183664    6                               1.05
 7.05 Term - 185771 185510  262  1  1  127   38   354 0.992  28.31
 7.04 Intr - 197653 197532  122  1  2   44   79    73 0.264   0.37
 7.03 Intr - 201209 201155   55  1  1   48   99    50 0.111   0.16
 7.02 Intr - 205973 205810  164  2  2   45   64   192 0.920  10.35
 7.01 Init - 208364 208308   57  2  0   57   83    21 0.411  -0.14
 7.00 Prom - 209490 209451   40                              -2.95

 8.05 PlyA - 209849 209844    6                               1.05
 8.04 Term - 214697 214114  584  0  2 -163   28  1234 0.627  85.97
 8.03 Intr - 226630 226264  367  1  1   44  105   176 0.002   8.69
 8.02 Intr - 227332 227102  231  1  0  105   55   196 0.002  15.05
 8.01 Init - 232560 232477   84  0  0  103   74    39 0.109   4.87

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 228069 228168  100  1  1  112   91    70 0.808   7.95

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:69688053_69925512|GENSCAN_predicted_peptide_1|73_aa
MQHVFRPLGIPEDMAPCLQVPKDCHMSGPQCLWGQERKEMKQAQMIEWKQRSGGETTLGR
AGIPLGEAFPTVD

>gi568815596f:69688053_69925512|GENSCAN_predicted_CDS_1|222_bp
atgcagcatgtcttcaggcctctggggattccagaggacatggcaccctgcctgcaggtc
cccaaggactgccacatgtcagggccacagtgtctgtggggacaggagaggaaggaaatg
aagcaggcacaaatgattgaatggaagcaacgaagtggaggagaaaccacgctggggaga
gctggcatccccttgggagaagcctttccaacagtggactga

>gi568815596f:69688053_69925512|GENSCAN_predicted_peptide_2|101_aa
MRSGRLSTEGKGGQGLKRGFMAADVLQVLGVSSPPQVAGRRKEAAEQKSEASRGCFMRFK
ARSHLYNIKVQGEEASADGEAAASYPEDLAKIIDEGGYTEQ

>gi568815596f:69688053_69925512|GENSCAN_predicted_CDS_2|306_bp
atgaggtcaggaaggttgtctacagagggaaaaggaggacagggtctgaaaaggggtttc
atggcagcagatgttttacaggttttaggagtatcaagtccacctcaagtggctgggaga
cgtaaggaagctgcagaacaaaagtctgaggctagcagaggttgcttcatgagatttaag
gcaagaagccatctctataacataaaagtgcaaggtgaagaggcaagtgctgatggagaa
gctgcagcaagttatccagaagatctagctaagatcatagatgaaggtggctacactgaa
caatag

>gi568815596f:69688053_69925512|GENSCAN_predicted_peptide_3|183_aa
MRYPQDTRLTKNGQQGALWTSPGEAKGKTAEVLNLDLPNVLLTDSGTSGPGCLAIDWVHA
EARLPPRASHHKLGPSSGIWPNAGPLPCIGAVGTTQEVNGSSARIPGNKALRFDTLMGTL
WTAGWLDWGRWMGCRCSRGKEADLEYTGTLDCWRTIFREERGKAFFKGMWSSETGTCWSC
RMS

>gi568815596f:69688053_69925512|GENSCAN_predicted_CDS_3|552_bp
atgaggtatccacaagacacaaggttaaccaagaatgggcagcagggggcgttgtggaca
agccccggggaagccaagggaaaaactgcagaggttctgaacctagacctgcccaacgtc
ctgctgaccgactcaggcacctccggacctgggtgtctggctattgattgggtacatgct
gaagcacgcttgcctcccagggcatcccatcacaaattaggccccagctctgggatctgg
cccaacgcaggccctctcccatgtataggagctgtggggacaacacaggaggttaatggg
agctcggccaggatcccaggaaataaagcactgaggtttgatacactgatgggcacactg
tggacagcaggctggctggactggggcaggtggatgggatgcagatgcagtcgaggcaaa
gaagctgacctcgagtacacaggcaccctcgattgttggaggaccatcttcagagaggaa
aggggcaaagctttcttcaagggcatgtggtcctcagaaacagggacctgctggtcctgc
aggatgagctga

>gi568815596f:69688053_69925512|GENSCAN_predicted_peptide_4|165_aa
MADPTTIAKFPKYSLLQRLLMHHDFGVSESSVETEWLLFKTEQQKTTNVGEDMEKLEPFY
TAGPPGPPASPPARIGQCPQYRQPRSWPRRVGAVGFLGSRFPGAPHPRRAEPGPPRQFLP
APAAPSSAPSRSPAANPRFPPPRETVRLPTAGRAEPVAGKVESEI

>gi568815596f:69688053_69925512|GENSCAN_predicted_CDS_4|498_bp
atggctgatccaaccacaatagcaaagttcccaaagtacagcctgcttcagagactactg
atgcatcatgacttcggggtatctgaatccagtgtcgaaacagaatggctactattcaaa
acagaacaacagaaaacaacaaatgttggtgaagatatggagaaacttgaacccttctac
actgctggcccgcccggtcccccggcctcgccccctgcccggattggacagtgtccccag
tatcgccagccccgctcctggccgcgcagggtgggagcagttggattcttggggtcccgt
ttcccgggcgctccacaccctcggagggccgagcccgggccacctcgccagttcctccct
gcgcctgccgcgcccagctccgcgccaagccggagtcccgcggcaaatccgcgcttcccc
ccacctcgggaaaccgttcgtctcccaacggcaggtcgtgccgagcccgtggccggaaaa
gtagaaagtgaaatctga

>gi568815596f:69688053_69925512|GENSCAN_predicted_peptide_5|502_aa
MSLTSISGRVTKNLSIHPRQQGCFILRARLGFEGDFIRMMGNKPSSAGTPLECILKHWES
FDPKTLKKKQLIFFCRRAWPSYCLRDEQTWPAEGNLDFNLIHQLDLFCRQESKWSKVPYA
QAFFAPRDNPDVSKHCTIDSALLAIISGGSVESNSPKLEEQVLQEPLGAASKCPSPSSFP
NLGPPPTAPSASPASPSPKLPTAPASLLPLQEMPDGRGTPRGLGGAPGEKEKVTKAQGDA
LQACRVQDPQEASTNCYWKAQEDAKTLLNFLAEREYRVSKSKAQLCQTSVKYLVLVLSEG
TRAPGKSLADKQLPPQIPSFAAKGICSPAENLPLPEPSHFLPEKTGEPKHDCEQESLAAA
QRKQKGIGNRKIWKHVLVLGNYPTNPARNHQASDGAENGAENETALLRGTLKSTPGGALA
AVPHTTPLSSRKRSKIEELKTDLGGCLQLQEDVWEQTQKLSLPDKQDKETQNKSPSMCNK
QTDSRQECGLSADDPELEMGSA

>gi568815596f:69688053_69925512|GENSCAN_predicted_CDS_5|1509_bp
atgtccttgacctcaatatctggtcgtgtgaccaagaacctcagtattcaccccagacaa
caaggctgcttcattttgagggctcgtctgggatttgaaggtgacttcatcagaatgatg
ggtaacaagccatcttcagccggcactcctctggagtgcattctgaagcactgggagtcc
tttgaccctaaaactttgaagaaaaagcagctcattttcttttgcagaagggcatggcct
tcttactgtctcagggatgaacagacatggccagctgaagggaaccttgattttaatctt
atccatcaattagatcttttctgtagacaggagagcaaatggtccaaggtcccctatgca
caggctttctttgccccgcgagacaacccagacgttagcaagcactgcacaattgactca
gctctcttagcaatcatatcaggcgggtctgtggagagtaattcccctaagttagaagag
caagttttgcaggagccattgggggcagcttctaagtgccccagtccttctagttttcct
aatctggggccccctccaactgcaccatcagcttctccagcttcaccatctccaaagctc
cccactgccccagcttcacttctacccctacaagaaatgccagatggaaggggcacccct
aggggactgggaggagccccaggagaaaaagagaaagtgactaaggctcaaggagatgct
ttgcaggcttgcagagtccaggatcctcaggaagcatccactaattgctactggaaggct
caggaagatgctaagactctccttaatttcttagctgaaagggagtatagagtctcaaaa
tccaaagctcagctctgtcagacttcagtaaagtacctagttctggtcttatcagaaggg
accagagcaccaggaaagtctctggctgacaaacaattgcctcctcaaataccaagcttt
gctgctaaagggatctgcagtccagctgaaaacctgcccttgcctgagcccagccacttt
ctcccagagaaaactggagaacctaaacatgattgtgaacaggaatctctggctgcagct
cagagaaaacagaaaggcatcggtaatagaaaaatctggaaacatgttctagttctgggc
aattatcctacaaatcctgccagaaaccatcaagcttcagatggtgctgaaaatggagcc
gaaaatgaaactgccctcctacgagggacccttaaatcaaccccaggaggagccctagct
gctgttccccacacaacgccactctccagcaggaagcgcagtaagattgaggagctaaaa
acagacttgggtggatgtctgcagctgcaagaagatgtgtgggaacagacacagaaactc
tccctcccagataagcaagacaaagaaacacagaataagagtccatctatgtgcaacaag
cagacggactcaaggcaagaatgtggacttagtgctgatgaccccgagcttgagatggga
tctgcctag

>gi568815596f:69688053_69925512|GENSCAN_predicted_peptide_6|818_aa
MSYHTRPALTICLNNFFLSPVSAGGTDEDAIISVLAYRNTAQRQEIRTAYKSTIGRDLID
DLKSELSGNFEQVIVGMMTPTVLYDVQELRRAMKGAGTDEGCLIEILASRTPEEIRRISQ
TYQQQYGRSLEDDIRSDTSFMFQRVLVSLSAGGRDEGNYLDDALVRQDAQDLYEAGEKKW
GTDEVKFLTVLCSRNRNHLLHVFDEYKRISQKDIEQSIKSETSGSFEDALLAIVKCMRNK
SAYFAEKLYKSMKGLGTDDNTLIRVMVSRAEIDMLDIRAHFKRLYGKSLYSFIKTLVHGD
PQTTYTANGSLNWYLDAINPESNLVIANKVADFAANSLLGICLVNPLHMMFIAALHKRKR
SSGSFCYCHPDSETDEDEEEGDEQQRLLNTPRSAPQPMERGRALDRVNVEAGPMRGGLGS
ALGDARCSAFSTTDASRVLLQPQDEDEKRQARARVSLNLRDARLAGLPSPPLPSQVGSRR
SGYFSSMFSGSWKESSMNIIELEIPDQNIDVEALQVAFGSLYRDDVLIKPSRVVAILAAA
CLLQLDGLIQQCGETMKETVNVKTVCGYYTSAGTYGLDSVKKKLQAGAAEAPAVNCAAFV
LKRDLVRRSTAQSQRSWRWSQRSKISPSPVDYVVQAGIIGPQSPWMFLQLVPSWNGSLKQ
LLTETDVWFSKQRKDFEGMAFLETEQGKPFVSVFRHLRLQYIISDLASARIIEQDAVVPS
EWLSSVYKQQWFAMLRAEQDSEVGPQEINKEELEGNSMRCGRKLAKDGEYCWRWTGFNFG
FDLLVTYTNRYIIFKRNTLNQPCSGSVSLQPRRSIAFR

>gi568815596f:69688053_69925512|GENSCAN_predicted_CDS_6|2457_bp
atgagctaccacacccgacctgccctcactatctgcttaaacaatttctttctatctcct
gtatcagcaggaggcaccgatgaagacgccattattagcgtccttgcctaccgcaacacc
gcccagcgccaggagatcaggacagcctacaagagcaccatcggcagggacttgatagac
gacctgaagtcagaactgagtggcaacttcgagcaggtgattgtggggatgatgacgccc
acggtgctgtatgacgtgcaagagctgcgaagggccatgaagggagccggcactgatgag
ggctgcctaattgagatcctggcctcccggacccctgaggagatccggcgcataagccaa
acctaccagcagcaatatggacggagccttgaagatgacattcgctctgacacatcgttc
atgttccagcgagtgctggtgtctctgtcagctggtgggagggatgaaggaaattatctg
gacgatgctctcgtgagacaggatgcccaggacctgtatgaggctggagagaagaaatgg
gggacagatgaggtgaaatttctaactgttctctgttcccggaaccgaaatcacctgttg
catgtgtttgatgaatacaaaaggatatcacagaaggatattgaacagagtattaaatct
gaaacatctggtagctttgaagatgctctgctggctatagtaaagtgcatgaggaacaaa
tctgcatattttgctgaaaagctctataaatcgatgaagggcttgggcaccgatgataac
accctcatcagagtgatggtttctcgagcagaaattgacatgttggatatccgggcacac
ttcaagagactctatggaaagtctctgtactcgttcatcaagacattagtccacggggac
cctcaaactacgtatactgccaatgggagtctaaattggtaccttgatgcaatcaaccca
gagagcaatttggtaatagctaacaaagttgcagatttcgcagcaaattcacttttggga
atctgcttggtgaatccattgcacatgatgttcattgctgctctccacaagcgcaagcgg
agcagcgggtccttctgctactgtcaccctgactcggagacggacgaggatgaggaggag
ggggacgagcagcagcggctcctcaacacccctcgaagcgctccccagcctatggagagg
ggacgtgcccttgacagggtgaatgtggaagccggcccgatgcggggcggactggggagc
gccttgggagacgccaggtgttccgcattttcaaccacagacgcttccagggtgttgtta
cagccccaagatgaggatgagaaacgccaagcccgagctcgcgtctcgttaaaccttaga
gacgcccggctggctggccttcccagtccacctttgccctctcaagtgggtagccgcaga
tctggctacttttctagtatgttcagtggttcttggaaagaatccagcatgaatattatt
gaactggagattcctgaccagaacattgatgtagaagcactgcaggttgcatttggttca
ctgtatcgagatgatgtcttgataaagcccagtcgagttgttgccattttggcagcagct
tgtttgctgcagttggacggtttaatacagcagtgtggtgagacaatgaaggaaacagtt
aatgtgaaaactgtatgtggctattacacatcagcagggacctatggattagattctgta
aagaaaaagcttcaggctggggctgcagaggctcctgcagtcaactgcgccgccttcgtc
ctcaagcgggacctggtcagacgcagcacagcgcagagccagaggtcctggagatggagc
cagaggtccaagatttctccatcacctgtggactatgtagtccaagctggaataatagga
ccgcagtcaccctggatgttccttcaacttgtgccttcttggaatggatctttaaaacag
cttttgacagaaacagatgtctggttttctaaacagaggaaagattttgaaggtatggcc
tttcttgaaactgaacaaggaaaaccatttgtgtcagtattcagacatttaaggttacaa
tatattatcagtgatctggcttctgcaagaattattgaacaagatgctgtagtaccttca
gaatggctctcttctgtgtataaacagcagtggtttgctatgctgcgggcagaacaggac
agtgaggtggggcctcaagaaatcaataaagaagaactagagggaaacagcatgaggtgt
ggtagaaagcttgccaaagatggtgaatactgctggcgttggacaggttttaacttcggc
ttcgacctacttgtaacttacaccaatcgatacatcattttcaaacgcaatacactgaat
cagccatgtagcggatctgtcagtttacagcctcgaaggagcatagcatttaggtag

>gi568815596f:69688053_69925512|GENSCAN_predicted_peptide_7|219_aa
MLETDRHINFVSISTDIHEDSPLRGERLRLRPIWSPASRKQLCNNFREGREPSFYRRPEN
VKTSCEGSGLLCSRSWTSMGLWPVRNRATWQENTTDWIYKEEFIWLVVLKTGKSKSMLPA
SGEGHLMVEGIVWPDHYATNKFSQATDLAMKKTEDNNALVFIVDVEVKKHKIKQAVEELY
DTNVAKINTVIRADGEKKAHVQLAPEHAAFRGCQKIGII

>gi568815596f:69688053_69925512|GENSCAN_predicted_CDS_7|660_bp
atgttagaaacagacagacatatcaactttgtatcaatcagcacagatatacatgaggac
tcacccctccgtggagagcggctgcgactgcgacccatttggagtcccgcttcccggaaa
caactgtgcaacaacttccgggagggccgggaaccgagcttttaccgccgaccagaaaac
gtcaaaacgagttgcgagggttccggtcttctttgttcacggtcatggaccagtatgggt
ctgtggcctgttaggaaccgggccacatggcaggagaataccacagactggatatacaaa
gaagagtttatttggctcgtggttctgaagactgggaagtctaagagcatgcttccagca
tctggtgagggtcatctcatggtggaaggcatcgtgtggcctgaccactatgcgaccaac
aagttctcccaggccactgacttggccatgaagaagacagaggacaacaatgcacttgtg
ttcattgtggatgtcgaggtcaaaaagcataagatcaaacaggctgtggaggagctctat
gacaccaatgtggccaagatcaacaccgtgatcagggctgatggagagaagaaagcccat
gttcagctggctcctgagcatgctgcttttagaggctgccaaaaaattgggatcatctaa

>gi568815596f:69688053_69925512|GENSCAN_predicted_peptide_8|421_aa
MEEYHLKSAQPLIYIRGTNIKFTHQRMGIVGRLQQHLDVHPNRRRHSAPGAGATGPTPTA
HGLAAAAGPLWSPLWSPLWSPLWTPRSRAEGASLATTLDKRGRGPASPKWLMVSSLAAPH
RVWHWLGGPVRRWAALIHQPLRSSAPPGLASDLLPRPVPGENFRGPARRPSNGRLAPRVA
WALCPAWVRGRSPRSGGRGPGGSGLRGTRAGEGPASNPPEERCRLVAEAAAAEETETAAA
EAEEEARAAEGAAEGAAEEAEAKEARAAAAQAAEEAETQAAEAEEAKAAEGAAEAAEAEA
EARAAAAQAAAAAEETETAAAEAEEEARAAEGAAEEADAEAEPEEAGAAAGEAAQTEAAA
RAAAAAAEVEEGGGGGRRRRKEEGGGGRRRKKEGGRRRKKGGGRREEEGGRRKEESPMRQ
R

>gi568815596f:69688053_69925512|GENSCAN_predicted_CDS_8|1266_bp
atggaggaataccacctcaagagtgctcagccactcatttacatcaggggcaccaacatc
aagttcactcatcagagaatggggatagtcggccgcctccagcagcatctggatgttcat
ccgaaccgccgccgccattctgcaccgggggccggagccacgggaccaacccccactgcc
cacgggctcgctgccgccgccggaccgctgtggagcccgctatggagcccgctgtggagc
ccgctgtggaccccgcggagcagggctgagggagcctctctggcaaccacgctcgacaag
agagggagggggccagcgagccctaagtggctgatggtcagcagcctggctgctcctcac
cgggtctggcattggctgggagggccggtccggcgctgggcggcgctgattcaccagccg
ctgcggagctctgctccacccggccttgcctcggacctgttgccaagacctgtccccggg
gaaaacttccggggacctgccaggcgtccatcgaatggaaggctggcacctcgtgtcgcc
tgggcgctctgccccgcttgggttcggggccggagccctcggtctggtggccgcggccct
ggaggttctgggcttcgcggaactcgggctggcgagggcccagcctcgaatcctcccgaa
gagcggtgcaggctagtggccgaagcagcagcagcagaagaaactgaaacagcagcagca
gaagcagaagaagaagcaagagcagcagaaggagcagcagaaggagcagcagaagaagca
gaagcaaaagaagcaagagcagcagcagcacaagcagcagaagaagcagaaacacaagca
gcagaagcagaagaagcaaaagcagcagaaggagcagcagaagcagcagaagcagaagca
gaagctagagcagcagcagcacaagcagcagcagcagcagaggaaacagaaacagcagca
gcagaagccgaagaagaagcaagagcagcagaaggagcagcagaagaagcagacgcagaa
gcagaaccagaagaagcaggagcagcagcaggagaagcagcacaaacagaagcagcagca
agagcagcagcagcagcagcggaagtagaggaaggaggaggaggaggaagaaggaggagg
aaggaggaaggaggaggaggaaggagaagaaagaaggaaggaggaaggagaagaaagaag
ggaggaggaaggagggaggaggaaggaggaaggaggaaggaggagagtccaatgaggcag
agataa