GENSCAN 1.0 Date run: 8-Nov-116 Time: 16:44:59 Sequence gi568815575r:8691314_8900171 : 208858 bp : 40.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1257 1391 135 2 0 100 47 76 0.638 1.74 1.02 PlyA + 1572 1577 6 1.05 2.06 PlyA - 4079 4074 6 1.05 2.05 Term - 5526 5331 196 2 1 90 45 57 0.197 -2.50 2.04 Intr - 8432 8385 48 1 0 127 102 13 0.243 3.48 2.03 Intr - 10857 10768 90 2 0 80 82 49 0.108 1.69 2.02 Intr - 19720 19617 104 1 2 116 20 72 0.216 1.25 2.01 Init - 20983 20912 72 1 0 65 53 96 0.274 4.92 2.00 Prom - 32205 32166 40 -5.45 3.04 PlyA - 32765 32760 6 1.05 3.03 Term - 33341 33049 293 0 2 82 54 119 0.186 2.42 3.02 Intr - 39856 39754 103 1 1 -14 61 123 0.256 -1.77 3.01 Init - 40723 40517 207 1 0 106 96 247 0.996 24.17 3.00 Prom - 43500 43461 40 -6.95 4.04 PlyA - 44945 44940 6 1.05 4.03 Term - 45579 45492 88 2 1 77 54 150 0.952 6.65 4.02 Intr - 48158 47970 189 1 0 80 63 84 0.229 2.88 4.01 Init - 57850 57756 95 1 2 69 81 114 0.502 6.71 4.00 Prom - 77585 77546 40 -5.85 5.00 Prom + 77610 77649 40 -6.35 5.01 Sngl + 82099 82548 450 0 0 48 43 181 0.168 5.56 5.02 PlyA + 83055 83060 6 1.05 6.00 Prom + 84093 84132 40 -5.15 6.01 Sngl + 85365 86018 654 2 0 33 43 320 0.882 17.92 6.02 PlyA + 86046 86051 6 1.05 7.04 PlyA - 88586 88581 6 1.05 7.03 Term - 91553 91301 253 1 1 48 42 159 0.779 1.43 7.02 Intr - 92029 91829 201 0 0 17 31 205 0.448 5.18 7.01 Init - 92664 92387 278 0 2 47 59 400 0.886 29.50 7.00 Prom - 94048 94009 40 -3.55 8.11 PlyA - 95398 95393 6 1.05 8.10 Term - 104107 103711 397 0 1 85 39 896 0.387 77.96 8.09 Intr - 105069 104955 115 1 1 86 86 72 0.980 5.39 8.08 Intr - 106868 106837 32 1 2 91 91 29 0.976 0.36 8.07 Intr - 107166 107046 121 1 1 57 55 175 0.944 9.73 8.06 Intr - 107781 107653 129 1 0 55 97 153 0.707 12.65 8.05 Intr - 108864 108768 97 0 1 5 50 180 0.000 4.56 8.04 Intr - 109150 108998 153 1 0 36 35 148 0.000 3.65 8.03 Intr - 109869 109700 170 1 2 50 47 144 0.000 5.24 8.02 Intr - 110290 110188 103 1 1 67 -14 118 0.000 -1.67 8.01 Init - 112631 112566 66 2 0 87 93 30 0.000 4.52 8.00 Prom - 121126 121087 40 -5.65 9.00 Prom + 121468 121507 40 -7.45 9.01 Init + 124236 124425 190 2 1 40 100 107 0.764 6.32 9.02 Intr + 129731 129903 173 0 2 16 103 265 0.992 19.54 9.03 Term + 146149 146520 372 0 0 82 47 138 0.013 2.81 9.04 PlyA + 147542 147547 6 1.05 10.03 PlyA - 148188 148183 6 1.05 10.02 Term - 154164 153995 170 1 2 36 37 168 0.412 3.56 10.01 Init - 158954 158816 139 2 1 85 34 69 0.341 1.60 10.00 Prom - 159090 159051 40 -2.95 11.02 PlyA - 160275 160270 6 1.05 11.01 Sngl - 165713 165354 360 2 0 89 39 561 0.984 47.12 11.00 Prom - 168354 168315 40 -7.45 12.00 Prom + 168682 168721 40 -9.25 12.01 Init + 169748 170021 274 1 1 47 33 238 0.240 11.38 12.02 Intr + 172561 172725 165 2 0 26 73 108 0.409 2.11 12.03 Intr + 182528 182668 141 0 0 16 27 163 0.372 2.40 12.04 Intr + 182941 183102 162 2 0 69 100 152 0.546 13.53 12.05 Intr + 187048 187223 176 2 2 55 55 152 0.440 7.34 12.06 Intr + 193401 193512 112 2 1 12 105 69 0.163 0.03 12.07 Term + 199882 199967 86 2 2 108 46 97 0.412 4.34 12.08 PlyA + 201192 201197 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 108858 108768 91 0 1 105 50 162 0.993 14.80 S.002 Init + 109613 110237 625 1 1 67 47 308 0.881 19.15 S.003 Term + 112587 112765 179 1 2 69 39 166 0.961 6.67 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:8691314_8900171|GENSCAN_predicted_peptide_1|44_aa GIQLRKAFPTFPRSRSEDTHVRASRSLEERNMLISEDEGSQRGT >gi568815575r:8691314_8900171|GENSCAN_predicted_CDS_1|135_bp ggcatacagcttaggaaagcttttccaacattcccaagaagcaggtcagaagacactcat gtgagggcctcccgatccctggaagaaaggaacatgctcatctctgaagatgaagggtca cagagaggaacctga >gi568815575r:8691314_8900171|GENSCAN_predicted_peptide_2|169_aa MTGVLTERRNFNTDMQRCEDTDVKTLVEILSSPLYAHFHDVVCVIVLVLPLSIMEFSNIF RPYCLYGGHRCGGRDEGPDVDVRNSVGGRTMVPWFGARITSNVLRNNCQWLEIFLIVTVG SEELLASSGSRPGMLIKNLYCTEKPSHSKDYLAPNISNAEIDKLWSTHL >gi568815575r:8691314_8900171|GENSCAN_predicted_CDS_2|510_bp atgactggtgtccttactgaaaggagaaattttaacacagacatgcagagatgtgaggac acggatgtgaagacattagtggaaattctaagttccccactatatgctcatttccatgat gtggtttgtgtcatcgtcttggtcctgcccttgagtattatggaattcagtaatatcttc agaccttactgtctctatgggggacatagatgtggggggagagatgagggcccagatgtg gacgtacggaactcagtaggtggtagaacaatggttccctggtttggtgccagaatcaca agcaatgttctaaggaacaattgccaatggctggaaatatttttgattgtcacagttggg agtgaggagttactggcatctagtgggtcaaggccagggatgctaataaagaacctatac tgcacagaaaagccctcccacagcaaggattatctggctccaaatattagtaatgctgag attgacaaactctggtctacacatttgtga >gi568815575r:8691314_8900171|GENSCAN_predicted_peptide_3|200_aa MVPGVPGAVLTLCLWLAASSGCLAAGPGAAAARRLDESLSAGSVQRARCASRCLSLQITR ISAFFQHFQRRASGKKCRGQGIWRGRIRILLNESRGNRPPNPEPANSLQLLPFKGITFGR WAPSCLQGHASSSLDVRRPWRLALCSGDIGGQPPAGISSVMQFMPKSIAWDQAYAGLWLR AHPHTGLISAFSCFLSSLFY >gi568815575r:8691314_8900171|GENSCAN_predicted_CDS_3|603_bp atggtgcccggggtgcccggcgcggtcctgaccctctgcctctggctggcggcctccagc ggctgcctggcggccggccccggcgcggctgctgcgcggcggctggacgagtcgctgtct gccgggagcgtccagcgcgctcgctgcgcctccaggtgcctgagcctgcagatcactcgc atctccgccttcttccagcacttccagcgtagagcgtcagggaagaagtgccggggacag gggatttggcggggccgtattcggatactcctcaacgagtcccgggggaaccgtccccca aaccccgaacctgctaatagcttacagctgcttccttttaagggaataacctttggccga tgggcaccatcttgcttgcaaggtcatgcatcctcctcccttgatgtgcgcagaccatgg cgattggctctctgctcaggggatataggaggtcagccccctgctggaatcagctctgta atgcaattcatgcccaaaagcatcgcatgggatcaggcttatgcagggctctggctgaga gcccatccccacacaggtcttatctctgccttctcctgcttcctctcttccctcttctac tga >gi568815575r:8691314_8900171|GENSCAN_predicted_peptide_4|123_aa MGLVMSVLLNWGAGRFCPTGVPLAMSETQLVLGCFLLAPVDTEEETVSSCLSYIQRPCPL GLGAMQGFPQVSKHVYVHLLAVVSSCAERLASVERREIKSGGEGKRLKKSFSGTELAKPE NIA >gi568815575r:8691314_8900171|GENSCAN_predicted_CDS_4|372_bp atgggattggttatgtcagtgcttctcaattggggagcggggcgattttgccccaccggg gtccctttggcaatgtcggagacacagttggtgttgggatgctttttattagctccagtg gatacagaagaggaaacagttagttcatgccttagttacattcagaggccctgcccgctt ggtcttggagcaatgcagggcttcccccaggtttccaaacatgtttatgttcatctcttg gctgttgtctcatcttgtgcagagcggctggcgagtgttgaaagaagagaaattaagtct ggaggagaaggaaagcggttaaagaagagctttagcggaactgagctagcaaagcctgag aacattgcctga >gi568815575r:8691314_8900171|GENSCAN_predicted_peptide_5|149_aa MSEKKAPAPVRDLQIKPPSPWDRAPGGRDGYGRSFSRPKCPCLTILKRAADLPALHSSSD KGQAASSNGSLTPMYPDWETPPSRDQQTPHTGELWLASGGCPSGTKLPEEGRGSNLCYSA ASAGDTQAKRVWSEPPAKSSRSAAEGPDC >gi568815575r:8691314_8900171|GENSCAN_predicted_CDS_5|450_bp atgtctgaaaaaaaggcaccagcccccgtcagggacttacagataaaacccccatctccc tgggacagagcacctgggggaagggatggctatgggcgcagcttcagcagacctaagtgt ccctgcctgaccattcttaagagagcagcagatctcccagcactgcattcgagctctgat aagggacaagctgcctcctcaaatgggtccctgacccccatgtatcctgactgggagaca cctcccagtagggatcaacagacacctcatacaggagagctctggctggcatctggcggg tgcccctctgggacgaagcttccagaggaaggaagaggcagcaatctttgctattctgca gcctctgctggtgacacccaggcaaagagggtctggagtgaacctccagcaaagtccagc agatctgcagcagaggggcctgactgttag >gi568815575r:8691314_8900171|GENSCAN_predicted_peptide_6|217_aa MNIDVKILNKILANRIQQHIKKLIHHNQVGFIPGMQGWFNIRKSVNVIHHINRTNDKNHM NISIDAEKAFDKIQQPFMQKTLNKLGTDVIYLKIIRAIYDKPTANIILNGQKLEAFPLKT GTKQGCPLLPLLFNIVLEVLARAIRQEKEIKGIQLGKEKVKFSLFADDMIVYLENHMISA QNLLKLISNFSKVSGYKVNVQESQAFLYTNNREPNHE >gi568815575r:8691314_8900171|GENSCAN_predicted_CDS_6|654_bp atgaacattgatgtgaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccacaatcaagttggcttcatccctgggatgcaaggctggttcaac atacgcaaatcagtaaatgtaatccatcacataaacagaaccaatgataaaaaccacatg aatatctcaatagatgcagaaaaggccttcgacaaaattcaacagcccttcatgcaaaaa actctcaataagctaggcactgatgtaatatatctcaaaataataagagctatttatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaacc ggaacgaagcaaggatgccctctcttaccactcctattcaacatagtattggaagttctg gccagggcaatcaggcaagagaaagaaataaagggtattcaattaggaaaagagaaagtc aagttctctctgtttgcagatgacatgattgtatatttagaaaaccacatgatctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaagtcaatgtg caagaatcacaagcattcctatacaccaataacagagagccaaatcatgaatga >gi568815575r:8691314_8900171|GENSCAN_predicted_peptide_7|243_aa MKAELSEAKALDATMEDSSASLAPTMLFLTTFEAAPATEESLILLIAPLRPQAQRRPDGE VIPTLDVALFDWTDYEDLKPDGGPSAKKKEKPGCWPGTQVLAGSTGALSVRTHTAWCRAL RDGNTEIALYCPTAALQQLQVQNQEAILRVEKKFKLPERRACDAAGPGTPLRGPGKYMAE QGPEPGPRLSSPKIHVPASPRPKISSRFSKAFGKRGLHPGIPFLCGWNNFTVLCLPWTTA YHV >gi568815575r:8691314_8900171|GENSCAN_predicted_CDS_7|732_bp atgaaggcggagctctccgaagccaaggcgctggatgcgaccatggaggattcttccgcc agtctggcccccaccatgctcttcctcaccacctttgaggcagcacctgccacagaagag tccctgatcctgctcatcgcccccctgcggccccaggcacagcgcaggcctgacggggag gtgatacccacactggacgtggccttgttcgactggaccgattatgaagacttaaaacct gatggcgggccctctgcaaagaagaaagaaaaaccagggtgctggccgggtacccaggtg ctggccggcagcaccggcgcactctcagtccggacccacacagcctggtgtcgcgctctc cgcgatggcaataccgagattgccctctactgtccgactgcagcactacaacagcttcaa gttcaaaaccaagaggccattttgagagtggaaaagaaatttaaacttcccgaaagaagg gcatgcgatgctgctggcccagggaccccactgagaggtccgggaaagtacatggcagag caaggacccgaacctgggccaaggctgtccagccccaagatccacgttcctgcctctcca cgaccaaaaatttcttctagattctccaaggcttttggcaagagaggcttgcaccctggc attccctttctctgtggctggaataacttcactgttctctgccttccctggacaacggcg tatcatgtttaa >gi568815575r:8691314_8900171|GENSCAN_predicted_peptide_8|460_aa MGHPGALYTWRMRSHLGGIHSKEEALKTSGDRGPKEPPPTPVHSKYGWSLTERGKAWCEA LSGRLTCTRAARDPWPLLGFGVHHARAKRPQVFKAGNGPGIVAWGLGQGEALRGHRKDPA GAIQVCEAAVSSAASPRDGLDGPRRIGGDTGIFSDPSCSHRIHGPTMEPVGRKRSRKAAK AQLEAQVTAAQGATKEGSGIASNFPGQPTMEPVGRKRSRKAAKAQLEAQVRAAPAKKHTG KDPVRDECEERNPFTETREEDVTDEHGEREPFAEKDEHTGIHTMKLEHIAADIKKGLAAK REMIKIDKAAYRKTKNTIERALKKKQLKRQKRDYRHTRKLLNVLKEYIAEKQKDDEAEEA EAAAAAAEAAAAAEAAAAAAEVIVVEDEEEEEKEEEEEKEEEEEEGEEEGGGEEGEEGGG GGEGEETEEEEEEEEEEEEEEQIVGIRLSFMINLLYQCLR >gi568815575r:8691314_8900171|GENSCAN_predicted_CDS_8|1383_bp atgggtcatccaggagccttgtatacctggaggatgaggagccacctggggggaattcat tctaaggaggaggcgctgaaaacttctggggacagaggccccaaagaaccaccacccacc ccagtccacagcaaatatgggtggtcccttacggaaagagggaaggcgtggtgtgaagcg ctgagcggccggctgacctgcacacgggcagcccgagacccctggccccttcttggcttt ggagtccaccatgcacgtgctaagaggccccaggttttcaaagcaggaaatggccctggc atcgttgcctggggcctgggtcagggcgaggccctgaggggacaccgcaaggatcctgca ggggccattcaggtgtgcgaggcagctgtgagttctgcagccagccccagggacgggctg gacgggccacgaaggataggaggggacactggcatcttctccgacccctcctgctcccac aggatacacgggccaaccatggagcccgtgggcaggaagcgcagcaggaaggctgccaaa gctcagttggaagctcaagttacggccgcccagggggccacgaaagaaggttcagggatc gcctctaacttcccaggacagccaaccatggagcccgtgggcaggaagcgcagcaggaag gctgccaaagctcagttggaagctcaagttagggccgccccggcgaagaagcacacagga aaggatccagtccgtgatgaatgtgaggaaagaaacccttttacagaaacaagggaggaa gatgtaactgatgagcatggggaaagagaaccttttgctgaaaaagatgaacacacgggg attcataccatgaagctagaacatattgcagctgacattaaaaagggccttgctgcaaaa agagaaatgataaaaatagataaagcagcttacaggaaaaccaagaacacaattgaacgt gctttgaaaaaaaaacaactaaaaaggcagaaacgtgattatagacatactcggaagttg ctgaatgtccttaaagaatacatcgcagagaagcagaaagatgatgaagcagaagaagca gaagccgcagcagcagcagcagaagccgcagcagcagcagaagccgcagcagcagcagca gaagtaatagtagtagaagacgaagaggaggaagagaaggaggaggaggaggagaaagaa gaggaggaagaagaaggagaagaagaaggaggaggagaagaaggagaagaaggaggagga ggaggagaaggagaagaaacagaagaagaggaagaggaagaagaagaagaggaagaggaa gaacaaattgttggtatcaggcttagctttatgattaacctattgtatcaatgtctcagg tag >gi568815575r:8691314_8900171|GENSCAN_predicted_peptide_9|244_aa MRRGIVQGRFGSVEVLLVTTTRNILLTSSGRSQECSKTSYNEYTGQSSLPSRNYPAPNVN SAKVFVITIITITITIIIIIIIIITITIVIIPSLPSLPPSSSSSPSPLSSSSGSQIKCSL HTRRTNSSLEPHVNMTQTLQTKHAQLNFPSSSQFAPPLGVITQEKLSPPTPGLIPKHPFS QSPVLKAQKPDHSEPPALQSSDPGHSSGIPGNTPHLSPPFLTTLLRAARAFSLNGDLNHV FSSA >gi568815575r:8691314_8900171|GENSCAN_predicted_CDS_9|735_bp atgaggagggggattgttcagggaagatttggcagtgtggaggtacttttggtcaccaca actaggaatatactactgacatctagtggtagaagccaggaatgcagcaaaacatcctac aatgaatacacaggacagtcctccctcccctccaggaattatccagcccccaatgtcaat agtgccaaagtctttgtcatcaccatcatcaccatcaccatcaccatcatcatcatcatc atcatcatcatcaccatcaccattgtcatcatcccatcattaccatcattaccaccatca tcatcatcatcaccatcaccattatcatcgtcatcaggatctcagattaagtgttccctc catacccggagaaccaacagctcattggaacctcatgtgaacatgacacagacccttcaa accaaacatgctcagttgaattttccatcttcttcccaatttgcccctcctctgggtgtt ataacccaggagaaactatccccacctactccagggttgattcccaaacaccctttctct cagtcccccgtcctaaaagcacagaagcctgatcattctgaaccacctgctctgcagagc tctgatccaggccattcctctggcattcctggaaatactcctcacttgtcgcccccattt ttaaccactctcctcagagcagcacgagccttctctttaaatggtgatcttaaccatgtt ttctcttctgcctga >gi568815575r:8691314_8900171|GENSCAN_predicted_peptide_10|102_aa MAAWRRIVRLYQGLEGKVQQGLMGGVAWPTLVCFGQWYQPTDCASRASREQSLEDNARSS HRSPEDSPLANVAAKNPMLVAQKHCRALELSSSPPPGRTLDR >gi568815575r:8691314_8900171|GENSCAN_predicted_CDS_10|309_bp atggctgcctggagaaggatcgtgagactctaccaagggctagaagggaaagtgcagcag ggactgatgggaggtgttgcctggcccacacttgtctgttttggtcaatggtatcaacct acggactgtgcctctagagcctcccgggagcagtctttagaggacaatgctaggtcctct cacaggagccccgaagattctcctcttgccaatgtggctgctaaaaacccgatgcttgta gctcagaaacactgccgtgccttggaacttagttcctctccacccccagggaggaccctt gaccgatga >gi568815575r:8691314_8900171|GENSCAN_predicted_peptide_11|119_aa MSTLFPSLFLCGTETLWFNLDRPCMGETELQQQEQQHQTWLESITEKDNNLVPIGKPASE HYDDEEEEDDEDNEDSEEDLEDDEDMQDMDKMNDYNGSPDDGEVNEVDMKETNRMWTSE >gi568815575r:8691314_8900171|GENSCAN_predicted_CDS_11|360_bp atgtccactttgtttccctcactcttcctttgtgggactgagacactgtggtttaatctg gatcgaccttgtatgggagagacagagctgcagcagcaggaacagcagcatcagacctgg ctcgaaagcatcacagagaaagacaacaatctggttcctattggcaagccagcctcagag cactatgatgatgaggaagaggaggatgatgaagacaatgaggatagtgaagaggactta gaggatgatgaggacatgcaggacatggacaagatgaatgactacaatgggtcacctgat gacggagaggtcaatgaggtggacatgaaggaaacgaacaggatgtggaccagtgaatga >gi568815575r:8691314_8900171|GENSCAN_predicted_peptide_12|371_aa MKDLVVLEEYSEVGRAWRRNVSNEEDAHHGCRQVSGVNCAVELPLAVSRAVGILNRAEMS VQGSWKLQNQASAGEPSVHCCGVKQSFQNHKGTFNRGLGTEAMSQVATRGDGESQAITAE TQGFYIIEKECVGQLESTPQGKARRLLGEAKGKPDDQNHNMMHGSSIGEVQAGSSRQGEQ GESLTAGDCETVGSCVQDAKRTGSKNWSRESLESACEEEEWLGRFHSGKDRMETLLLAAV RIGARTEGALQTSLPSLEDTCITSDLGLRGKEGVMVLDLSPVSRYRAQLSIWRCNCNKNT IRTPEESKKIHANTENSCDRDAESRKSEMFKEAKEPPCLDTRKGFQTQSGTKPSALLGLQ LADSPCKSWVL >gi568815575r:8691314_8900171|GENSCAN_predicted_CDS_12|1116_bp atgaaagacttggttgtacttgaggaatattcagaagttggcagggcttggagaaggaat gtgagtaacgaggaagatgctcaccacggatgccgccaggtttctggtgtgaactgtgct gtggagctgccactcgctgtgagcagagccgttggcattctcaacagggcagagatgagt gttcaagggagctggaagttgcagaaccaagccagcgctggagaaccaagtgttcattgt tgtggagtaaagcaatctttccagaatcacaaaggaacatttaataggggcttaggaaca gaagccatgtctcaggtggccacaagaggagatggtgaatcccaggccattaccgctgag acccagggcttctatatcatagagaaggaatgtgtaggacaattggagtcaactcctcag ggaaaggcaagaaggctattgggagaggcaaaaggcaagcccgacgatcaaaatcacaac atgatgcatggaagctctatcggagaggtacaagcaggcagttccaggcagggagaacag ggagaatccctcacagctggagactgtgagacagtggggagctgtgtgcaagatgctaag aggactgggagcaagaactggtcccgtgagagtttggaatctgcttgtgaagaagaggaa tggcttggaagatttcattcaggaaaagataggatggagacattgcttttggcagcagtc agaattggggccaggaccgaaggtgctttgcaaacaagtcttccttctcttgaggacaca tgtataacctcagatttgggccttcgaggaaaggaaggggtcatggtgctggatctgagt ccagtatccaggtatagagctcagctgtcaatatggagatgcaactgcaataagaacact ataaggacaccagaggagagtaaaaaaatccatgccaacacagaaaacagctgtgacaga gatgcagagtcaaggaagagtgagatgtttaaggaagccaaggaaccaccatgtctagac acaaggaaaggctttcagacccagagtggaactaaaccatcagctctcctgggtctccag cttgcggactcgccctgcaaatcttgggttttgtaa