GENSCAN 1.0 Date run: 2-Nov-116 Time: 22:27:23 Sequence gi568815593r:2647567_2851413 : 203847 bp : 44.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9542 9651 110 0 2 55 76 85 0.172 4.00 1.02 Term + 22646 22876 231 1 0 73 36 104 0.040 0.17 1.03 PlyA + 22916 22921 6 1.05 2.02 PlyA - 23983 23978 6 1.05 2.01 Sngl - 30485 30078 408 2 0 72 48 375 0.961 28.19 2.00 Prom - 30574 30535 40 -2.96 3.03 PlyA - 31088 31083 6 1.05 3.02 Term - 36433 36259 175 0 1 62 42 114 0.172 1.43 3.01 Init - 39301 39174 128 1 2 97 84 72 0.238 5.36 3.00 Prom - 46782 46743 40 -4.26 4.04 PlyA - 47170 47165 6 1.05 4.03 Term - 49931 49787 145 1 1 120 52 65 0.613 3.48 4.02 Intr - 51268 51099 170 1 2 57 44 170 0.657 8.24 4.01 Init - 53985 53863 123 0 0 66 77 66 0.627 3.47 4.00 Prom - 54136 54097 40 -10.74 5.03 PlyA - 55258 55253 6 1.05 5.02 Term - 58677 58565 113 1 2 93 44 122 0.593 7.02 5.01 Init - 61376 61256 121 2 1 77 65 107 0.407 7.65 5.00 Prom - 63140 63101 40 -5.16 6.00 Prom + 65558 65597 40 -2.66 6.01 Init + 81049 81107 59 0 2 107 31 51 0.347 0.11 6.02 Intr + 85459 85803 345 1 0 36 31 212 0.110 4.81 6.03 Intr + 87422 87486 65 2 2 66 56 59 0.251 -1.14 6.04 Intr + 91036 91214 179 2 2 77 89 36 0.301 2.24 6.05 Intr + 91259 91340 82 1 1 64 80 98 0.501 5.81 6.06 Term + 91693 92189 497 2 2 35 54 253 0.209 11.33 6.07 PlyA + 94095 94100 6 1.05 7.05 PlyA - 94253 94248 6 1.05 7.04 Term - 100050 99998 53 1 2 96 43 63 0.980 0.19 7.03 Intr - 101486 100779 708 0 0 120 102 1250 0.997 121.02 7.02 Intr - 102221 101816 406 2 1 86 105 980 0.997 93.42 7.01 Init - 103847 103599 249 2 0 92 46 533 0.998 44.96 7.00 Prom - 104334 104295 40 -13.70 8.00 Prom + 104598 104637 40 -15.01 8.01 Init + 104699 104898 200 1 2 83 83 137 0.983 9.08 8.02 Intr + 105056 105188 133 2 1 72 70 51 0.703 2.35 8.03 Intr + 107463 107734 272 2 2 58 64 110 0.869 1.94 8.04 Intr + 108901 109330 430 1 1 116 41 175 0.762 9.32 8.05 Intr + 111508 111661 154 0 1 83 57 57 0.763 1.75 8.06 Term + 111733 111845 113 2 2 41 43 114 0.714 0.92 8.07 PlyA + 112043 112048 6 1.05 9.02 PlyA - 112381 112376 6 1.05 9.01 Sngl - 117119 116841 279 2 0 71 46 177 0.674 7.24 9.00 Prom - 131082 131043 40 -2.36 10.00 Prom + 133659 133698 40 -1.96 10.01 Init + 133853 133891 39 1 0 87 99 -12 0.422 0.19 10.02 Term + 135093 135281 189 2 0 61 42 131 0.621 3.25 10.03 PlyA + 136303 136308 6 1.05 11.02 PlyA - 136531 136526 6 1.05 11.01 Sngl - 140884 140522 363 1 0 50 49 178 0.947 6.18 11.00 Prom - 146327 146288 40 -3.06 12.03 PlyA - 146449 146444 6 1.05 12.02 Term - 159246 159074 173 1 2 64 48 133 0.570 4.89 12.01 Init - 161504 161339 166 2 1 72 33 168 0.486 9.49 12.00 Prom - 172601 172562 40 -5.36 13.05 PlyA - 173330 173325 6 1.05 13.04 Term - 175108 174996 113 2 2 102 48 74 0.091 3.52 13.03 Intr - 187171 187053 119 0 2 11 106 91 0.003 3.31 13.02 Intr - 190754 190690 65 2 2 111 42 56 0.002 0.82 13.01 Intr - 195622 195547 76 0 1 95 37 106 0.101 5.72 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 187072 187161 90 0 0 99 47 105 0.956 5.22 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_1|113_aa XGPSYGGSQMTVLMAFLCFILLVTSRIHSIPIRGPPQRPQHKAPGLPRPLPKPQPTSELL DQQGPAFQPSPQMHLEAPRKYSLNGGRTNDGAALTAPKPANFQIHFWRPKELH >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_1|342_bp ngcggtccctcctatggtggctcccagatgaccgtcctgatggcgtttctgtgcttcatc ctccttgtcaccagtagaatccactccatccccatcaggggccctccccagcgcccgcag cacaaagctccaggcctgccccgacccctgcccaagccccaacctacgtcagagcttctg gaccagcagggccctgctttccagccttctccgcagatgcacttagaggctcccaggaaa tactccctgaacgggggtcgaacaaatgacggagctgctctcacggctcccaaaccagcg aattttcagattcatttctggagacctaaagaacttcattaa >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_2|135_aa MPNITNHHQHHEHHKDHQDHQASPICSTSPTITNITNITKHHQPSPISPRSPRPPSITNM LNITNHLQHHQASPICPTSPTITDITKHHQYAQHHQQLPRSPICPTSPAITNIMNITKVM KTTKHHQYAQHHQPS >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_2|408_bp atgcccaacatcaccaaccatcaccaacatcatgaacatcacaaagatcaccaagatcat caagcatcaccaatatgctcaacatcaccaaccatcaccaacatcactaacatcaccaag catcaccaaccatcaccaatatcaccaagatcaccgagaccaccaagcatcactaatatg ctcaacatcaccaaccatctccaacatcaccaagcatcaccaatatgcccaacatcacca accatcaccgatattaccaaacatcaccaatatgcccaacatcaccaacaattaccaaga tcaccaatatgcccaacatcaccagccatcaccaacatcatgaacatcacaaaggtcatg aagaccacaaagcatcaccaatatgctcaacatcaccaaccatcatga >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_3|100_aa MGAIARGGRLATPHAVTSAAGARNYYHTRSPPCTAISPLARRGCTGSRVDRRPQAIVGFW GTSVLRRLAAAGRAAGTLHGLPDSRWCRPNSPCWLLLPLW >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_3|303_bp atgggggcaatcgctcgtggagggcgcttggctaccccccatgcagtcacctcagcagca ggtgcacgtaattactaccacacacgttcaccaccctgcaccgccatttcccctttagca agacggggatgtactgggtcccgagtggaccgccgtccgcaggccatcgtgggtttctgg gggacatcagtgctgagacgcctggcagcagcagggagggctgcaggcaccctccacggg cttccagactcacggtggtgccgcccaaactcgccctgttggctgctgctgcccttgtgg tga >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_4|145_aa MSNKRNGEMWPYDLSGWEVCGSDSSQISSAEECEAEGRQSKEGSLSGILLAYQEGQRCSD DAQASGSVAGVLIALQVRADFQAGSCPLMDKVMSKETQPVFINAMESSMCGSRGKKVWFT ARCSSELSPSNSRPVKAWLRPEPSG >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_4|438_bp atgtcaaacaagagaaatggggagatgtggccttacgatttatctggctgggaagtctgc ggtagtgacagcagtcagatcagttctgcagaggagtgcgaggctgaaggcagacagagc aaggaagggtcactaagtggcatcttactcgcatatcaggaaggtcagcggtgctcggac gacgcacaggcctccggctccgtggctggtgtcctcatcgctctccaggttcgggcagac tttcaggctgggtcttgccctctgatggacaaggtcatgtccaaagagacccagccggtc ttcatcaatgcaatggaatcatccatgtgtggctcccgaggaaagaaggtgtggtttact gccaggtgctcttcagagctgagtccctctaacagcaggcctgttaaggcgtggctgcgg cctgaaccctccggctga >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_5|77_aa MGRADVKDGSDPAETLAVSKGVAARSGDPMDQSRALEAKSGCRQEQDKPVHLYTRVPGAS PMGSLCGPGHLSCSCRL >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_5|234_bp atgggcagggccgatgtcaaggatggcagtgatcctgctgagacactcgctgtctccaag ggtgtggcagcaaggagtggagacccgatggaccagagccgtgccctggaagcaaagagt ggatgccgccaggagcaagacaaacctgtgcacttgtacacccgagttcctggtgccagc ccgatgggctccctctgtgggccggggcacctgagctgcagctgccggctctga >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_6|408_aa MTREVRGLAGCPVLFPHLLSCLHKPLTPVYTSLLHLTVYTSFLHLTVYTSLLHLTVYSSL LHLTVYTSLQQMVFYEGPMQLVVYAKPQKEEVFSDDSNAPTGIPAGMKEESGAEGNWQNW GFPGGSSSLGLQSSGVILDEQRLCNPSSPSIKDVRMWLYSDPESNGRKDGAHHRAQDPKG YKAPTPPHLTLTTPSRTLGPPSSCASGTVSPPAGGQSPSQGRQRSRRVKPMGLDAEAGER KERGASPEVSPARQPRTSFSVRVLGAAVPGVRPGVPSVSTARLGAGAADRGYTARADRTL PPERWKSAGQAGLGLSSSAAERSRNTVCRQDPKQERPGGPGRRGRLAVAGTRALGTSPGP GGPEPPPPSPPDPAIVLNPGRALSVSRRSAALLCSPCARGSQSPELRA >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_6|1227_bp atgaccagggaagtcagaggccttgctgggtgccccgtgctcttcccacatcttctcagc tgtctacacaagcctcttacacctgtgtacacaagcctcctgcacctgactgtctacaca agcttcctgcacctgactgtctatacaagcctcctgcacctgactgtctactcaagcctc ctgcacctgactgtctacacaagcctccagcagatggttttctacgaggggcccatgcag ctagtagtctacgcaaagccccagaaggaagaggtgttttcagatgatagcaatgcaccc actggaatcccagcagggatgaaggaggagagtggagcagaaggaaactggcaaaactgg ggttttccaggggggtccagcagccttgggttgcagtcttcaggggtgattctggatgaa caaaggctgtgtaacccttcatctccatccataaaagatgttcggatgtggctgtacagt gaccccgagtccaatggcagaaaggatggggctcatcacagggcccaggaccccaaaggt tacaaagcacccacccctccccatctgaccctcaccacccccagccgcacgcttgggcct ccctcctcctgtgcctccgggaccgtctccccacctgcgggaggccagagccccagtcag ggccgccagaggtcgcggagggtgaagccaatgggcctggacgccgaggccggggagagg aaggagcgaggggcgtcccccgaggtgtcgcctgcaaggcagccgcgtacgtcgttctca gtgcgggtcctgggggcggcggtgccgggggtacggcctggggtcccgtcggtgtccact gcccgcctgggcgctggggccgcagatcggggctacacggctcgcgccgaccgaacgctg ccgccagagcggtggaaaagcgctgggcaggccggcctggggctttcgagctctgccgcc gagcgcagcagaaatacggtgtgcaggcaggacccgaaacaggagagaccggggggaccg gggaggagaggaaggctggctgtggcaggaacccgggcgctcggcacctcccctggtccc gggggccctgaacccccgccgccgtcgccgccggaccccgcgatcgtcctgaacccgggc cgggcgctttcggtttcccggcgctctgcggcgctgctttgttcaccctgcgcgcgcggt tcccagagtccagagctgcgggcctga >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_7|471_aa MSYPQGYLYQAPGSLALYSCPAYGASALAAPRSEELARSASGSAFSPYPGSAAFTAQAAT GFGSPLQYSADAAAAAAGFPSYMGAPYDAHTTGMTGAISYHPYGSAAYPYQLNDPAYRKN ATRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTW APRNKSEDEDEDEGDATRSKDESPDKAQEGTETSAEDEGISLHVDSLTDHSCSAESDGEK LPCRAGDPLCESGSECKDKYDDLEDDEDDDEEGERGLAPPKPVTSSPLTGLEAPLLSPPP EAAPRGGRKTPQGSRTSPGAPPPASKPKLWSLAEIATSDLKQPSLGPGCGPPGLPAAAAP ASTGAPPGGSPYPASPLLGRPLYYTSPFYGNYTNYGNLNAALQGQGLLRYNSAAAAPGEA LHTAPKAASDAGKAGAHPLESHYRSPGGGYEPKKDASEGCTVVGGGVQPYL >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_7|1416_bp atgtcctacccgcagggctacctgtaccaggcgcccggctcgctggcgctctactcgtgc ccggcctacggcgcgtcggctttggcggctccgcgcagcgaggagctggcgcgctcggcg tcgggctcggcgttcagcccctacccgggctcggcggccttcacggcgcaggcggccacc ggcttcgggagcccgctgcagtactcggccgacgccgccgccgccgccgccggcttcccg tcctacatgggcgcaccctacgacgcgcacaccaccggcatgaccggcgccatcagctac cacccgtacggcagcgcggcctacccgtaccagctcaacgaccccgcgtaccgcaagaac gccacgcgggacgccacggccactctcaaggcctggctcaacgagcaccgcaagaacccc taccccaccaagggcgagaagatcatgctagccatcatcaccaagatgaccctcacccag gtctccacctggttcgccaacgcgcgccggcgcctcaagaaggagaacaagatgacctgg gccccgagaaacaaaagcgaagatgaggacgaggacgagggcgacgctaccagaagcaag gacgagagtcccgacaaggcgcaggagggcacggagacctcggcagaggacgaagggatc agcctgcacgtggactcgctcacggatcactcgtgctcggccgagtcggacggggagaag cttccgtgccgcgccggggaccccctgtgcgaatcgggctcggagtgcaaggacaagtat gacgacctggaggacgacgaggacgacgacgaggagggcgagcggggcctggcgccgccc aagcccgtgacctcgtcgccgcttaccggcttggaggcgccgctgctgagccccccgccc gaggccgcgccccgcggtggccgcaagacgccccagggcagccggacgtctccgggcgcg ccgccccccgccagcaagcccaagctgtggtcgctggccgagatcgccacgtcggacctc aagcagccgagcctgggcccgggctgcgggccaccggggctgcccgcggccgccgcgccg gcctcaaccggggcaccgccaggaggctcgccctaccctgcctcgccgctgctgggccgc cccctctactacacgtcgcccttctacggcaactacacaaactacgggaacttgaacgcg gcgctgcagggccagggtctcctgcggtacaactctgcggccgcggcccccggcgaggcc ctgcacaccgcgccaaaggcggccagcgacgcgggcaaggcgggcgcgcacccgctcgag tcccactaccggtccccgggcggcggctacgagcccaagaaagatgccagcgagggctgc accgtggttggcgggggcgtccagccctacctatag >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_8|433_aa MVAPAARVFLRAVRAALTSTVPDLLCLLARGSPRGLASGRLPLAVHSAQHGPGSGAPWLR IARRALRFVLSKHWGDDCYLTNRLWQDLKPPSHVENGQELRLAPPVQWALQPKNLERVYV DTQVSASGDFLRGRARGTAGPGGSGSGSPRGRGRLRRPGRSPGAAPSSVSRGRKEATQAR SRARGRRGGAVARVCRPESRQRSHLSPPGSGAPRGQASARLKTEVRHAGPLVGGSHPAYR GNGASRQLRLQPGRERSAGSLQRLESEEKGHSLEMDRAEFLKGIAVERDPLQNQQRPGSL SSSTGDGSKMKCLKLRRRFTFAHTRTAGPDPHRDADTRALALPGERRTQGLSGQYAPSRN NQFCLVLFTVARRSWIFANTNLLRAPAQWEAIGGGLVLETGLMGGVEVMDAEWPPVLEVD LMGGVEAMDAKSS >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_8|1302_bp atggtggcgcccgcggctcgggtcttcctccgggcagtgcgcgcggctctcacttccacg gtcccggacctgctctgcctcctggcccgaggctccccgcgtggcctcgcgtctggtcgc ctacccctcgcggtccactccgcccagcatggacctggatccggggcgccttggttgcga attgccaggagggccctgagatttgtgttgtcaaaacactggggggatgattgttacctg acaaaccgcctctggcaggacctgaagcctcccagtcacgtcgagaacgggcaggagctc aggttggcgccgccggtgcagtgggcattacagccaaagaatttagaacgagtctacgtg gacacgcaagtttcagcgagcggggatttcctgcgaggccgagcgcgaggaacagccggc cccggaggctcgggctcggggtctccccggggacgcggccgactgagaaggccggggcgc tcgccgggcgcggcgccgtcctccgtgtcgcggggacggaaggaggcgacgcaggccagg agtcgtgcgcgcgggcggcgagggggcgctgtggcccgtgtctgccggcccgagtcccgg caacggtcgcacctctccccgcccgggtcgggagcccctcgtggccaggcgtccgcgcgg ctcaagaccgaggtccgccacgctgggccgctcgtgggcggctctcacccggcataccgt ggaaacggcgcgtcccgccagctgcggctccagcctgggagggagcgcagcgcggggagc ctgcagcgtctggagagtgaggaaaagggacattccttggaaatggacagagccgagttc cttaaagggatcgcagttgaaagagaccctcttcaaaatcagcaacgacctggcagcctt agttcctcaacaggagatggttcgaagatgaaatgtttgaaactccgccgccgtttcacc tttgcacacacgcgcacggcaggcccagatccgcacagagacgcagacacgcgcgcgctc gcgcttcccggagagcgtagaacacagggcctttctggccagtatgctccttccagaaac aaccagttctgcctagttctcttcactgtggccagaagaagctggatctttgcaaataca aatctgctgagagccccagcacagtgggaggccatcgggggtggccttgtattagagaca ggcctcatgggaggtgtcgaggtcatggatgcagagtggcctccagtattagaggtggac ctcatgggaggtgtcgaggccatggatgcaaagtcctcatga >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_9|92_aa MTDVLPREAAESVPFRAAPKLAAMTMSLYWTSVLAWLEKKMATFPMGLRSFLCFWLAWWP FQSPGPKHSSCFFHFETSLELIDHRELLPTRV >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_9|279_bp atgacggatgtgctaccgagagaagcagcagaaagcgtgcctttccgtgctgcccccaaa ctcgcagctatgaccatgagcttgtactggacttctgtcctggcctggctggaaaagaaa atggcgacattccccatggggctgcgcagcttcctctgtttctggctggcttggtggccc ttccagagccctggtcccaaacactcctcctgcttctttcacttcgaaacaagcttggaa ctcattgaccacagggagctgctgccgacccgagtgtga >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_10|75_aa MGRKRKLCVSAAQGNGPQQEMILNCHMTGELDNLVPYCTTDYPALNTNIILITLIPEILL YANGTPGFHLSTWLP >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_10|228_bp atggggaggaaaaggaaattatgtgtttctgctgcccagggcaatggtcctcaacaggag atgattttgaattgtcatatgactggggaactggataacttagtaccctactgcacaaca gactaccctgccctgaataccaacatcatcctcatcaccctcatcccagaaatactgctg tatgctaatggaacgcctggctttcacctgagtacatggttgccttga >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_11|120_aa MQQKTCYWGGGKKRVQMYPWLEKWEEDPETLLIIIKGAGGKAFCDGSDIRVISEAEKAKR KIAPVFFREEYMLNNAFGSCQKPYVALIHGITMGGRVGRQSMGSFKWLQKSVFLPCQKPQ >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_11|363_bp atgcagcagaagacgtgctattggggggggggaaagaaacgtgtgcagatgtatccatgg ctagagaagtgggaagaagatcctgaaactctcctgatcattataaagggagccggagga aaggctttctgtgatgggagtgatattagagtcatctcggaggctgaaaaggcaaaacgg aagatagcgccagttttcttcagagaagaatatatgctgaacaatgcttttggttcttgc cagaaaccttatgttgcacttattcatggaattacaatgggtgggagagttggtcgtcag tccatgggcagcttcaagtggctacagaaaagtgtctttttgccatgccagaaaccgcag tag >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_12|112_aa MERQPTKEHTLIELQVHVCRQPKWLSSRKGGILVTEYVVAASTVTPSYRRLWSEKSEEDR LLLTHGPPHLITNVLLQPQASLHAQQPTPVSLSEAACGPPEPAVLELTVPGN >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_12|339_bp atggaaagacagccaaccaaggagcacacgctaatagaattgcaggtgcatgtctgccgc caaccaaaatggctttcctcccgcaaaggcggcattctggtaacagaatacgtggtggca gctagcaccgtcacgccctcataccgcaggctctggtctgagaagagtgaggaggaccgt ctgctgctcacccatggtcctccacacctcatcaccaacgtactgctccagccccaggct tctcttcacgcccagcagccaacccctgtgtcactctcagaggctgcctgtgggcctcca gagccagctgtgcttgagctgacagtgcctgggaattga >gi568815593r:2647567_2851413|GENSCAN_predicted_peptide_13|124_aa XGYRYAPDGPTFVYQWEIANACSWHMSKSFFTRIRADVCAMPVQPAELHSVSTHEFLRSH IETIAVTTLQFSSYGVQVHNHLRTEVEREEPSAHFGPRYQCILHLCPPQLQQQNGTMTSQ QEHS >gi568815593r:2647567_2851413|GENSCAN_predicted_CDS_13|375_bp nnggggtaccgctatgctcctgacggtcccactttcgtctaccaatgggaaatagccaat gcttgctcgtggcatatgagtaagagcttcttcaccagaatccgagcagatgtgtgtgcc atgcctgtacaacctgcagagctccattcggtttcaacgcatgaattcctgaggagtcac attgaaaccatcgcagttactacactccagtttagcagttatggtgtacaagtccacaac cacctaagaacagaagttgaaagggaagagccatcagctcattttgggccccgttaccag tgcatccttcacttgtgcccaccacagctgcagcaacagaatggcacaatgacatctcag caggaacattcctga