GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:58:54 Sequence gi568815581f:36838198_37043128 : 204931 bp : 45.90% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 18702 18841 140 1 2 97 41 113 0.416 5.63 1.02 PlyA + 19021 19026 6 1.05 2.07 PlyA - 22595 22590 6 1.05 2.06 Term - 26195 25965 231 2 0 55 40 180 0.487 6.37 2.05 Intr - 28633 28587 47 2 2 138 76 11 0.897 3.03 2.04 Intr - 31937 31892 46 2 1 114 76 33 0.297 2.68 2.03 Intr - 39591 39508 84 0 0 97 16 75 0.197 1.22 2.02 Intr - 44681 44510 172 1 1 102 42 24 0.280 -0.85 2.01 Init - 46552 46329 224 1 2 90 60 92 0.157 4.02 2.00 Prom - 51668 51629 40 -8.56 3.00 Prom + 52825 52864 40 -6.16 3.01 Init + 53723 53861 139 1 1 84 110 18 0.342 3.90 3.02 Intr + 58232 58280 49 0 1 74 100 52 0.537 2.74 3.03 Intr + 62825 63023 199 2 1 66 56 154 0.679 9.35 3.04 Intr + 69147 69192 46 2 1 120 64 36 0.644 2.38 3.05 Intr + 82067 82153 87 0 0 76 55 81 0.230 3.54 3.06 Intr + 83398 83504 107 2 2 39 70 17 0.062 -5.07 3.07 Intr + 85561 85668 108 0 0 56 72 69 0.203 2.58 3.08 Intr + 90116 90194 79 1 1 76 77 82 0.250 5.02 3.09 Term + 94327 94412 86 2 2 124 43 55 0.044 2.42 3.10 PlyA + 95700 95705 6 1.05 4.00 Prom + 99438 99477 40 -5.76 4.01 Init + 100001 100170 170 1 2 75 117 208 0.988 21.41 4.02 Intr + 102093 102319 227 0 2 124 76 275 0.932 27.53 4.03 Intr + 102413 102690 278 0 2 82 71 608 0.987 55.74 4.04 Intr + 104003 104168 166 1 1 88 90 340 0.994 33.73 4.05 Term + 104555 104934 380 0 2 112 44 598 0.998 52.75 4.06 PlyA + 106202 106207 6 1.05 5.04 PlyA - 106297 106292 6 1.05 5.03 Term - 107877 107677 201 0 0 68 43 99 0.764 0.79 5.02 Intr - 108331 108225 107 2 2 98 80 73 0.256 7.43 5.01 Init - 109477 109432 46 1 1 55 59 16 0.301 -3.74 5.00 Prom - 110018 109979 40 -4.56 6.00 Prom + 110361 110400 40 -2.56 6.01 Init + 110929 111019 91 0 1 100 100 124 0.999 13.52 6.02 Intr + 112017 112208 192 1 0 82 105 57 0.978 6.26 6.03 Intr + 114689 115099 411 0 0 126 58 456 0.999 40.56 6.04 Intr + 115573 115710 138 2 0 78 55 93 0.924 5.44 6.05 Intr + 148420 148534 115 2 1 122 116 14 0.997 7.11 6.06 Intr + 150322 150529 208 1 1 70 12 265 0.999 16.18 6.07 Intr + 151050 151214 165 2 0 97 111 120 0.999 15.36 6.08 Intr + 152577 152660 84 2 0 62 94 125 0.979 10.52 6.09 Intr + 180808 180875 68 0 2 123 86 105 0.947 11.50 6.10 Intr + 182737 182817 81 1 0 73 109 28 0.685 2.15 6.11 Intr + 193417 193488 72 1 0 92 109 36 0.529 4.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:36838198_37043128|GENSCAN_predicted_peptide_1|46_aa XQAASWSPLLVCSAQHRLITHCAYDEAERLFGKTATLIPNSPQGRS >gi568815581f:36838198_37043128|GENSCAN_predicted_CDS_1|141_bp ngacaggcagcttcatggtccccgctgctcgtgtgcagcgcacaacaccgcctcataact cactgtgcttatgatgaagcggagcggttgtttgggaagacagccactctgatacccaac agcccccagggccgttcctga >gi568815581f:36838198_37043128|GENSCAN_predicted_peptide_2|267_aa MATNRPQLPKCPVPSRTSLLQPPLLLRLTTPHPSYWRGSGPWWQPAEGPAADLDATHIPS AAEPGPNGGYRRDSRGIPTHSSSPGSKSPPPDAHLLQELNLHFASIVLQTSLEENVTVVW CSHLYLSSSYKAQPCEVGPIIPIGQGGARQMLSKGNDLLELPWGCSRTETVLAVPGPPRA APELRPCRLCQWKDLKCTDKDTAVKDDVAEPPGSTASRPQVCDREQLNFIITLSPMVFFI VFFTTGTKGSGLILADWRTSLVELTAL >gi568815581f:36838198_37043128|GENSCAN_predicted_CDS_2|804_bp atggcgacaaatcggccccaactgcccaaatgtcccgtcccatctcggacgtctctcctg cagcccccactgctcctccgtctgacaacccctcacccatcatactggagggggagcggg ccctggtggcagccagccgagggccctgccgctgacttagacgccacacatatcccttct gcagcggaaccaggccccaacgggggctaccgccgagactccaggggaatacctactcat tcttcaagccccggttcaaagtcgcctcctcctgatgcccacctgctgcaggaattgaac ctccactttgcatctatagtgcttcagaccagccttgaggagaatgtcactgtggtgtgg tgtagccatctatacttgtcctcctcctacaaggctcaaccctgtgaagttggtcctatc atccctattggccaaggaggagcccgtcagatgctcagcaagggcaatgacttgcttgag ctcccctggggctgctctagaactgagaccgtgctggctgtgccaggtccccccagggct gctccagaactgagaccgtgccggctgtgccagtggaaggaccttaaatgcacagacaag gacacagcagtaaaggatgatgtggccgaaccccccggttccacggcgagcagaccccag gtgtgtgatcgggagcagttaaatttcatcattaccctttctccgatggtgttttttata gtcttttttactactggcactaaaggttctggactcatcctggcggactggaggacatca ctcgtcgagctgactgctctgtga >gi568815581f:36838198_37043128|GENSCAN_predicted_peptide_3|299_aa MAEGDKEGVLEETMPKLRPDTWKPARWRSRVKEGSVQPSQRMSMAAGQDPFWNEGLMTYY QTSLPGPLGQHQDMNGREQTHRVYGHMDTRTHMHDCCGVHQHVNVCTHLYRTSAHTHVSV YMHTGPFAHVPKADLKGLSGKCGHACMAVDTVCGIAPRDLRPGGGLCFQSWELALSYYTK QELKRAPLATTSAKPHLPQSPQVWRWIWKRSGPQAAAQAVLSQFCLLHATVDERQSSLAG VVTEQGKSNLEVRPEELLRNVSTGELVGVEGAQRLPPKIIHVLSLLRGAELTRNWNLHA >gi568815581f:36838198_37043128|GENSCAN_predicted_CDS_3|900_bp atggcagagggggacaaggaaggcgttctggaggaaactatgcccaagctgaggcctgac acgtggaagccagccaggtggagaagcagagtgaaagagggaagtgtgcaaccctcccag agaatgtccatggcagcagggcaggaccccttctggaatgagggtcttatgacctactac cagacaagtttgcctgggcctctaggccagcaccaggacatgaatggccgggagcaaaca catcgcgtgtatggacacatggacacacggacacacatgcacgactgctgcggagtccac caacatgtgaatgtctgtacacatctgtacagaacaagtgcgcacacccatgtgtctgtg tacatgcacacgggtccattcgcacatgtgcccaaagcagatctgaaaggcttgagtgga aaatgtgggcatgcctgcatggccgtggacacagtctgcggcattgcaccaagggacctt cgtccagggggagggctgtgtttccagagctgggagctggccctgagctattacaccaaa caggagctcaagagggctccgcttgccaccacttccgccaagcctcaccttccacagagc ccccaggtctggaggtggatctggaagagaagcggcccgcaggcggcagctcaggctgtg ttgtctcagttttgccttctgcacgctactgtggatgagaggcagtcttctctggcaggt gtggtcacggagcaggggaaatcaaatcttgaagtgagaccggaagagctgctgcggaac gtgagcacgggagagctggtgggggtagaaggcgctcagcgtctccctcccaagattatc cacgtcctgtctttgctccgcggcgcggagctgactcgcaactggaatctccacgcttga >gi568815581f:36838198_37043128|GENSCAN_predicted_peptide_4|406_aa MVHCAGCKRPILDRFLLNVLDRAWHVKCVQCCECKCNLTEKCFSREGKLYCKNDFFRCFG TKCAGCAQGISPSDLVRRARSKVFHLNCFTCMMCNKQLSTGEELYIIDENKFVCKEDYLS NSSVAKENSLHSATTGSDPSLSPDSQDPSQDDAKDSESANVSDKEAGSNENDDQNLGAKR RGPRTTIKAKQLETLKAAFAATPKPTRHIREQLAQETGLNMRVIQVWFQNRRSKERRMKQ LSALGARRHAFFRSPRRMRPLVDRLEPGELIPNGPFSFYGDYQSEYYGPGGNYDFFPQGP PSSQAQTPVDLPFVPSSGPSGTPLGGLEHPLPGHHPSSEAQRFTDILAHPPGDSPSPEPS LPGPLHSMSAEVFGPSPPFSSLSVNGGASYGNHLSHPPEMNEAAVW >gi568815581f:36838198_37043128|GENSCAN_predicted_CDS_4|1221_bp atggttcactgtgccggctgcaaaaggcccatcctggaccgctttctcttgaacgtgctg gacagggcctggcacgtcaagtgcgtccagtgctgtgaatgtaaatgcaacctgaccgag aagtgcttctccagggaaggcaaactctactgcaagaacgacttcttccggtgtttcggt accaaatgcgcaggctgcgctcagggcatctcccctagcgacctggtgcggagagcgcgg agcaaagtgtttcacctgaactgcttcacctgcatgatgtgtaacaagcagctctccact ggcgaggaactctacatcatcgacgagaataagttcgtctgcaaagaggattacctaagt aacagcagtgttgccaaagagaacagccttcactcggccaccacgggcagtgaccccagt ttgtctccggattcccaagacccgtcgcaggacgacgccaaggactcggagagcgccaac gtgtcggacaaggaagcgggtagcaacgagaatgacgaccagaacctgggcgccaagcgg cggggaccgcgcaccaccatcaaagccaagcagctggagacgctgaaggccgccttcgct gctacacccaagcccacccgccacatccgcgagcagctggcgcaggagaccggcctcaac atgcgcgtcattcaggtctggttccagaaccggcgctccaaggagcggaggatgaagcag ctgagcgccctgggcgcccggcgccacgccttcttccgcagtccgcgccggatgcggccg ctggtggaccgcctggagccgggcgagctcatccccaatggtcccttctccttctacgga gattaccagagcgagtactacgggcccgggggcaactacgacttcttcccgcaaggcccc ccgtcctcgcaggcccagacaccagtggacctacccttcgtgccgtcatctgggccgtcc gggacgcccctgggtggcctggagcacccgctgccgggccaccacccgtcgagcgaggcg cagcggtttaccgacatcctggcgcacccacccggggactcgcccagccccgagcccagc ctgcccgggcctctgcactccatgtcggccgaggtcttcggacccagcccgcccttctcg tcgctgtcggtcaacggtggggcgagctacggaaaccacctgtcccacccccccgaaatg aacgaggcggccgtgtggtag >gi568815581f:36838198_37043128|GENSCAN_predicted_peptide_5|117_aa MKTTRAAEGKENLSEGASPGNAYPGSPQVILPQVPHLEQLTSGSQPGEVGWDSESRDEGR DCATTTLLEESNPHRVPNSKTLLIKYPDCTIKLLNNSAALIGKCSFPVSAVHLNECL >gi568815581f:36838198_37043128|GENSCAN_predicted_CDS_5|354_bp atgaagacaaccagggctgcagaggggaaggagaacctgtctgagggtgccagccctggc aatgcctatcctggctctccacaggtaatacttccccaagtgcctcacctagagcagctt accagcggcagccagcccggagaagttgggtgggattctgaatcccgcgacgaggggaga gactgcgccacaaccacgttgttagaagaatcgaatccacatcgtgtgcccaactcgaaa acgcttcttatcaaataccccgattgcacaataaagctgctaaataattcagcagccttg ataggaaaatgcagcttcccagtgagcgcagttcatttaaatgaatgtctttaa >gi568815581f:36838198_37043128|GENSCAN_predicted_peptide_6|542_aa MAGPQPLALQLEQLLNPRPSEADPEADPEEATAARVIDRFDEGEDGEGDFLVVGSIRKLA SASLLDTDKRYCGKTTSRKAWNEDHWEQTLPGSSDEEISDEEGSGDEDSEGLGLEEYDED DLGAAEEQECGDHRESKKSRSHSAKTPGFSVQSISDFEKFTKGMDDLGSSEEEEDEESGM EEGDDAEDSQGESEEDRAGDRNSEDDGVVMTFSSVKVSEEVEKGRAVKNQIALWDQLLEG RIKLQKALLTTNQLPQPDVFPLFKDKGGPEFSSALKNSHKALKALLRSLVGLQEELLFQY PDTRYLVDGTKPNAGSEEISSEDDELVEEKKQQRRRVPAKRKLEMEDYPSFMAKRFADFT VYRNRTLQKWHDKTKLASGKLGKASGFGAFERSILTQIDHILMDKERLLRRTQTKRSVYR VLGKPEPAAQPVPESLPGEPEILPQAPANAHLKDLDEEIFDDDDFYHQLLRELIERKTSS LDPNDQVAMGRQWLAIQKLRSKIHKKVDRKASKGRKLRFHVLSKLLSFMAPIDHTTMNDD AS >gi568815581f:36838198_37043128|GENSCAN_predicted_CDS_6|1626_bp atggcggggccgcagcccctggcgctgcaactggaacagttgttgaacccgcgaccaagc gaggcggaccctgaagcggaccccgaggaagccactgctgccagggtgattgacaggttt gatgaaggggaagatggggaaggtgatttcctagtagtgggtagcattagaaaactggca tcagcctccctcttggacacggacaaaaggtattgcggcaaaaccacctctagaaaagca tggaatgaagaccattgggagcagactctgccaggatcgtctgatgaggaaatatctgat gaggaagggtctggagatgaagattcagagggactgggtctggaggaatatgatgaggac gacctgggtgctgctgaggaacaggagtgtggtgatcacagggagagcaagaagagcaga agccactctgcaaaaacaccgggcttcagtgtccagagtatcagtgactttgagaaattt accaagggaatggatgaccttgggagcagtgaggaggaggaagacgaagagagtggcatg gaagaaggggatgacgcggaagactcccaaggcgagagtgaggaagacagggctggagat agaaacagtgaggatgatggtgtggtgatgaccttctctagtgtcaaagtttctgaggaa gtggagaaaggaagagccgtgaagaaccagatagcactgtgggaccagctcttggaagga aggatcaaactacaaaaagctctgttgaccaccaaccagcttcctcaaccagatgttttc ccattgttcaaggacaaaggtggcccagaattttccagtgccctgaaaaatagtcacaag gcacttaaagcattgttgaggtcattggtaggtcttcaggaagagttgcttttccagtac ccagacactagatatctagtagatgggacaaagcccaatgcgggaagtgaggagatttct agtgaagatgatgagctggtagaagagaagaagcagcaacgaagaagggtccctgcaaag aggaagctggagatggaggactatcccagcttcatggcaaagcgctttgccgactttaca gtctacaggaaccgcacacttcagaaatggcacgataagaccaaactggcttctggaaaa ctggggaaggcaagtggttttggtgcctttgaacgctcaatcttgactcagatcgaccat attctgatggacaaagagagattacttcgaaggacacagaccaagcgctctgtctatcga gttcttggcaaacctgagccagcagctcagcctgtcccagagagtttgccaggggaaccg gagatccttcctcaagcccctgctaatgctcatctgaaggacttggatgaagaaatcttt gatgatgatgacttttaccaccagctccttcgagaactcatagaacggaagaccagctcc ttggatcccaacgatcaggtggccatgggaaggcagtggcttgcaatccagaagttacga agcaaaatccacaaaaaagtagataggaaagccagcaaaggcaggaaacttcggtttcat gtccttagcaagctactgagtttcatggcacctattgaccatactacaatgaatgatgat gccagn