GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:46:50 Sequence gi568815577f:33166466_33396354 : 229889 bp : 43.99% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 1043 1038 6 1.05 1.03 Term - 25222 24661 562 0 1 -71 32 545 0.675 26.85 1.02 Intr - 28958 28846 113 2 2 77 109 96 0.946 9.78 1.01 Init - 30747 30589 159 0 0 62 38 102 0.306 2.42 1.00 Prom - 62131 62092 40 -3.56 2.00 Prom + 62850 62889 40 -2.86 2.01 Init + 75458 75512 55 1 1 76 106 3 0.895 2.42 2.02 Intr + 77208 77249 42 1 0 110 99 22 0.909 3.71 2.03 Intr + 78486 78609 124 1 1 110 73 -17 0.922 -1.36 2.04 Intr + 80253 80425 173 0 2 58 79 132 0.980 8.89 2.05 Intr + 82244 82389 146 0 2 82 115 130 0.996 15.10 2.06 Intr + 86197 86365 169 0 1 95 91 64 0.995 6.92 2.07 Intr + 88210 88442 233 2 2 37 55 148 0.923 3.89 2.08 Intr + 90721 90831 111 0 0 38 93 78 0.641 3.88 2.09 Term + 96328 97035 708 0 0 116 41 331 0.541 24.71 2.10 PlyA + 97715 97720 6 1.05 3.00 Prom + 98571 98610 40 -10.35 3.01 Init + 100001 100049 49 1 1 75 94 67 0.638 5.14 3.02 Intr + 101929 102052 124 2 1 56 98 35 0.537 0.94 3.03 Intr + 110137 110288 152 1 2 48 72 78 0.365 2.01 3.04 Intr + 113287 113453 167 2 2 64 116 99 0.951 9.98 3.05 Intr + 116629 116776 148 0 1 65 115 56 0.775 5.81 3.06 Intr + 121639 121796 158 2 2 46 92 119 0.053 7.83 3.07 Intr + 129719 129842 124 1 1 123 21 99 0.964 6.86 3.08 Intr + 132226 132469 244 2 1 25 80 169 0.531 6.36 3.09 Intr + 133479 133501 23 0 2 121 62 -26 0.262 -4.71 3.10 Intr + 135617 135732 116 0 2 68 72 74 0.629 3.97 3.11 Term + 141720 141857 138 2 0 132 41 48 0.823 2.46 3.12 PlyA + 142405 142410 6 1.05 4.00 Prom + 144794 144833 40 -5.86 4.01 Init + 158591 158666 76 1 1 49 94 190 0.501 14.95 4.02 Intr + 158757 158983 227 1 2 -12 30 182 0.671 0.10 4.03 Intr + 168253 168468 216 0 0 61 33 153 0.348 5.80 4.04 Intr + 171531 171616 86 2 2 24 102 56 0.368 -0.78 4.05 Intr + 174534 174709 176 0 2 85 116 29 0.761 4.98 4.06 Intr + 176803 176957 155 2 2 81 74 42 0.417 1.89 4.07 Term + 188851 189084 234 0 0 54 47 176 0.684 6.42 4.08 PlyA + 193328 193333 6 1.05 5.03 PlyA - 194403 194398 6 1.05 5.02 Term - 202404 202257 148 2 1 68 53 134 0.604 5.27 5.01 Intr - 216920 216707 214 0 1 70 69 130 0.248 7.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:33166466_33396354|GENSCAN_predicted_peptide_1|277_aa MSIRIDLNPAADSQVCDRIPEDCGGRVSVDQGKSLLPEKHLHGQLSYGYYDLKDSLCYIG ITWERGRATESWAPPWTYLIVILFIGVPFASNQPTNEPANQPSNQPTKQSTNQPTNQPTN QPTNQRTNEPSKQQSKQATNQPTKQPTNQPTKQPNNQPTNQPSNQATNQATNQPTNQATN QATNQASNEASNQATNEASNQATNEASNQASKQPTNQATNQPTSQASKQASKQASKQATN QPTKQPSKQPSNQPTKQPSNQPSNQRTNQPSNQSTNK >gi568815577f:33166466_33396354|GENSCAN_predicted_CDS_1|834_bp atgtccatacgcattgatcttaacccagcggcagactcacaagtgtgtgaccgcatccca gaggactgcgggggaagggtttctgtggatcagggcaaatcgctcctgccggagaagcac cttcacggtcagctctcttatggttattatgacctgaaggactctctttgctacatcggc atcacctgggaacgtggtagggctacagaatcttgggccccaccttggacctacctaatc gtcatcttattcattggtgttcccttcgccagcaaccaaccaaccaatgaaccagccaac caaccaagcaaccaaccaaccaagcaatcaacgaaccaaccaaccaaccaaccaaccaac caaccaaccaaccaacgaaccaacgaaccaagcaagcaacaaagcaagcaagcaaccaac caaccaaccaagcaaccaaccaaccaaccaaccaagcaaccaaacaaccaaccaaccaac caaccaagcaaccaagcaaccaatcaagcaaccaaccaaccaaccaaccaagcaaccaac caagcaaccaaccaagcaagcaacgaagcaagcaaccaagcaaccaatgaagcaagcaac caagcaaccaatgaagcaagcaaccaagcaagcaagcaaccaaccaaccaagcaaccaac caaccaaccagccaagcaagcaagcaagcaagcaagcaagcaagcaagcaagcaaccaac caaccaaccaagcaaccaagcaaacaaccaagcaaccagccaaccaagcaaccaagcaac caaccaagcaaccaacgaaccaaccaaccaagcaaccagtcaaccaataagtaa >gi568815577f:33166466_33396354|GENSCAN_predicted_peptide_2|586_aa MLLSQNAFIFRSLNLVLMVYISLVFGISYDSPDYTDESCTFKISLRNFRSILSWELKNHS IVPTHYTLLYTIMSKPEDLKVVKNCANTTRSFCDLTDEWRSTHEAYVTVLEGFSGNTTLF SCSHNFWLAIDMSFEPPEFEIVGFTNHINVMVKFPSIVEEELQFDLSLVIEEQSEGIVKK HKPEIKGNMSGNFTYIIDKLIPNTNYCVSVYLEHSDEQAVIKSPLKCTLLPPGQESVFNQ KIFESTYDLEAPPLRIVPPFQIEPVYILHALIDALCLPKMYKTKLHPDQGFSGSPGAVSW AIGHSYLTQTESLQMRQPPFGQGQFLMCPELVPANGFVVSLTSRMEPQTFLNFHNFLAWP FPNLPPLEAMDMVEVIYINRKKKVWDYNYDDESDSDTEAAPRTSGGGYTMHGLTVRPLGQ ASATSTESQLIDPESEEEPDLPEVDVELPTMPKDSPQQLELLSGPCERRKSPLQDPFPEE DYSSTEGSGGRITFNVDLNSVFLRVLDDEDSDDLEAPLMLSSHLEEMVDPEDPDNVQSNH LLASGEGTQPTFPSPSSEGLWSEDAPSDQSDTSESDVDLGDGYIMR >gi568815577f:33166466_33396354|GENSCAN_predicted_CDS_2|1761_bp atgcttttgagccagaatgccttcatcttcagatcacttaatttggttctcatggtgtat atcagcctcgtgtttggtatttcatatgattcgcctgattacacagatgaatcttgcact ttcaagatatcattgcgaaatttccggtccatcttatcatgggaattaaaaaaccactcc attgtaccaactcactatacattgctgtatacaatcatgagtaaaccagaagatttgaag gtggttaagaactgtgcaaataccacaagatcattttgtgacctcacagatgagtggaga agcacacacgaggcctatgtcaccgtcctagaaggattcagcgggaacacaacgttgttc agttgctcacacaatttctggctggccatagacatgtcttttgaaccaccagagtttgag attgttggttttaccaaccacattaatgtgatggtgaaatttccatctattgttgaggaa gaattacagtttgatttatctctcgtcattgaagaacagtcagagggaattgttaagaag cataaacccgaaataaaaggaaacatgagtggaaatttcacctatatcattgacaagtta attccaaacacgaactactgtgtatctgtttatttagagcacagtgatgagcaagcagta ataaagtctcccttaaaatgcaccctccttccacctggccaggaatcagtttttaatcag aaaatctttgaatctacctatgacctggaagcccccccacttcgaattgtcccacctttc cagattgaaccagtgtacatcttacatgcattgattgatgccttatgtctccctaaaatg tataaaaccaagctgcaccctgaccaagggttctcgggatctcctggggctgtgtcatgg gccattggtcactcatatttgactcagactgaatctcttcaaatgaggcagccccctttt gggcaagggcagttccttatgtgtccggagttggttcctgccaatgggttcgtggtctcg ctgacttcaagaatggagccgcagaccttcctgaattttcataactttttagcctggcca tttcctaacctgccaccgttggaagccatggatatggtggaggtcatttacatcaacaga aagaagaaagtgtgggattataattatgatgatgaaagtgatagcgatactgaggcagcg cccaggacaagtggcggtggctataccatgcatggactgactgtcaggcctctgggtcag gcctctgccacctctacagaatcccagttgatagacccggagtccgaggaggagcctgac ctgcctgaggttgatgtggagctccccacgatgccaaaggacagccctcagcagttggaa ctcttgagtgggccctgtgagaggagaaagagtccactccaggacccttttcccgaagag gactacagctccacggaggggtctgggggcagaattaccttcaatgtggacttaaactct gtgtttttgagagttcttgatgacgaggacagtgacgacttagaagcccctctgatgcta tcgtctcatctggaagagatggttgacccagaggatcctgataatgtgcaatcaaaccat ttgctggccagcggggaagggacacagccaacctttcccagcccctcttcagagggcctg tggtccgaagatgctccatctgatcaaagtgacacttctgagtcagatgttgaccttggg gatggttatataatgagatga >gi568815577f:33166466_33396354|GENSCAN_predicted_peptide_3|480_aa MAWSLGSWLGGCLLVSALGMVPPPENVRMNSVNFKNILQWESPAFAKGNLTFTAQYLRIF QDKCMNTTLTECDFSSLSKYGDHTLRVRAEFADEHSDWVNITFCPVDDTIIGPPGMQVEV LADSLHMRFLAPKIENEYETWTMKNVYNSWTYNVQYWKNGTDEKFQITPQYDFEVLRNLE PWTTYCVQVRGFLPDRNKAGEWSEPVCEQTTHDETVPSWMVAVILMASVFMVCLALLGCF ALLWCVYKKTKYAFSPRNSLPQHLKEFLGHPHHNTLLFFSFPLSDENDVFDKLSVIAEDS ESGKQNPEKRSYYPGHSTEKQKNGKYEWLGYKPEKSSCKRRLASWKLAQVNAAAVPIEKG YLEAKYIQYGGSSSLLFVVMCAGVMVLARKISFLRGSHLHEASCHIGEVTWLGPEDASSS QQRTGNRGSQSNSNRLSGINFSTSQKHFGPQIPKGTLFFILENTMPTYQNKGQIGAFASK >gi568815577f:33166466_33396354|GENSCAN_predicted_CDS_3|1443_bp atggcgtggagccttgggagctggctgggtggctgcctgctggtgtcagcattgggaatg gtaccacctcccgaaaatgtcagaatgaattctgttaatttcaagaacattctacagtgg gagtcacctgcttttgccaaagggaacctgactttcacagctcagtacctaaggatattc caagataaatgcatgaatactaccttgacggaatgtgatttctcaagtctttccaagtat ggtgaccacaccttgagagtcagggctgaatttgcagatgagcattcagactgggtaaac atcaccttctgtcctgtggatgacaccattattggaccccctggaatgcaagtagaagta cttgctgattctttacatatgcgtttcttagcccctaaaattgagaatgaatacgaaact tggactatgaagaatgtgtataactcatggacttataatgtgcaatactggaaaaacggt actgatgaaaagtttcaaattactccccagtatgactttgaggtcctcagaaacctggag ccatggacaacttattgtgttcaagttcgagggtttcttcctgatcggaacaaagctggg gaatggagtgagcctgtctgtgagcaaacaacccatgacgaaacggtcccctcctggatg gtggccgtcatcctcatggcctcggtcttcatggtctgcctggcactcctcggctgcttc gccttgctgtggtgcgtttacaagaagacaaagtacgccttctcccctaggaattctctt ccacagcacctgaaagagtttttgggccatcctcatcataacacacttctgtttttctcc tttccattgtcggatgagaatgatgtttttgacaagctaagtgtcattgcagaagactct gagagcggcaagcagaatcctgaaaaaagaagttattatccagggcacagtacagagaaa caaaagaatggaaaatatgagtggttaggatacaagcctgaaaaatcaagctgcaagcgt agattagcaagctggaagcttgcacaggtgaatgcggcagctgtgccaatagaaaaggga tacctggaagccaagtacatccaatatggaggttcctcctcccttctctttgtcgtcatg tgtgcaggtgtcatggtgctggccaggaagatttctttcctgaggggttcacacttgcat gaagcaagctgccatattggagaagtcacatggctaggacctgaggatgcctcctccagc caacaacgaacagggaatcgaggttctcaatccaacagcaacaggctctctggcatcaat ttttccacttctcagaaacactttggcccacaaattcccaaaggaaccttgttcttcatt ctggaaaacacgatgccaacatatcagaataaaggacagataggagcttttgcttcaaag tga >gi568815577f:33166466_33396354|GENSCAN_predicted_peptide_4|389_aa MMVVLLGATTLVLVAVAPWVLSAAADAQSGKPSVHFAAPKIKPDLGSQINQEKVVFWVLS CRLPVAVYGSSGAPGSHPREMAVPELCVEFDSFRESTAAPLCQVMRRVIQVCEGQLDVQT EGTGAISGYPTTQFMTQVVIQGDITSYDAVNTEGKAAEIHYTFPPLQWEMGQGKLFAKYA SNKSLISKELKQSTTKPKSIKRTGMDNWIKLSGCQNITSTKCNFSSLKLNVYEEIKLRIR AEKENTSSWYEVDSFTPFRKAQIGPPEVHLEAEDKAIVIHISPGTKDSVMWALDGLSFTY SLVIWKNSSGVEYFSEQPLKNLLLSTSEEQIEKCFIIENISTIATVEETNQTDEDHKKYS SQTSQDSGNYSNEDESESKTSEELQQDFV >gi568815577f:33166466_33396354|GENSCAN_predicted_CDS_4|1170_bp atgatggtcgtcctcctgggcgcgacgaccctagtgctcgtcgccgtggcgccatgggtg ttgtccgcagccgcagacgcccagtctgggaaaccttcggtccactttgccgcgccaaag attaaacccgacctgggctcgcaaatcaaccaggagaaagtggtgttctgggtcctctct tgccgcttgcctgtggccgtgtacgggtcctcgggagcgcccgggtcccacccccgtgaa atggcggtgccagagctttgtgtcgagtttgattctttccgggaaagtaccgcggctccg ctgtgtcaagtgatgcgcagggtgatccaggtgtgtgaggggcagctggatgtccagact gagggcactggtgccatcagtggctatcctaccactcaattcatgacccaggtggtgatc cagggtgatatcaccagttatgatgcagttaacacagaggggaaagctgctgagatacac tatactttcccacccctgcagtgggagatggggcaggggaaactatttgcaaaatatgca tccaacaagagcttaatatccaaggaactcaaacaatcaacaacaaaacccaaatccatc aaaagaactgggatggataattggataaaattgtctgggtgtcagaatattactagtacc aaatgcaacttttcttcactcaagctgaatgtttatgaagaaattaaattgcgtataaga gcagaaaaagaaaacacttcttcatggtatgaggttgactcatttacaccatttcgcaaa gctcagattggtcctccagaagtacatttagaagctgaagataaggcaatagtgatacac atctctcctggaacaaaagatagtgttatgtgggctttggatggtttaagctttacatat agcttagttatctggaaaaactcttcaggtgtagaatatttctctgaacagccattgaag aatcttctgctttcaacttctgaggaacaaatcgaaaaatgtttcataattgaaaatata agcacaattgctacagtagaagaaactaatcaaactgatgaagatcataaaaaatacagt tcccaaactagccaagattcaggaaattattctaatgaagatgaaagcgaaagtaaaaca agtgaagaactacagcaggactttgtatga >gi568815577f:33166466_33396354|GENSCAN_predicted_peptide_5|120_aa XWVTWRFLWLTASDFSINFVAKAPCTVLCMVRTVWFVSDSGAFPDAGLSVLKLGKPRVNQ DESAALPRALDKPAAGDILTTQPHLHYGLYPVPKLLSKKNEKNEDMLTIRRRRMGREEFC >gi568815577f:33166466_33396354|GENSCAN_predicted_CDS_5|363_bp ntctgggtcacgtggcgcttcctgtggctcactgcctctgacttttcaatcaactttgtg gctaaagcaccatgcactgtgctttgcatggtccggactgtctggtttgtcagcgactca ggggcttttccagatgcgggactttcagtgctgaaactgggaaagcccagggtaaaccag gatgagtcagcggccctgcctcgtgctctggataagcctgctgctggagacatcctgacc actcagccccacctgcattatggcttgtacccagttcccaagctcttgtccaagaagaat gagaagaatgaggatatgctgacaattcgaaggcggaggatgggcagagaagaattttgt tga