GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:51:37 Sequence gi568815577f:33141923_33363497 : 221575 bp : 43.83% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1965 2146 182 0 2 96 3 99 0.252 1.71 1.02 Term + 7543 7691 149 2 2 88 43 127 0.545 6.26 1.03 PlyA + 7793 7798 6 1.05 2.00 Prom + 11125 11164 40 -6.26 2.01 Init + 16905 16976 72 2 0 68 74 71 0.461 4.77 2.02 Intr + 17534 17636 103 1 1 -3 99 75 0.268 -0.75 2.03 Term + 23412 23485 74 1 2 108 54 85 0.794 5.17 2.04 PlyA + 23781 23786 6 1.05 3.04 PlyA - 23915 23910 6 1.05 3.03 Term - 49765 49204 562 0 1 -71 32 545 0.675 26.85 3.02 Intr - 53501 53389 113 2 2 77 109 96 0.947 9.78 3.01 Init - 55290 55132 159 0 0 62 38 102 0.306 2.42 3.00 Prom - 86674 86635 40 -3.56 4.00 Prom + 87393 87432 40 -2.86 4.01 Init + 100001 100055 55 1 1 76 106 3 0.895 2.42 4.02 Intr + 101751 101792 42 1 0 110 99 22 0.909 3.71 4.03 Intr + 103029 103152 124 1 1 110 73 -17 0.922 -1.36 4.04 Intr + 104796 104968 173 0 2 58 79 132 0.980 8.89 4.05 Intr + 106787 106932 146 0 2 82 115 130 0.996 15.10 4.06 Intr + 110740 110908 169 0 1 95 91 64 0.995 6.92 4.07 Intr + 112753 112985 233 2 2 37 55 148 0.923 3.89 4.08 Intr + 115264 115374 111 0 0 38 93 78 0.641 3.88 4.09 Term + 120871 121578 708 0 0 116 41 331 0.541 24.71 4.10 PlyA + 122258 122263 6 1.05 5.00 Prom + 123114 123153 40 -10.35 5.01 Init + 124544 124592 49 1 1 75 94 67 0.638 5.14 5.02 Intr + 126472 126595 124 2 1 56 98 35 0.537 0.94 5.03 Intr + 134680 134831 152 1 2 48 72 78 0.365 2.01 5.04 Intr + 137830 137996 167 2 2 64 116 99 0.951 9.98 5.05 Intr + 141172 141319 148 0 1 65 115 56 0.775 5.81 5.06 Intr + 146182 146339 158 2 2 46 92 119 0.053 7.83 5.07 Intr + 154262 154385 124 1 1 123 21 99 0.964 6.86 5.08 Intr + 156769 157012 244 2 1 25 80 169 0.531 6.36 5.09 Intr + 158022 158044 23 0 2 121 62 -26 0.262 -4.71 5.10 Intr + 160160 160275 116 0 2 68 72 74 0.629 3.97 5.11 Term + 166263 166400 138 2 0 132 41 48 0.823 2.46 5.12 PlyA + 166948 166953 6 1.05 6.00 Prom + 169337 169376 40 -5.86 6.01 Init + 183134 183209 76 1 1 49 94 190 0.501 14.95 6.02 Intr + 183300 183526 227 1 2 -12 30 182 0.671 0.10 6.03 Intr + 192796 193011 216 0 0 61 33 153 0.348 5.80 6.04 Intr + 196074 196159 86 2 2 24 102 56 0.368 -0.78 6.05 Intr + 199077 199252 176 0 2 85 116 29 0.761 4.98 6.06 Intr + 201346 201500 155 2 2 81 74 42 0.417 1.89 6.07 Term + 213394 213627 234 0 0 54 47 176 0.693 6.42 6.08 PlyA + 217871 217876 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:33141923_33363497|GENSCAN_predicted_peptide_1|110_aa XAGLVLADLDLAPLILVSLILLDPLADWICSYHGIGRSTRERARPHKYFPRSLTELGPLI ARLICTLTASNCHHTAGSPQQLENRSRAARAGSTTVQGAEAQGASGTVTI >gi568815577f:33141923_33363497|GENSCAN_predicted_CDS_1|333_bp nnggctggcttggtcttggctgatctggacttggcgcctctgatcctggtgtctctcatc ctcctggacccactggctgactggatatgttcctatcatggcattggcagaagcacaagg gagagagcccgaccccacaagtactttccaaggtccttgactgagttgggtccactaata gcccgcctcatctgcacactaactgcttctaattgtcatcatacggcaggatccccgcaa caacttgagaacagatcccgagcagcccgtgcaggctccacgacggtccagggagctgag gctcagggagcatcgggcactgtgacgatttag >gi568815577f:33141923_33363497|GENSCAN_predicted_peptide_2|82_aa MRSSRGGVLLTAPGESSAKNQHNTKHKTTTEYTGLSERTRKPLPGRKNGSGVEELLQAGF IPELELTFPEGFAFQAFQVIAN >gi568815577f:33141923_33363497|GENSCAN_predicted_CDS_2|249_bp atgaggagttcccgtggaggtgtgctgctgacagccccaggtgagagctcagccaagaat cagcacaacactaagcacaaaacaaccactgaatacactggattaagtgaacgtactaga aaaccactacctggtaggaagaatggctctggtgtggaggagttactacaagcagggttc atcccagaactggagctcacattcccagaaggctttgcttttcaggcttttcaggttatc gccaattga >gi568815577f:33141923_33363497|GENSCAN_predicted_peptide_3|277_aa MSIRIDLNPAADSQVCDRIPEDCGGRVSVDQGKSLLPEKHLHGQLSYGYYDLKDSLCYIG ITWERGRATESWAPPWTYLIVILFIGVPFASNQPTNEPANQPSNQPTKQSTNQPTNQPTN QPTNQRTNEPSKQQSKQATNQPTKQPTNQPTKQPNNQPTNQPSNQATNQATNQPTNQATN QATNQASNEASNQATNEASNQATNEASNQASKQPTNQATNQPTSQASKQASKQASKQATN QPTKQPSKQPSNQPTKQPSNQPSNQRTNQPSNQSTNK >gi568815577f:33141923_33363497|GENSCAN_predicted_CDS_3|834_bp atgtccatacgcattgatcttaacccagcggcagactcacaagtgtgtgaccgcatccca gaggactgcgggggaagggtttctgtggatcagggcaaatcgctcctgccggagaagcac cttcacggtcagctctcttatggttattatgacctgaaggactctctttgctacatcggc atcacctgggaacgtggtagggctacagaatcttgggccccaccttggacctacctaatc gtcatcttattcattggtgttcccttcgccagcaaccaaccaaccaatgaaccagccaac caaccaagcaaccaaccaaccaagcaatcaacgaaccaaccaaccaaccaaccaaccaac caaccaaccaaccaacgaaccaacgaaccaagcaagcaacaaagcaagcaagcaaccaac caaccaaccaagcaaccaaccaaccaaccaaccaagcaaccaaacaaccaaccaaccaac caaccaagcaaccaagcaaccaatcaagcaaccaaccaaccaaccaaccaagcaaccaac caagcaaccaaccaagcaagcaacgaagcaagcaaccaagcaaccaatgaagcaagcaac caagcaaccaatgaagcaagcaaccaagcaagcaagcaaccaaccaaccaagcaaccaac caaccaaccagccaagcaagcaagcaagcaagcaagcaagcaagcaagcaagcaaccaac caaccaaccaagcaaccaagcaaacaaccaagcaaccagccaaccaagcaaccaagcaac caaccaagcaaccaacgaaccaaccaaccaagcaaccagtcaaccaataagtaa >gi568815577f:33141923_33363497|GENSCAN_predicted_peptide_4|586_aa MLLSQNAFIFRSLNLVLMVYISLVFGISYDSPDYTDESCTFKISLRNFRSILSWELKNHS IVPTHYTLLYTIMSKPEDLKVVKNCANTTRSFCDLTDEWRSTHEAYVTVLEGFSGNTTLF SCSHNFWLAIDMSFEPPEFEIVGFTNHINVMVKFPSIVEEELQFDLSLVIEEQSEGIVKK HKPEIKGNMSGNFTYIIDKLIPNTNYCVSVYLEHSDEQAVIKSPLKCTLLPPGQESVFNQ KIFESTYDLEAPPLRIVPPFQIEPVYILHALIDALCLPKMYKTKLHPDQGFSGSPGAVSW AIGHSYLTQTESLQMRQPPFGQGQFLMCPELVPANGFVVSLTSRMEPQTFLNFHNFLAWP FPNLPPLEAMDMVEVIYINRKKKVWDYNYDDESDSDTEAAPRTSGGGYTMHGLTVRPLGQ ASATSTESQLIDPESEEEPDLPEVDVELPTMPKDSPQQLELLSGPCERRKSPLQDPFPEE DYSSTEGSGGRITFNVDLNSVFLRVLDDEDSDDLEAPLMLSSHLEEMVDPEDPDNVQSNH LLASGEGTQPTFPSPSSEGLWSEDAPSDQSDTSESDVDLGDGYIMR >gi568815577f:33141923_33363497|GENSCAN_predicted_CDS_4|1761_bp atgcttttgagccagaatgccttcatcttcagatcacttaatttggttctcatggtgtat atcagcctcgtgtttggtatttcatatgattcgcctgattacacagatgaatcttgcact ttcaagatatcattgcgaaatttccggtccatcttatcatgggaattaaaaaaccactcc attgtaccaactcactatacattgctgtatacaatcatgagtaaaccagaagatttgaag gtggttaagaactgtgcaaataccacaagatcattttgtgacctcacagatgagtggaga agcacacacgaggcctatgtcaccgtcctagaaggattcagcgggaacacaacgttgttc agttgctcacacaatttctggctggccatagacatgtcttttgaaccaccagagtttgag attgttggttttaccaaccacattaatgtgatggtgaaatttccatctattgttgaggaa gaattacagtttgatttatctctcgtcattgaagaacagtcagagggaattgttaagaag cataaacccgaaataaaaggaaacatgagtggaaatttcacctatatcattgacaagtta attccaaacacgaactactgtgtatctgtttatttagagcacagtgatgagcaagcagta ataaagtctcccttaaaatgcaccctccttccacctggccaggaatcagtttttaatcag aaaatctttgaatctacctatgacctggaagcccccccacttcgaattgtcccacctttc cagattgaaccagtgtacatcttacatgcattgattgatgccttatgtctccctaaaatg tataaaaccaagctgcaccctgaccaagggttctcgggatctcctggggctgtgtcatgg gccattggtcactcatatttgactcagactgaatctcttcaaatgaggcagccccctttt gggcaagggcagttccttatgtgtccggagttggttcctgccaatgggttcgtggtctcg ctgacttcaagaatggagccgcagaccttcctgaattttcataactttttagcctggcca tttcctaacctgccaccgttggaagccatggatatggtggaggtcatttacatcaacaga aagaagaaagtgtgggattataattatgatgatgaaagtgatagcgatactgaggcagcg cccaggacaagtggcggtggctataccatgcatggactgactgtcaggcctctgggtcag gcctctgccacctctacagaatcccagttgatagacccggagtccgaggaggagcctgac ctgcctgaggttgatgtggagctccccacgatgccaaaggacagccctcagcagttggaa ctcttgagtgggccctgtgagaggagaaagagtccactccaggacccttttcccgaagag gactacagctccacggaggggtctgggggcagaattaccttcaatgtggacttaaactct gtgtttttgagagttcttgatgacgaggacagtgacgacttagaagcccctctgatgcta tcgtctcatctggaagagatggttgacccagaggatcctgataatgtgcaatcaaaccat ttgctggccagcggggaagggacacagccaacctttcccagcccctcttcagagggcctg tggtccgaagatgctccatctgatcaaagtgacacttctgagtcagatgttgaccttggg gatggttatataatgagatga >gi568815577f:33141923_33363497|GENSCAN_predicted_peptide_5|480_aa MAWSLGSWLGGCLLVSALGMVPPPENVRMNSVNFKNILQWESPAFAKGNLTFTAQYLRIF QDKCMNTTLTECDFSSLSKYGDHTLRVRAEFADEHSDWVNITFCPVDDTIIGPPGMQVEV LADSLHMRFLAPKIENEYETWTMKNVYNSWTYNVQYWKNGTDEKFQITPQYDFEVLRNLE PWTTYCVQVRGFLPDRNKAGEWSEPVCEQTTHDETVPSWMVAVILMASVFMVCLALLGCF ALLWCVYKKTKYAFSPRNSLPQHLKEFLGHPHHNTLLFFSFPLSDENDVFDKLSVIAEDS ESGKQNPEKRSYYPGHSTEKQKNGKYEWLGYKPEKSSCKRRLASWKLAQVNAAAVPIEKG YLEAKYIQYGGSSSLLFVVMCAGVMVLARKISFLRGSHLHEASCHIGEVTWLGPEDASSS QQRTGNRGSQSNSNRLSGINFSTSQKHFGPQIPKGTLFFILENTMPTYQNKGQIGAFASK >gi568815577f:33141923_33363497|GENSCAN_predicted_CDS_5|1443_bp atggcgtggagccttgggagctggctgggtggctgcctgctggtgtcagcattgggaatg gtaccacctcccgaaaatgtcagaatgaattctgttaatttcaagaacattctacagtgg gagtcacctgcttttgccaaagggaacctgactttcacagctcagtacctaaggatattc caagataaatgcatgaatactaccttgacggaatgtgatttctcaagtctttccaagtat ggtgaccacaccttgagagtcagggctgaatttgcagatgagcattcagactgggtaaac atcaccttctgtcctgtggatgacaccattattggaccccctggaatgcaagtagaagta cttgctgattctttacatatgcgtttcttagcccctaaaattgagaatgaatacgaaact tggactatgaagaatgtgtataactcatggacttataatgtgcaatactggaaaaacggt actgatgaaaagtttcaaattactccccagtatgactttgaggtcctcagaaacctggag ccatggacaacttattgtgttcaagttcgagggtttcttcctgatcggaacaaagctggg gaatggagtgagcctgtctgtgagcaaacaacccatgacgaaacggtcccctcctggatg gtggccgtcatcctcatggcctcggtcttcatggtctgcctggcactcctcggctgcttc gccttgctgtggtgcgtttacaagaagacaaagtacgccttctcccctaggaattctctt ccacagcacctgaaagagtttttgggccatcctcatcataacacacttctgtttttctcc tttccattgtcggatgagaatgatgtttttgacaagctaagtgtcattgcagaagactct gagagcggcaagcagaatcctgaaaaaagaagttattatccagggcacagtacagagaaa caaaagaatggaaaatatgagtggttaggatacaagcctgaaaaatcaagctgcaagcgt agattagcaagctggaagcttgcacaggtgaatgcggcagctgtgccaatagaaaaggga tacctggaagccaagtacatccaatatggaggttcctcctcccttctctttgtcgtcatg tgtgcaggtgtcatggtgctggccaggaagatttctttcctgaggggttcacacttgcat gaagcaagctgccatattggagaagtcacatggctaggacctgaggatgcctcctccagc caacaacgaacagggaatcgaggttctcaatccaacagcaacaggctctctggcatcaat ttttccacttctcagaaacactttggcccacaaattcccaaaggaaccttgttcttcatt ctggaaaacacgatgccaacatatcagaataaaggacagataggagcttttgcttcaaag tga >gi568815577f:33141923_33363497|GENSCAN_predicted_peptide_6|389_aa MMVVLLGATTLVLVAVAPWVLSAAADAQSGKPSVHFAAPKIKPDLGSQINQEKVVFWVLS CRLPVAVYGSSGAPGSHPREMAVPELCVEFDSFRESTAAPLCQVMRRVIQVCEGQLDVQT EGTGAISGYPTTQFMTQVVIQGDITSYDAVNTEGKAAEIHYTFPPLQWEMGQGKLFAKYA SNKSLISKELKQSTTKPKSIKRTGMDNWIKLSGCQNITSTKCNFSSLKLNVYEEIKLRIR AEKENTSSWYEVDSFTPFRKAQIGPPEVHLEAEDKAIVIHISPGTKDSVMWALDGLSFTY SLVIWKNSSGVEYFSEQPLKNLLLSTSEEQIEKCFIIENISTIATVEETNQTDEDHKKYS SQTSQDSGNYSNEDESESKTSEELQQDFV >gi568815577f:33141923_33363497|GENSCAN_predicted_CDS_6|1170_bp atgatggtcgtcctcctgggcgcgacgaccctagtgctcgtcgccgtggcgccatgggtg ttgtccgcagccgcagacgcccagtctgggaaaccttcggtccactttgccgcgccaaag attaaacccgacctgggctcgcaaatcaaccaggagaaagtggtgttctgggtcctctct tgccgcttgcctgtggccgtgtacgggtcctcgggagcgcccgggtcccacccccgtgaa atggcggtgccagagctttgtgtcgagtttgattctttccgggaaagtaccgcggctccg ctgtgtcaagtgatgcgcagggtgatccaggtgtgtgaggggcagctggatgtccagact gagggcactggtgccatcagtggctatcctaccactcaattcatgacccaggtggtgatc cagggtgatatcaccagttatgatgcagttaacacagaggggaaagctgctgagatacac tatactttcccacccctgcagtgggagatggggcaggggaaactatttgcaaaatatgca tccaacaagagcttaatatccaaggaactcaaacaatcaacaacaaaacccaaatccatc aaaagaactgggatggataattggataaaattgtctgggtgtcagaatattactagtacc aaatgcaacttttcttcactcaagctgaatgtttatgaagaaattaaattgcgtataaga gcagaaaaagaaaacacttcttcatggtatgaggttgactcatttacaccatttcgcaaa gctcagattggtcctccagaagtacatttagaagctgaagataaggcaatagtgatacac atctctcctggaacaaaagatagtgttatgtgggctttggatggtttaagctttacatat agcttagttatctggaaaaactcttcaggtgtagaatatttctctgaacagccattgaag aatcttctgctttcaacttctgaggaacaaatcgaaaaatgtttcataattgaaaatata agcacaattgctacagtagaagaaactaatcaaactgatgaagatcataaaaaatacagt tcccaaactagccaagattcaggaaattattctaatgaagatgaaagcgaaagtaaaaca agtgaagaactacagcaggactttgtatga