GENSCAN 1.0	Date run: 3-Nov-116	Time: 23:28:30

Sequence gi568815575f:21743168_21982652 : 239485 bp : 40.44% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   9979  10112  134  0  2   77  103    49 0.855   4.96
 1.02 Intr +  11260  11354   95  1  2   80   81    37 0.665   0.89
 1.03 Intr +  13484  13622  139  0  1   79   63    61 0.341   1.30
 1.04 Intr +  19551  19651  101  0  2   76   97    76 0.301   6.13
 1.05 Intr +  28076  28254  179  0  2   52   74   145 0.466   8.22
 1.06 Intr +  42424  42591  168  0  0   -5   95   144 0.145   5.02
 1.07 Intr +  48241  48274   34  0  1   97   41    35 0.080  -3.42
 1.08 Term +  49201  49364  164  2  2   69   47   130 0.223   4.12
 1.09 PlyA +  49891  49896    6                               1.05

 2.00 Prom +  56484  56523   40                              -5.95
 2.01 Init +  60319  60439  121  0  1  107   76    47 0.908   5.80
 2.02 Term +  63580  63656   77  2  2  -49   37   231 0.717   1.32
 2.03 PlyA +  63857  63862    6                               1.05

 3.00 Prom +  86558  86597   40                              -3.45
 3.01 Init +  96568  96642   75  0  0   97  100    89 0.235  12.31
 3.02 Intr + 100003 100151  149  0  2   70   21    70 0.246  -3.39
 3.03 Intr + 102004 102217  214  1  1  106  103    93 0.490  10.40
 3.04 Term + 108112 108201   90  0  0  113   46    62 0.397   1.24
 3.05 PlyA + 109261 109266    6                               1.05

 4.00 Prom + 109903 109942   40                              -6.95
 4.01 Sngl + 113318 114436 1119  1  0   98   48  1335 0.998 126.97
 4.02 PlyA + 114491 114496    6                               1.05

 5.00 Prom + 120506 120545   40                              -6.75
 5.01 Init + 121254 121302   49  2  1   70   58    48 0.393  -0.84
 5.02 Intr + 125300 125418  119  0  2  135   84    21 0.964   5.66
 5.03 Intr + 126331 126511  181  0  1   69   85   107 0.974   7.02
 5.04 Intr + 129994 130113  120  2  0   97   32    76 0.688   2.45
 5.05 Intr + 134875 134969   95  2  2   60  111    23 0.979   0.56
 5.06 Intr + 135330 135525  196  2  1   61  109    99 0.994   7.37
 5.07 Term + 139286 139488  203  0  2  100   35   114 0.719   3.87
 5.08 PlyA + 140082 140087    6                               1.05

 6.00 Prom + 143481 143520   40                              -5.95
 6.01 Init + 150233 150374  142  1  1   83   49   109 0.306   6.84
 6.02 Term + 154886 155046  161  0  2  108   55   109 0.979   6.72
 6.03 PlyA + 155121 155126    6                               1.05

 7.03 PlyA - 157666 157661    6                               1.05
 7.02 Term - 161258 161152  107  0  2  100   48   100 0.656   4.79
 7.01 Init - 174055 173977   79  1  1   64   68    44 0.188   1.27
 7.00 Prom - 178402 178363   40                              -5.55

 8.00 Prom + 181565 181604   40                              -5.55
 8.01 Init + 186149 186278  130  1  1   50   98    84 0.802   4.09
 8.02 Intr + 186588 186737  150  1  0   28   67   131 0.196   4.21
 8.03 Intr + 192727 192862  136  2  1   49   27    85 0.049  -2.79
 8.04 Intr + 196871 197153  283  2  1   52   50   211 0.866  10.20
 8.05 Intr + 197633 197706   74  1  2   86  100    75 0.698   5.79
 8.06 Intr + 204122 204204   83  2  2   12   84    96 0.018  -0.04
 8.07 Term + 211221 211357  137  1  2   73   48   113 0.017   3.00
 8.08 PlyA + 212100 212105    6                               1.05

 9.00 Prom + 215298 215337   40                              -5.85
 9.01 Init + 216652 216794  143  0  2   64   86   142 0.634  11.34
 9.02 Intr + 223946 224149  204  2  0   51   99   109 0.242   5.69
 9.03 Intr + 224999 225100  102  2  0    1   97   131 0.186   3.67
 9.04 Intr + 233894 234069  176  2  2   94   83   181 0.906  16.86
 9.05 Intr + 234793 234882   90  2  0  103   34   142 0.602   9.35

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:21743168_21982652|GENSCAN_predicted_peptide_1|337_aa MQSGILAKFIGPSLVVCSMTCFQRLLYILPLKNPFSLRICISRDSLSFQSYEDTLLPNAS GEGTLEHQDKSFRRRNVTSIRDVECGTHLPSSSECFIPGPRKENYTLDTHSTPTRATAFR PKRTAGDKVCPLGCMLHDDIPPSPADAGTFKKTDVQGGQPAQKQNIKPPKLRALRESIAR QPPPPEQAPLSMAERPIDSSHHRTLCRQCPVPAWSQKHRSNKLGLATGIKSGSSFVRLIP QPVGSDAISSNVRIELEDTKLASATELIVCLLLKEQQNETSVLGCCQRRPHEDFISAHPP PGVCGDYVGNLDFHSHLAVMKHPSRNWSGVRGGLMKS >gi568815575f:21743168_21982652|GENSCAN_predicted_CDS_1|1014_bp atgcagagtggaatccttgccaaattcataggcccttccttggtggtttgttcaatgaca tgtttccagaggttgctctacatcctgccacttaagaaccccttcagtttaaggatctgt attagtcgtgacagtctcagtttccaaagttacgaagacactctccttcctaatgcctcg ggtgagggaacactagagcaccaggataaatcattcaggaggagaaatgtcacctctatc agggatgtagagtgcggaacacaccttccttcttcatctgaatgcttcatcccgggacca agaaaagaaaactacaccttagatacacactccactcccaccagggccactgcttttcgt ccaaagaggacagcaggtgacaaggtttgtcccctgggatgcatgctgcatgatgacatc cctccttccccagcagatgctggcacctttaaaaagactgatgtgcaaggaggccaacca gcacaaaaacagaacattaaaccaccaaagctaagagccctcagggagtccattgcacgc cagccacctccaccagaacaggcaccgttatccatggctgagagacccatagacagttca catcacaggactctgtgcagacaatgcccagtaccagcttggagccagaagcacaggtca aacaaactggggcttgcaactggcatcaaaagtgggagcagttttgtgagactgatccct caacctgtgggatctgatgctatctccagtaatgtcagaattgaattagaggacacaaaa ctggcgtctgctacagaattgattgtttgcttgctgctgaaagaacagcaaaatgaaaca tctgtgttagggtgttgtcagagaaggccacatgaggactttatttctgctcacccacct cctggtgtctgtggagactacgtggggaacctggactttcattcccatttggcagtaatg aagcacccctcgcgcaactggagtggtgtcagaggaggcctgatgaagagttaa >gi568815575f:21743168_21982652|GENSCAN_predicted_peptide_2|65_aa MEKKRNARVGMAIKAWSHKGLIEEEKKKKTAVQLHKLDTQEEEEEEEEEEDEEEDKEEEE KKKMI >gi568815575f:21743168_21982652|GENSCAN_predicted_CDS_2|198_bp atggagaaaaagagaaatgcaagagttgggatggctatcaaagcctggtcacacaagggc ctgattgaggaggaaaaaaaaaaaaaaacagcagttcagctacacaagctagacacacag 
gaagaagaagaagaagaagaggaggaggaggatgaagaagaggacaaagaggaagaggag aagaagaagatgatttaa >gi568815575f:21743168_21982652|GENSCAN_predicted_peptide_3|175_aa MIPVSLVVVVVGGWTVVYLTDLVLKSSVYFKHSYEDWLENNGLSISPFHIRWQTAVFNRA FYSWGRRKARMLYQWFNFGMVFGVIAMFSSFFLLGKTLMQTLAQMMADSPSSYSSSSSSS SSSSSSSSSSSSSSSSLHNEQVLQVVTGGVLVPEKLASSKNEIAFLLFIMIEMEK >gi568815575f:21743168_21982652|GENSCAN_predicted_CDS_3|528_bp atgattccggtgtcgctggtggtggtggtggtgggtggctggactgtcgtctacctgacc gacttggtgctgaagtcatctgtctattttaaacattcttatgaagactggctggaaaac aacggactgagcatctcccctttccacataagatggcaaactgctgttttcaatcgtgcc ttttacagttggggacggcggaaagcaaggatgctttaccaatggttcaattttggaatg gtgtttggcgtaattgccatgtttagctcattttttctccttggaaaaacgctgatgcag actttggcacaaatgatggctgactctccctcttcttattcttcctcctcttcttcctct tcctcctcttcttcctcttcctcttcttcatcttcttcctcttcctcgcttcacaatgaa caggtgttacaagttgtgactggtggtgtgcttgttccagagaaactggctagttcgaaa aatgaaatagcgtttttgctttttattatgatagaaatggagaagtga >gi568815575f:21743168_21982652|GENSCAN_predicted_peptide_4|372_aa MASNEDFSITQDLEIPADIVELHDINVEPLPMEDIPTESVQYEDVDGNWIYGGHNHPPLM VLQPLFTNTGYGDHDQEMLMLQTQEEVVGYCDSDNQLGNDLEDQLALPDSIEDEHFQMTL ASLSASAASTSTSTQSRSKKPSKKPSGKSATSTEANPAGSSSSLGTRKWEQKQMQVKTLE GEFSVTMWSPNDNNDQGAVGEGQAENPPDYSEYLKGKKLPPGGLPGIDLSDPKQLAEFTK VKPKRSKGEPPKTVPCSYSGCEKMFRDYAAMRKHLHIHGPRVHVCAECGKAFLESSKLRR HQLVHTGEKPFQCTFEGCGKRFSLDFNLRTHLRIHTGDKPFVCPFDVCNRKFAQSTNLKT HILTHVKTKNNP >gi568815575f:21743168_21982652|GENSCAN_predicted_CDS_4|1119_bp atggcctccaacgaagatttctccatcacacaagacctggagatcccggcagatattgtg gagctccacgacatcaatgtggagccccttcctatggaggacattccgacggaaagcgtc cagtacgaggatgtggatggcaattggatctacggtggccacaaccatccgccattgatg gtgttgcagccgctcttcacgaacacgggctatggcgaccacgaccaggaaatgcttatg ttgcagacacaagaggaagtggtgggctattgcgactcagacaaccagctaggcaacgac ttggaggaccagttggccctcccggatagcattgaagacgagcacttccagatgaccctg gcctctctgtcggcctcggcggcatcaacatcaacatcaacccagagccgcagcaaaaag cccagcaaaaagcccagcggcaagagtgccaccagcactgaggccaacccggcaggcagc agctccagcctgggcacgaggaagtgggagcagaagcaaatgcaggtcaaaacgctggag 
ggtgagttttccgtgactatgtggtcccctaacgataacaatgaccaaggggcagtgggt gaaggccaggctgaaaacccacctgattattccgagtacttgaaagggaagaaacttcct cctggggggttaccaggcattgatctctcagatcctaaacagctggcagaatttactaaa gtgaagcccaaaaggtccaaaggagaacctcccaaaacagtcccttgctcttatagcggc tgcgaaaagatgttccgggattacgccgccatgagaaaacatctccacatccacgggccc agagtccacgtatgtgcagaatgtggcaaagcttttcttgagagctcaaagctgagacga caccagctggtccacaccggcgagaagccctttcagtgcacattcgaaggctgcgggaaa cgcttttcccttgatttcaatttgcgcacacacttgcgcatccacaccggcgataagccc ttcgtgtgccccttcgatgtttgcaacaggaagttcgctcagtcaaccaacctgaaaacc cacatattaacgcatgtgaagaccaaaaacaacccgtga >gi568815575f:21743168_21982652|GENSCAN_predicted_peptide_5|320_aa MGFHHVGQAGLELLTTGIWHNFVLALLGILALVLLPVILLPFYYTGVGVLITEVAEDSPA IGPRGLFVGDLVTHLQDCPVTNVQDWNECLDTIAYEPQIGYCISASTLQQLSFPVRDAVL DLNSLKVKRPFGETPRRTLSEGLEPFVPGIDLWSYLAYKRLDGSTECCNNHSLTDVCFSY RNNFNKRLHTCLPARKAVEATQVCRTNKDCKKSSSSSFCIIPSLETHTRLIKVKHPPQID MLYVGHPLHLHYTGALAIVNAVPCFALDGQWILNSFLDATLTSVIGDNDVKDLIGFFILL GGSVLLAANVTLGLWMVTAR >gi568815575f:21743168_21982652|GENSCAN_predicted_CDS_5|963_bp atggggtttcaccacgttggccaggctggtctcgaactcctgaccacaggtatctggcat aattttgtccttgcactcttgggtattttagctcttgttctcctcccagtaattctcttg ccattttactacactggagttggggtgctcatcactgaagttgctgaggactctcctgcc attggacccagaggcctttttgtgggagaccttgtcacccatctacaggattgtcctgtt actaatgtgcaagattggaatgaatgtttagataccatcgcctatgagccccaaattggt tactgtataagtgcatcaactttacagcagttaagtttcccagttagagatgcagtatta gatttaaactctttgaaggttaaacggccttttggggaaactccaagaagaacactgagt gagggcttagagccctttgtacctggaatagatctctggtcttatctggcatacaaacga ctagatggttcaactgaatgctgtaacaatcacagcctcacagatgtgtgcttttcctac agaaataattttaataagcgtttgcatacatgtcttcctgcccggaaagcagttgaagca actcaagtttgcagaaccaataaagactgtaaaaaaagctcaagttcaagtttctgtata ataccttctttggaaactcacactcgcttaataaaagtaaaacacccacctcagattgat atgttatacgtaggacatcctctgcatcttcactacacaggagctctggctattgttaat gcagtaccctgctttgctttggatggacaatggattctaaactctttcttggatgccacc cttacctcagtgattggagacaatgatgtcaaagatctaatagggtttttcatcttgctg 
ggtggcagtgtacttttggctgccaatgtgaccctgggactctggatggttacagcacgg taa >gi568815575f:21743168_21982652|GENSCAN_predicted_peptide_6|100_aa MDPYQSVAWGLETPVVEEFIDDLERETLVWQEKLSQVGVDAGWWSGWGVAAGPHDTSPIL EAECSKSVYKMQKPPISKKNGYASTSLSLLQARAPIRDGP >gi568815575f:21743168_21982652|GENSCAN_predicted_CDS_6|303_bp atggacccgtaccagtccgtggcctgggggttggagacccctgtagtagaagagttcata gatgatcttgagagagaaactctagtgtggcaggagaagttgagccaggttggagtggat gctggatggtggagtgggtggggagttgctgctggaccacatgacaccagtcccattctt gaagcagaatgttcaaagagtgtgtacaaaatgcagaagccacccatttccaagaaaaac ggctatgcttcaaccagcctaagtctactgcaagcaagggcacccataagagatggacct tga >gi568815575f:21743168_21982652|GENSCAN_predicted_peptide_7|61_aa MAFTATWMELEIVILGEVTQEWKTKYVWQLAQFDMVADPQTKVLSHYGGQSCQTTCDGHV A >gi568815575f:21743168_21982652|GENSCAN_predicted_CDS_7|186_bp atggcattcacagcaacctggatggaattggagattgttattctaggtgaagtaactcag gaatggaaaaccaaatatgtgtggcaactggcacagtttgacatggtagctgatccacag actaaggttctgagtcactatggcggtcagagctgccagacaacctgtgatggacatgta gcatga >gi568815575f:21743168_21982652|GENSCAN_predicted_peptide_8|330_aa MSPKCLMWSAPWAVVQSLFLFAIYERVNVSSLRLRKKDQAGEEARFENHWYREKREDVPR MEAVRSSTFTEKTWRQEEVGEGTIKLKKQGEPEAAAPFYIPASNVRGIQFLRILTNTYFL AFKMCFFIYSHIAILVDVDSHNPSVKNKPPAKAIAINRGASDGGVLLDKQLRSLQFPNSI LTTMLSSYWLLPHFILRETVDNVNAQALSIQSSNAKQTLANNGNNPNGFNCSIAWHRASP HYGSSTAQHARLHARRQSLSGLMTGSGECMLGPYSTRTESQLVEGDHLGAMILIARFRAI WKNLEVQISKPFTWLPSRLESIIVHGHLCV >gi568815575f:21743168_21982652|GENSCAN_predicted_CDS_8|993_bp atgagtccaaagtgtctgatgtggtcagctccgtgggctgtggtacagagtctgttctta ttcgctatttatgagcgtgtaaatgtctcaagtttgagattgcgcaagaaagaccaagca ggggaagaagccagatttgagaaccattggtacagggaaaagagagaagacgtgccaagg atggaagctgtgagaagctcaacatttacagaaaaaacatggagacaggaagaagttggt gaaggcaccatcaaattgaagaagcagggggaaccagaagcagctgcaccgttttacatt cccgccagcaatgtacgagggatccaatttctccgcatccttaccaacacttattttctg gcttttaaaatgtgtttctttatttactcacatatagccatccttgtggacgtggacagc cataaccccagtgtcaaaaataagccccccgcaaaggctattgccattaatagaggcgcc 
agtgatggaggagtcctgctagacaagcaactccggagtctgcagtttcctaattcaata ttaacaacaatgctgtcctcttactggcttctccctcattttattttgcgtgaaactgtg gataacgtaaatgcccaagcgctttctattcagtcctcaaatgcaaagcagacccttgct aataatggtaataatccgaatgggtttaactgcagcattgcatggcacagggcctcgcct cactatggcagcagcacggcacagcacgctcgacttcatgctcggcgccaaagcctttcg gggctgatgactgggagtggggaatgcatgcttgggccctactctacacggactgaatca cagttggtggagggggaccatctaggtgcaatgattttgattgcccgtttcagagccatc tggaaaaatttggaagtacagatttccaagcccttcacatggctgccttctcgtctggaa tccatcatagtgcacggccatctctgtgtttga >gi568815575f:21743168_21982652|GENSCAN_predicted_peptide_9|239_aa MPAFVLPLAGELQTGSIASQLAHHLPVERVQIPERSRQEGAGYKPDQSSVLKLCLIGLPS ERFFLTILPSSPCSPADGETILKGLQSIFQEQGMAESVHTWQDHGYLATYTNKNGRMASS SAAGPQGQPEQSLKQRFAIFLAPRAWYFCKLPPIVRGGAIDRYWPTADGRLVEYDIDEVV YDEDSPYQNIKILHSKQFGNILILSGDVNLAESDLAYTRAIMGSGKEDYTGKDVLILGX >gi568815575f:21743168_21982652|GENSCAN_predicted_CDS_9|717_bp atgcctgcgtttgtgcttccccttgcgggggagttgcagactggctctattgccagccag ctcgcacaccatcttcccgtggagagagtgcagatccctgaaaggagcaggcaagaaggg gccgggtataagcctgaccaaagctcagtcctcaaactgtgcctcattggccttccttct gaacgtttctttctgaccatcttgccttcttccccttgttctccagctgatggtgagacc attctaaaaggcctccagtccattttccaggagcaggggatggcggagtcggtgcacacc tggcaggaccatggctatttagcaacctacacaaacaagaacggcaggatggccagcagc agtgctgcagggccccaggggcagcccgaacagagcctcaagcagcgctttgccatcttc ctggctcctagagcctggtatttctgtaaattaccacccatagtgcgaggaggagccatc gacagatactggcccaccgccgacgggcgcctggttgaatatgacatagatgaagtggta tatgacgaagattcaccttatcaaaatataaaaattctacactcgaagcagtttggaaat attctcatccttagtggggatgttaatttggcagagagtgatttggcatatacccgggcc atcatgggcagtggcaaagaagattacactggcaaagatgtactcattctgggagnn