GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:41:49 Sequence gi568815575f:21756485_21957600 : 201116 bp : 40.28% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 167 305 139 0 1 79 63 61 0.179 1.30 1.02 Intr + 6234 6334 101 0 2 76 97 76 0.344 6.13 1.03 Intr + 14759 14937 179 0 2 52 74 145 0.493 8.22 1.04 Intr + 29107 29274 168 0 0 -5 95 144 0.145 5.02 1.05 Intr + 34924 34957 34 0 1 97 41 35 0.080 -3.42 1.06 Term + 35884 36047 164 2 2 69 47 130 0.223 4.12 1.07 PlyA + 36574 36579 6 1.05 2.00 Prom + 43167 43206 40 -5.95 2.01 Init + 47002 47122 121 0 1 107 76 47 0.908 5.80 2.02 Term + 50263 50339 77 2 2 -49 37 231 0.717 1.32 2.03 PlyA + 50540 50545 6 1.05 3.00 Prom + 73241 73280 40 -3.45 3.01 Init + 83251 83325 75 0 0 97 100 89 0.235 12.31 3.02 Intr + 86686 86834 149 0 2 70 21 70 0.246 -3.39 3.03 Intr + 88687 88900 214 1 1 106 103 93 0.490 10.40 3.04 Term + 94795 94884 90 0 0 113 46 62 0.397 1.24 3.05 PlyA + 95944 95949 6 1.05 4.00 Prom + 96586 96625 40 -6.95 4.01 Sngl + 100001 101119 1119 1 0 98 48 1335 0.998 126.97 4.02 PlyA + 101174 101179 6 1.05 5.00 Prom + 107189 107228 40 -6.75 5.01 Init + 107937 107985 49 2 1 70 58 48 0.393 -0.84 5.02 Intr + 111983 112101 119 0 2 135 84 21 0.964 5.66 5.03 Intr + 113014 113194 181 0 1 69 85 107 0.974 7.02 5.04 Intr + 116677 116796 120 2 0 97 32 76 0.688 2.45 5.05 Intr + 121558 121652 95 2 2 60 111 23 0.979 0.56 5.06 Intr + 122013 122208 196 2 1 61 109 99 0.994 7.37 5.07 Term + 125969 126171 203 0 2 100 35 114 0.719 3.87 5.08 PlyA + 126765 126770 6 1.05 6.00 Prom + 130164 130203 40 -5.95 6.01 Init + 136916 137057 142 1 1 83 49 109 0.306 6.84 6.02 Term + 141569 141729 161 0 2 108 55 109 0.979 6.72 6.03 PlyA + 141804 141809 6 1.05 7.03 PlyA - 144349 144344 6 1.05 7.02 Term - 147941 147835 107 0 2 100 48 100 0.656 4.79 7.01 Init - 160738 160660 79 1 1 64 68 44 0.188 1.27 7.00 Prom - 165085 165046 40 -5.55 8.00 Prom + 168248 168287 40 -5.55 8.01 Init + 172832 172961 130 1 1 50 98 84 0.802 4.09 8.02 Intr + 173271 173420 150 1 0 28 67 131 0.196 4.21 8.03 Intr + 179410 179545 136 2 1 49 27 85 0.049 -2.79 8.04 Intr + 183554 183836 283 2 1 52 50 211 0.866 10.20 8.05 Intr + 184316 184389 74 1 2 86 100 75 0.696 5.79 8.06 Intr + 190805 190887 83 2 2 12 84 96 0.013 -0.04 8.07 Term + 197904 198040 137 1 2 73 48 113 0.007 3.00 8.08 PlyA + 198783 198788 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:21756485_21957600|GENSCAN_predicted_peptide_1|261_aa XTSIRDVECGTHLPSSSECFIPGPRKENYTLDTHSTPTRATAFRPKRTAGDKVCPLGCML HDDIPPSPADAGTFKKTDVQGGQPAQKQNIKPPKLRALRESIARQPPPPEQAPLSMAERP IDSSHHRTLCRQCPVPAWSQKHRSNKLGLATGIKSGSSFVRLIPQPVGSDAISSNVRIEL EDTKLASATELIVCLLLKEQQNETSVLGCCQRRPHEDFISAHPPPGVCGDYVGNLDFHSH LAVMKHPSRNWSGVRGGLMKS >gi568815575f:21756485_21957600|GENSCAN_predicted_CDS_1|786_bp ntcacctctatcagggatgtagagtgcggaacacaccttccttcttcatctgaatgcttc atcccgggaccaagaaaagaaaactacaccttagatacacactccactcccaccagggcc actgcttttcgtccaaagaggacagcaggtgacaaggtttgtcccctgggatgcatgctg catgatgacatccctccttccccagcagatgctggcacctttaaaaagactgatgtgcaa ggaggccaaccagcacaaaaacagaacattaaaccaccaaagctaagagccctcagggag tccattgcacgccagccacctccaccagaacaggcaccgttatccatggctgagagaccc atagacagttcacatcacaggactctgtgcagacaatgcccagtaccagcttggagccag aagcacaggtcaaacaaactggggcttgcaactggcatcaaaagtgggagcagttttgtg agactgatccctcaacctgtgggatctgatgctatctccagtaatgtcagaattgaatta gaggacacaaaactggcgtctgctacagaattgattgtttgcttgctgctgaaagaacag caaaatgaaacatctgtgttagggtgttgtcagagaaggccacatgaggactttatttct gctcacccacctcctggtgtctgtggagactacgtggggaacctggactttcattcccat ttggcagtaatgaagcacccctcgcgcaactggagtggtgtcagaggaggcctgatgaag agttaa >gi568815575f:21756485_21957600|GENSCAN_predicted_peptide_2|65_aa MEKKRNARVGMAIKAWSHKGLIEEEKKKKTAVQLHKLDTQEEEEEEEEEEDEEEDKEEEE KKKMI >gi568815575f:21756485_21957600|GENSCAN_predicted_CDS_2|198_bp atggagaaaaagagaaatgcaagagttgggatggctatcaaagcctggtcacacaagggc ctgattgaggaggaaaaaaaaaaaaaaacagcagttcagctacacaagctagacacacag gaagaagaagaagaagaagaggaggaggaggatgaagaagaggacaaagaggaagaggag aagaagaagatgatttaa >gi568815575f:21756485_21957600|GENSCAN_predicted_peptide_3|175_aa MIPVSLVVVVVGGWTVVYLTDLVLKSSVYFKHSYEDWLENNGLSISPFHIRWQTAVFNRA FYSWGRRKARMLYQWFNFGMVFGVIAMFSSFFLLGKTLMQTLAQMMADSPSSYSSSSSSS SSSSSSSSSSSSSSSSLHNEQVLQVVTGGVLVPEKLASSKNEIAFLLFIMIEMEK >gi568815575f:21756485_21957600|GENSCAN_predicted_CDS_3|528_bp atgattccggtgtcgctggtggtggtggtggtgggtggctggactgtcgtctacctgacc gacttggtgctgaagtcatctgtctattttaaacattcttatgaagactggctggaaaac aacggactgagcatctcccctttccacataagatggcaaactgctgttttcaatcgtgcc ttttacagttggggacggcggaaagcaaggatgctttaccaatggttcaattttggaatg gtgtttggcgtaattgccatgtttagctcattttttctccttggaaaaacgctgatgcag actttggcacaaatgatggctgactctccctcttcttattcttcctcctcttcttcctct tcctcctcttcttcctcttcctcttcttcatcttcttcctcttcctcgcttcacaatgaa caggtgttacaagttgtgactggtggtgtgcttgttccagagaaactggctagttcgaaa aatgaaatagcgtttttgctttttattatgatagaaatggagaagtga >gi568815575f:21756485_21957600|GENSCAN_predicted_peptide_4|372_aa MASNEDFSITQDLEIPADIVELHDINVEPLPMEDIPTESVQYEDVDGNWIYGGHNHPPLM VLQPLFTNTGYGDHDQEMLMLQTQEEVVGYCDSDNQLGNDLEDQLALPDSIEDEHFQMTL ASLSASAASTSTSTQSRSKKPSKKPSGKSATSTEANPAGSSSSLGTRKWEQKQMQVKTLE GEFSVTMWSPNDNNDQGAVGEGQAENPPDYSEYLKGKKLPPGGLPGIDLSDPKQLAEFTK VKPKRSKGEPPKTVPCSYSGCEKMFRDYAAMRKHLHIHGPRVHVCAECGKAFLESSKLRR HQLVHTGEKPFQCTFEGCGKRFSLDFNLRTHLRIHTGDKPFVCPFDVCNRKFAQSTNLKT HILTHVKTKNNP >gi568815575f:21756485_21957600|GENSCAN_predicted_CDS_4|1119_bp atggcctccaacgaagatttctccatcacacaagacctggagatcccggcagatattgtg gagctccacgacatcaatgtggagccccttcctatggaggacattccgacggaaagcgtc cagtacgaggatgtggatggcaattggatctacggtggccacaaccatccgccattgatg gtgttgcagccgctcttcacgaacacgggctatggcgaccacgaccaggaaatgcttatg ttgcagacacaagaggaagtggtgggctattgcgactcagacaaccagctaggcaacgac ttggaggaccagttggccctcccggatagcattgaagacgagcacttccagatgaccctg gcctctctgtcggcctcggcggcatcaacatcaacatcaacccagagccgcagcaaaaag cccagcaaaaagcccagcggcaagagtgccaccagcactgaggccaacccggcaggcagc agctccagcctgggcacgaggaagtgggagcagaagcaaatgcaggtcaaaacgctggag ggtgagttttccgtgactatgtggtcccctaacgataacaatgaccaaggggcagtgggt gaaggccaggctgaaaacccacctgattattccgagtacttgaaagggaagaaacttcct cctggggggttaccaggcattgatctctcagatcctaaacagctggcagaatttactaaa gtgaagcccaaaaggtccaaaggagaacctcccaaaacagtcccttgctcttatagcggc tgcgaaaagatgttccgggattacgccgccatgagaaaacatctccacatccacgggccc agagtccacgtatgtgcagaatgtggcaaagcttttcttgagagctcaaagctgagacga caccagctggtccacaccggcgagaagccctttcagtgcacattcgaaggctgcgggaaa cgcttttcccttgatttcaatttgcgcacacacttgcgcatccacaccggcgataagccc ttcgtgtgccccttcgatgtttgcaacaggaagttcgctcagtcaaccaacctgaaaacc cacatattaacgcatgtgaagaccaaaaacaacccgtga >gi568815575f:21756485_21957600|GENSCAN_predicted_peptide_5|320_aa MGFHHVGQAGLELLTTGIWHNFVLALLGILALVLLPVILLPFYYTGVGVLITEVAEDSPA IGPRGLFVGDLVTHLQDCPVTNVQDWNECLDTIAYEPQIGYCISASTLQQLSFPVRDAVL DLNSLKVKRPFGETPRRTLSEGLEPFVPGIDLWSYLAYKRLDGSTECCNNHSLTDVCFSY RNNFNKRLHTCLPARKAVEATQVCRTNKDCKKSSSSSFCIIPSLETHTRLIKVKHPPQID MLYVGHPLHLHYTGALAIVNAVPCFALDGQWILNSFLDATLTSVIGDNDVKDLIGFFILL GGSVLLAANVTLGLWMVTAR >gi568815575f:21756485_21957600|GENSCAN_predicted_CDS_5|963_bp atggggtttcaccacgttggccaggctggtctcgaactcctgaccacaggtatctggcat aattttgtccttgcactcttgggtattttagctcttgttctcctcccagtaattctcttg ccattttactacactggagttggggtgctcatcactgaagttgctgaggactctcctgcc attggacccagaggcctttttgtgggagaccttgtcacccatctacaggattgtcctgtt actaatgtgcaagattggaatgaatgtttagataccatcgcctatgagccccaaattggt tactgtataagtgcatcaactttacagcagttaagtttcccagttagagatgcagtatta gatttaaactctttgaaggttaaacggccttttggggaaactccaagaagaacactgagt gagggcttagagccctttgtacctggaatagatctctggtcttatctggcatacaaacga ctagatggttcaactgaatgctgtaacaatcacagcctcacagatgtgtgcttttcctac agaaataattttaataagcgtttgcatacatgtcttcctgcccggaaagcagttgaagca actcaagtttgcagaaccaataaagactgtaaaaaaagctcaagttcaagtttctgtata ataccttctttggaaactcacactcgcttaataaaagtaaaacacccacctcagattgat atgttatacgtaggacatcctctgcatcttcactacacaggagctctggctattgttaat gcagtaccctgctttgctttggatggacaatggattctaaactctttcttggatgccacc cttacctcagtgattggagacaatgatgtcaaagatctaatagggtttttcatcttgctg ggtggcagtgtacttttggctgccaatgtgaccctgggactctggatggttacagcacgg taa >gi568815575f:21756485_21957600|GENSCAN_predicted_peptide_6|100_aa MDPYQSVAWGLETPVVEEFIDDLERETLVWQEKLSQVGVDAGWWSGWGVAAGPHDTSPIL EAECSKSVYKMQKPPISKKNGYASTSLSLLQARAPIRDGP >gi568815575f:21756485_21957600|GENSCAN_predicted_CDS_6|303_bp atggacccgtaccagtccgtggcctgggggttggagacccctgtagtagaagagttcata gatgatcttgagagagaaactctagtgtggcaggagaagttgagccaggttggagtggat gctggatggtggagtgggtggggagttgctgctggaccacatgacaccagtcccattctt gaagcagaatgttcaaagagtgtgtacaaaatgcagaagccacccatttccaagaaaaac ggctatgcttcaaccagcctaagtctactgcaagcaagggcacccataagagatggacct tga >gi568815575f:21756485_21957600|GENSCAN_predicted_peptide_7|61_aa MAFTATWMELEIVILGEVTQEWKTKYVWQLAQFDMVADPQTKVLSHYGGQSCQTTCDGHV A >gi568815575f:21756485_21957600|GENSCAN_predicted_CDS_7|186_bp atggcattcacagcaacctggatggaattggagattgttattctaggtgaagtaactcag gaatggaaaaccaaatatgtgtggcaactggcacagtttgacatggtagctgatccacag actaaggttctgagtcactatggcggtcagagctgccagacaacctgtgatggacatgta gcatga >gi568815575f:21756485_21957600|GENSCAN_predicted_peptide_8|330_aa MSPKCLMWSAPWAVVQSLFLFAIYERVNVSSLRLRKKDQAGEEARFENHWYREKREDVPR MEAVRSSTFTEKTWRQEEVGEGTIKLKKQGEPEAAAPFYIPASNVRGIQFLRILTNTYFL AFKMCFFIYSHIAILVDVDSHNPSVKNKPPAKAIAINRGASDGGVLLDKQLRSLQFPNSI LTTMLSSYWLLPHFILRETVDNVNAQALSIQSSNAKQTLANNGNNPNGFNCSIAWHRASP HYGSSTAQHARLHARRQSLSGLMTGSGECMLGPYSTRTESQLVEGDHLGAMILIARFRAI WKNLEVQISKPFTWLPSRLESIIVHGHLCV >gi568815575f:21756485_21957600|GENSCAN_predicted_CDS_8|993_bp atgagtccaaagtgtctgatgtggtcagctccgtgggctgtggtacagagtctgttctta ttcgctatttatgagcgtgtaaatgtctcaagtttgagattgcgcaagaaagaccaagca ggggaagaagccagatttgagaaccattggtacagggaaaagagagaagacgtgccaagg atggaagctgtgagaagctcaacatttacagaaaaaacatggagacaggaagaagttggt gaaggcaccatcaaattgaagaagcagggggaaccagaagcagctgcaccgttttacatt cccgccagcaatgtacgagggatccaatttctccgcatccttaccaacacttattttctg gcttttaaaatgtgtttctttatttactcacatatagccatccttgtggacgtggacagc cataaccccagtgtcaaaaataagccccccgcaaaggctattgccattaatagaggcgcc agtgatggaggagtcctgctagacaagcaactccggagtctgcagtttcctaattcaata ttaacaacaatgctgtcctcttactggcttctccctcattttattttgcgtgaaactgtg gataacgtaaatgcccaagcgctttctattcagtcctcaaatgcaaagcagacccttgct aataatggtaataatccgaatgggtttaactgcagcattgcatggcacagggcctcgcct cactatggcagcagcacggcacagcacgctcgacttcatgctcggcgccaaagcctttcg gggctgatgactgggagtggggaatgcatgcttgggccctactctacacggactgaatca cagttggtggagggggaccatctaggtgcaatgattttgattgcccgtttcagagccatc tggaaaaatttggaagtacagatttccaagcccttcacatggctgccttctcgtctggaa tccatcatagtgcacggccatctctgtgtttga