GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:15:58 Sequence gi568815584f:75178983_75381421 : 202439 bp : 46.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9728 9947 220 2 1 96 22 128 0.029 5.30 1.02 Intr + 28620 28749 130 2 1 27 88 91 0.127 3.27 1.03 Intr + 41435 41525 91 0 1 112 85 63 0.955 7.45 1.04 Intr + 43259 43442 184 2 1 54 82 124 0.638 8.19 1.05 Intr + 50036 50072 37 1 1 75 71 71 0.358 1.94 1.06 Term + 65703 65875 173 1 2 88 37 75 0.183 0.39 1.07 PlyA + 66295 66300 6 1.05 2.00 Prom + 83049 83088 40 -2.56 2.01 Init + 93191 93390 200 1 2 91 116 6 0.493 2.10 2.02 Intr + 94499 94562 64 2 1 74 82 41 0.702 0.82 2.03 Term + 98392 98826 435 0 0 112 47 201 0.944 13.69 2.04 PlyA + 98997 99002 6 1.05 3.00 Prom + 99807 99846 40 -4.16 3.01 Init + 100001 100141 141 1 0 98 110 184 0.962 21.73 3.02 Intr + 100895 101146 252 1 0 104 94 139 0.970 13.73 3.03 Intr + 101578 101685 108 0 0 50 78 156 0.999 11.28 3.04 Term + 101801 102442 642 1 0 87 55 814 0.999 72.17 3.05 PlyA + 102947 102952 6 1.05 4.03 PlyA - 103076 103071 6 1.05 4.02 Term - 113382 113124 259 2 1 71 45 349 0.633 23.92 4.01 Init - 115331 115189 143 2 2 68 -52 187 0.628 0.31 4.00 Prom - 118314 118275 40 -3.26 5.04 PlyA - 118325 118320 6 1.05 5.03 Term - 129046 128821 226 0 1 70 47 158 0.703 6.15 5.02 Intr - 138405 138211 195 2 0 91 75 32 0.393 0.73 5.01 Init - 146221 146130 92 1 2 89 57 101 0.726 7.06 5.00 Prom - 156102 156063 40 -2.76 6.00 Prom + 158777 158816 40 -5.86 6.01 Init + 169438 169596 159 0 0 39 62 117 0.090 4.02 6.02 Intr + 176107 176206 100 0 1 86 78 49 0.166 3.38 6.03 Intr + 176424 176449 26 1 2 107 78 14 0.035 0.04 6.04 Intr + 185149 185243 95 0 2 89 77 19 0.019 -0.34 6.05 Intr + 196465 196665 201 1 0 82 36 346 0.990 27.20 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 41115 41202 88 2 1 63 96 47 0.957 3.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:75178983_75381421|GENSCAN_predicted_peptide_1|278_aa XKSSWHVDKGSINNLSKDALLQHLVFQNRNQCCQFYRWPTSAFWMTASKDSSTCIPKAED TTMGKKTILISKPEAVTGKALAKSSEVETFCCGGGEKGIPVHKKKGDTGLAFEKQGRGRG NTLAVYFGAVGSVVNEPQDGPQDVFLPCPTCGEYQKYKNYPTSEWVNRVLPGEYDTPRRA REVHSLPRTAPQTQTEAYGGRVHSRKRKWGRYSGSQNAEVEDMRVITICWRPREAQLRNE WAAILKVICIQESFPSQELHVLNGKVTLSGVKDDSQLK >gi568815584f:75178983_75381421|GENSCAN_predicted_CDS_1|837_bp nnaaagtcatcgtggcacgtggacaaaggaagtattaataacctttctaaagacgcactt ctccaacatttggtgtttcaaaaccgcaaccagtgctgtcagttctacagatggccaact tctgcattttggatgacggcttccaaagatagctcaacttgtatccctaaagcagaagac acaactatggggaagaagaccatcctcatttctaagccagaggcagtgacaggcaaagcc ttagccaaatcatcggaagtggaaacattctgctgcggaggtggggaaaagggaatccct gtgcataagaagaaaggagacacagggctggcttttgagaaacagggacgaggaagagga aacaccctggctgtttatttcggggctgtaggatctgttgtgaatgagccccaggatggg ccgcaggatgtctttcttccctgccccacctgcggggagtaccaaaagtacaagaattac ccaaccagtgagtgggtcaatcgggttctccctggggagtatgacactccccgacgagca agggaagtacactcactccccaggacggctccgcagacgcagaccgaagcttatggtggg agggtgcattctaggaagaggaaatgggggagatacagcggcagccagaatgcagaggtg gaagacatgagggttattactatttgctggaggcctagggaagctcaactccgaaatgaa tgggctgccatcctgaaagtcatctgtatccaggaaagttttcctagccaagagctccac gttcttaacgggaaagtgacattgagtggtgtcaaagatgatagtcagttaaaatag >gi568815584f:75178983_75381421|GENSCAN_predicted_peptide_2|232_aa MDGEDRRGFLEKVTFEQMLVRRKSQLVKVCEEEGRVGTEAQRQKQLGIFEGEKGQCSWSG VQESGSRFSDVDGVPLEFQQLSPPELQKVPTAGPDPRARRVEKGRPGLCGDMRLRALEVA SALGLSPGSVTPAVLRSRAGDWGAWHTLEHLASPKPRVPGRGAAPGVPAVEVFRPGAAGH CPSSPRPPPSKSALENHPLRAPVSTASWRDRTSPQRRFEFPAGSTPRLVIPN >gi568815584f:75178983_75381421|GENSCAN_predicted_CDS_2|699_bp atggatggggaagacaggaggggcttcctggagaaggtgacatttgagcagatgttggtc aggaggaagagtcaacttgtcaaggtctgtgaggaagagggaagagtgggtacagaggcc cagaggcagaagcagcttggcatatttgagggagagaaaggccaatgtagctggagtgga gtgcaggaaagtggttcaaggtttagtgacgtcgacggggttcctttggaatttcagcag ctaagtccgccggagctgcagaaggtgcccaccgcgggccccgacccccgggcccgaaga gtggagaagggaagaccggggctgtgcggggacatgcgtcttcgcgccctggaggtggcc agcgcgctggggctgagccccggcagcgtgaccccggctgtcctacgcagcagggcagga gattggggggcgtggcacactctggagcaccttgcctccccaaagccccgtgttccagga cgtggagccgctcctggggtcccagcagtcgaggtattccgcccaggcgcagctggacac tgtccttccagcccccgtcctccaccctccaagtccgcgctggaaaatcacccgctgcgg gctcccgtaagcacagcttcctggcgggaccgaaccagccctcagcgcagatttgagttc cccgcaggaagcacaccccgccttgtcatcccgaactga >gi568815584f:75178983_75381421|GENSCAN_predicted_peptide_3|380_aa MMFSGFNADYEASSSRCSSASPAGDSLSYYHSPADSFSSMGSPVNAQDFCTDLAVSSANF IPTVTAISTSPDLQWLVQPALVSSVAPSQTRAPHPFGVPAPSAGAYSRAGVVKTMTGGRA QSIGRRGKVEQLSPEEEEKRRIRRERNKMAAAKCRNRRRELTDTLQAETDQLEDEKSALQ TEIANLLKEKEKLEFILAAHRPACKIPDDLGFPEEMSVASLDLTGGLPEVATPESEEAFT LPLLNDPEPKPSVEPVKSISSMELKTEPFDDFLFPASSRPSGSETARSVPDMDLSGSFYA ADWEPLHSGSLGMGPMATELEPLCTPVVTCTPSCTAYTSSFVFTYPEADSFPSCAAAHRK GSSSNEPSSDSLSSPTLLAL >gi568815584f:75178983_75381421|GENSCAN_predicted_CDS_3|1143_bp atgatgttctcgggcttcaacgcagactacgaggcgtcatcctcccgctgcagcagcgcg tccccggccggggatagcctctcttactaccactcacccgcagactccttctccagcatg ggctcgcctgtcaacgcgcaggacttctgcacggacctggccgtctccagtgccaacttc attcccacggtcactgccatctcgaccagtccggacctgcagtggctggtgcagcccgcc ctcgtctcctccgtggccccatcgcagaccagagcccctcaccctttcggagtccccgcc ccctccgctggggcttactccagggctggcgttgtgaagaccatgacaggaggccgagcg cagagcattggcaggaggggcaaggtggaacagttatctccagaagaagaagagaaaagg agaatccgaagggaaaggaataagatggctgcagccaaatgccgcaaccggaggagggag ctgactgatacactccaagcggagacagaccaactagaagatgagaagtctgctttgcag accgagattgccaacctgctgaaggagaaggaaaaactagagttcatcctggcagctcac cgacctgcctgcaagatccctgatgacctgggcttcccagaagagatgtctgtggcttcc cttgatctgactgggggcctgccagaggttgccaccccggagtctgaggaggccttcacc ctgcctctcctcaatgaccctgagcccaagccctcagtggaacctgtcaagagcatcagc agcatggagctgaagaccgagccctttgatgacttcctgttcccagcatcatccaggccc agtggctctgagacagcccgctccgtgccagacatggacctatctgggtccttctatgca gcagactgggagcctctgcacagtggctccctggggatggggcccatggccacagagctg gagcccctgtgcactccggtggtcacctgtactcccagctgcactgcttacacgtcttcc ttcgtcttcacctaccccgaggctgactccttccccagctgtgcagctgcccaccgcaag ggcagcagcagcaatgagccttcctctgactcgctcagctcacccacgctgctggccctg tga >gi568815584f:75178983_75381421|GENSCAN_predicted_peptide_4|133_aa MGRAEAAGRRHGAGWGPRCGAVTGVADGAYVSPWKPNSKQRTRELLAPRPVWIIPYIEQM SKAMLQLKALESSDLTEVVVYSSYWYKLQTKWMLQSMAEWHCQHQEQGMLKLAEAMNALK LDPWMKRTSFRPM >gi568815584f:75178983_75381421|GENSCAN_predicted_CDS_4|402_bp atgggcagagcggaagcggcgggccggcgtcacggcgccgggtgggggccgcgctgcggg gcggtgacgggagtcgctgacggcgcctacgtgtcaccgtggaaaccaaacagtaaacag aggactcgcgagctcctggcacccaggcccgtatggataatcccttacatcgagcagatg agcaaggccatgctccagctgaaggctctggagtcttcagacctcaccgaggtcgtggtt tacagctcctattggtacaagctccaaaccaagtggatgctccagtccatggctgagtgg cactgccagcaccaggagcaagggatgctcaaacttgcagaagccatgaatgccctcaaa ctagacccttggatgaagcgaaccagcttccggccaatgtga >gi568815584f:75178983_75381421|GENSCAN_predicted_peptide_5|170_aa MTCSSFPNGDLTLKQVQQQQVAVGSSPHFPTPPSPERASLSTTGTRIPISGFAPGKPDLR LPCPLEDYLEAMVVASNYSSRQATPEITEGLKTKLRNILAVSFKPCFAEERPLLARDPSP ATLPPAAAAFGIVGYGVHSSTKPVLSTEALTIQPESDTGSPTCGADNHLN >gi568815584f:75178983_75381421|GENSCAN_predicted_CDS_5|513_bp atgacatgttcatccttccccaatggagacctcacactgaagcaggtgcagcagcagcaa gtggctgtcggcagctctccccacttccccacaccaccttctccagagagagcttcctta tcaactactggcacaaggatccctatctctggctttgctcccgggaagcctgatctaaga ctcccctgtcccttggaagattaccttgaggcaatggtagtagcctccaactactcttcc aggcaggctaccccagagatcacagaaggcttgaagaccaaattaagaaacatcctggcg gtttccttcaaaccttgttttgctgaagagcgccctcttctggctcgggacccttctcca gccacgctgcctcctgcggctgcagcttttgggattgtaggctatggcgtccattcctcc accaaacctgtactgagcactgaggccctgacaatacagccggaatcagacaccggctcc cccacctgtggggctgacaatcatttgaactga >gi568815584f:75178983_75381421|GENSCAN_predicted_peptide_6|194_aa MFDVVQTADPTHYFIHGAWLETEEARHHHHKSSDLITKYLMVLEISTCCLGLLFRGLDLT VFILSITKPVSLCIQEATKCLEIEEGEYYEFQHGETGFLHIMESMVAGSPRLSPPSRKDP VFSRKLCTAIATNAITITITITTITTTITITTTTINTYTITTISTIISTIIITMPPPPPL PLPMPEWKRSGGAS >gi568815584f:75178983_75381421|GENSCAN_predicted_CDS_6|582_bp atgtttgatgttgtacagacagcagaccctacccactacttcattcatggtgcctggttg gagactgaggaagccaggcaccaccatcataaatcctcagacctcatcacgaaatacctg atggttctggagataagtacctgctgcctgggcctcttgtttcggggtctggacctcact gtgtttatactcagcatcacgaagcccgtgagcttgtgtatacaggaggccaccaaatgt ttggagatagaggagggagagtattatgagtttcagcatggtgagacaggctttctccac atcatggaatcaatggttgctggcagtcccagactctcacctcccagcagaaaggaccct gttttttcccgcaaactttgcactgccattgccacaaacgccatcactatcaccatcacc atcacaaccattactactaccatcaccatcaccaccaccaccatcaacacctataccata actaccatctccaccatcatcagcaccattatcatcaccatgccaccaccaccaccactg ccattgccaatgcctgagtggaagaggagtggaggagcgagn