GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:03:40 Sequence gi568815576r:31183087_31392344 : 209258 bp : 47.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4009 4184 176 0 2 72 65 153 0.791 10.32 1.02 Intr + 9919 10054 136 1 1 49 77 23 0.028 -2.13 1.03 Intr + 12383 12495 113 1 2 100 94 38 0.201 4.78 1.04 Intr + 18245 18296 52 2 1 33 97 12 0.065 -4.49 1.05 Intr + 18412 18529 118 0 1 119 110 61 0.992 11.44 1.06 Term + 21403 21500 98 2 2 124 38 141 0.932 10.83 1.07 PlyA + 21614 21619 6 -8.16 2.05 PlyA - 21638 21633 6 -4.04 2.04 Term - 21790 21641 150 1 0 131 45 126 0.194 10.51 2.03 Intr - 22332 22236 97 2 1 38 76 47 0.102 -1.49 2.02 Intr - 29223 29000 224 0 2 100 107 71 0.536 7.13 2.01 Init - 31912 31910 3 1 0 93 95 0 0.549 1.20 2.00 Prom - 37299 37260 40 -2.16 3.00 Prom + 54716 54755 40 -3.36 3.01 Init + 65552 65683 132 1 0 71 109 77 0.634 8.26 3.02 Intr + 70108 70220 113 0 2 80 70 6 0.330 -2.82 3.03 Intr + 75205 75340 136 1 1 94 101 159 0.958 18.37 3.04 Intr + 76035 76144 110 2 2 111 99 89 0.999 11.38 3.05 Intr + 76803 76991 189 0 0 76 89 245 0.999 22.10 3.06 Intr + 79048 79153 106 1 1 71 65 227 0.999 18.92 3.07 Intr + 79509 79705 197 2 2 81 45 155 0.999 8.71 3.08 Intr + 82860 83046 187 0 1 96 105 153 0.972 17.49 3.09 Intr + 83898 83984 87 2 0 82 94 109 0.999 11.07 3.10 Intr + 84690 84821 132 2 0 96 89 224 0.999 24.14 3.11 Intr + 85058 85114 57 1 0 92 101 33 0.904 4.08 3.12 Intr + 88050 88115 66 2 0 89 76 89 0.794 6.90 3.13 Intr + 89444 89618 175 1 1 100 113 334 0.999 36.61 3.14 Intr + 90366 90421 56 1 2 77 83 36 0.997 0.70 3.15 Intr + 92065 92222 158 0 2 82 77 207 0.893 17.81 3.16 Intr + 93703 94050 348 1 0 29 53 636 0.571 48.67 3.17 Term + 95211 95355 145 0 1 131 41 196 0.942 16.58 3.18 PlyA + 96619 96624 6 1.05 4.07 PlyA - 99110 99105 6 1.05 4.06 Term - 100202 99998 205 1 1 90 52 197 0.998 13.14 4.05 Intr - 106307 106229 79 0 1 93 78 151 0.981 13.21 4.04 Intr - 106613 106413 201 0 0 71 51 233 0.989 17.16 4.03 Intr - 107998 107879 120 2 0 110 82 199 0.999 21.97 4.02 Intr - 108210 108094 117 1 0 94 114 142 0.999 17.84 4.01 Init - 109258 109189 70 1 1 74 78 71 0.913 6.08 4.00 Prom - 125529 125490 40 -3.76 5.00 Prom + 131197 131236 40 -6.86 5.01 Init + 134534 134600 67 1 1 74 62 107 0.886 7.93 5.02 Term + 138463 138590 128 2 2 22 53 115 0.584 -0.16 5.03 PlyA + 139369 139374 6 1.05 6.05 PlyA - 139690 139685 6 1.05 6.04 Term - 144223 143805 419 2 2 100 54 386 0.943 31.84 6.03 Intr - 152777 152606 172 2 1 105 74 167 0.703 16.52 6.02 Intr - 159874 159811 64 0 1 108 91 35 0.812 4.52 6.01 Init - 162516 161246 1271 0 2 111 94 1499 0.987 144.58 6.00 Prom - 181304 181265 40 -0.96 7.02 PlyA - 183338 183333 6 1.05 7.01 Sngl - 202505 202140 366 2 0 76 41 169 0.976 7.10 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:31183087_31392344|GENSCAN_predicted_peptide_1|230_aa MASKGPSASASPENSSAGGPSGSSNGAGESGGQDSTFECNICLDTAKDAVISLCGHLFCA SYHINSPPALKGYENIRKGKIFLVSTKSFCEERKVYFPSFSTYLWLETRPNRQVCPVCKA GISRDKVIPLYGRGSTGQQDPRVYMGGKAVPHLRNMWHKGFQGFGFGDGGFQMSFGIGAF PFGIFATAFNINDGRPPPAVPGTPQYVDEQFLSRLFLFVALVIMFWLLIA >gi568815576r:31183087_31392344|GENSCAN_predicted_CDS_1|693_bp atggcaagcaaggggccctcggcctctgcatctcctgagaactccagtgcaggggggccc agtgggagcagcaatggcgctggcgagagcggagggcaggacagcactttcgagtgcaac atctgcttggacacagccaaggatgccgtcatcagcctgtgtggccacctcttctgtgcc tcctatcatatcaacagtcccccagcccttaaaggttatgaaaatattaggaaggggaaa atttttttggtgtcaacaaagagcttttgtgaggagagaaaagtctattttcctagcttt agcacctacttgtggttggagaccagacctaacagacaggtgtgtcctgtttgcaaagct ggcatcagccgagacaaggtcatccccctctatggaaggggcagcactgggcaacaggac cccagggtatatatgggaggaaaagctgtcccccaccttcgaaacatgtggcacaaggga tttcaaggatttggatttggagatggtggcttccagatgtcttttggaattggggcattt ccctttgggatatttgccacagcatttaatataaatgatgggcggcctcctccagctgtc cctgggacaccccagtatgtggacgagcagttcctgtcacgcctcttcctatttgtggcc ctggtgatcatgttctggctcctgattgcctaa >gi568815576r:31183087_31392344|GENSCAN_predicted_peptide_2|157_aa MAREDHSPRRERDAPPTAPWEDQTTPGTGDCPPTPRPAALRPRSRDAAPPPGSGSGSAAA LAPPSASGVPSRCSRRSQSQLRQYSRLYVLWQREADEHSFREKADGKPSQDLQNWLKQDF MVSPMRDDSMPRPEVIECLQRWRYSGVIANWSRVPKL >gi568815576r:31183087_31392344|GENSCAN_predicted_CDS_2|474_bp atggcgcgggaagaccacagccccaggcgcgaacgggatgcaccgcctacggcgccctgg gaggaccaaaccacacctggcacaggggactgcccgccgacgccccgccccgccgctctc aggcctcgctcccgagacgcggccccgccccctggctctggctcgggctctgcagcggcc ctggcgccccctagtgcgagcggcgtcccctcgaggtgctcccgccgtagtcagtctcag cttcggcagtattctcggctgtatgttctctggcagagagaggcagatgaacatagtttt agggagaaagctgatgggaaacctagtcaggacctgcagaactggctgaaacaagatttc atggtgtcacccatgagagatgactcaatgccaaggcctgaagttatagagtgtttacag cggtggcgatattcaggggtcatcgccaactggtctcgagttccaaagctctga >gi568815576r:31183087_31392344|GENSCAN_predicted_peptide_3|797_aa MVAIYSRPEAVTDGFAAEPVYLVWEEDGELLVSPGLLHLQRPVSDSGMTGSDLSFTTSQL GTLLPTARADFDFHLKLPKIERCSECQDSLTNWYYEKDGKLYCPKDYWGKFGEFCHGCSL LMTGPFMVAGEFKYHPECFACMSCKVIIEDGDAYALVQHATLYCGKCHNEVVLAPMFERL STESVQEQLPYSVTLISMPATTEGRRGFSVSVESACSNYATTVQVKEVNRMHISPNNRNA IHPGDRILEINGTPVRTLRVEEVEDAISQTSQTLQLLIEHDPVSQRLDQLRLEARLAPHM QNAGHPHALSTLDTKENLEGTLRRRSLRRSNSISKSPGPSSPKEPLLFSRDISRSESLRC SSSYSQQIFRPCDLIHGEVLGKGFFGQAIKVTHKATGKVMVMKELIRCDEETQKTFLTEV KVMRSLDHPNVLKFIGVLYKDKKLNLLTEYIEGGTLKDFLRSMDPFPWQQKVRFAKGIAS GMAYLHSMCIIHRDLNSHNCLIKLDKTVVVADFGLSRLIVEERKRAPMEKATTKKRTLRK NDRKKRYTVVGNPYWMAPEMLNGKSYDETVDIFSFGIVLCEIIGQVYADPDCLPRTLDFG LNVKLFWEKFVPTDCPPAFFPLAAICCRLEPESRAPPGAAGEGPGCADDEGPVRRQGKVT IKYDPKELRKHLNLEEWILEQLTRLYDCQEEEISELEIDVDELLDMESDDAWASRVKELL VDCYKPTEAFISGLLDKIRAMQKLSTPQKKPAFSKLEDSFEALSLYLGELGIPLPAELEE LDHTVSMQYGLTRDSPP >gi568815576r:31183087_31392344|GENSCAN_predicted_CDS_3|2394_bp atggtggccatctacagccggcctgaggcagtcacagacggatttgcagctgagcctgtc tatctggtgtgggaagaagatggggagttacttgtcagtcccggcttacttcacctccag agacctgtttcggacagtgggatgactggttcagacctcagctttaccacctcccagctg ggtactcttctacctacagccagggcagattttgactttcacttgaaacttccaaaaatt gaaaggtgttcagaatgccaggattccctcaccaactggtactatgagaaggatgggaag ctctactgccccaaggactactgggggaagtttggggagttctgtcatgggtgctccctg ctgatgacagggccttttatggtggctggggagttcaagtaccacccagagtgctttgcc tgtatgagctgcaaggtgatcattgaggatggggatgcatatgcactggtgcagcatgcc accctctactgtgggaagtgccacaatgaggtggtgctggcacccatgtttgagagactc tccacagagtctgttcaggagcagctgccctactctgtcacgctcatctccatgccggcc accactgaaggcaggcggggcttctccgtgtccgtggagagtgcctgctccaactacgcc accactgtgcaagtgaaagaggtcaaccggatgcacatcagtcccaacaatcgaaacgcc atccaccctggggaccgcatcctggagatcaatgggacccccgtccgcacacttcgagtg gaggaggtggaggatgcaattagccagacgagccagacacttcagctgttgattgaacat gaccccgtctcccaacgcctggaccagctgcggctggaggcccggctcgctcctcacatg cagaatgccggacacccccacgccctcagcaccctggacaccaaggagaatctggagggg acactgaggagacgttccctaaggcgcagtaacagtatctccaagtcccctggccccagc tccccaaaggagcccctgctgttcagccgtgacatcagccgctcagaatcccttcgttgt tccagcagctattcacagcagatcttccggccctgtgacctaatccatggggaggtcctg gggaagggcttctttgggcaggctatcaaggtgacacacaaagccacgggcaaagtgatg gtcatgaaagagttaattcgatgtgatgaggagacccagaaaacttttctgactgaggtg aaagtgatgcgcagcctggaccaccccaatgtgctcaagttcattggtgtgctgtacaag gataagaagctgaacctcctgacagagtacattgaggggggcacactgaaggactttctg cgcagtatggatccgttcccctggcagcagaaggtcaggtttgccaaaggaatcgcctcc ggaatggcctatttgcactctatgtgcatcatccaccgggatctgaactcgcacaactgc ctcatcaagttggacaagactgtggtggtggcagactttgggctgtcacggctcatagtg gaagagaggaaaagggcccccatggagaaggccaccaccaagaaacgcaccttgcgcaag aacgaccgcaagaagcgctacacggtggtgggaaacccctactggatggcccctgagatg ctgaacggaaagagctatgatgagacggtggatatcttctcctttgggatcgttctctgt gagatcattgggcaggtgtatgcagatcctgactgccttccccgaacactggactttggc ctcaacgtgaagcttttctgggagaagtttgttcccacagattgtcccccggccttcttc ccgctggccgccatctgctgcagactggagcctgagagcagagccccccccggggccgca ggagagggcccgggctgcgcggatgatgagggcccagtgaggcgccaagggaaggtcacc atcaagtatgaccccaaggagctacggaagcacctcaacctagaggagtggatcctggag cagctcacgcgcctctacgactgccaggaagaggagatctcagaactagagattgacgtg gatgagctcctggacatggagagtgacgatgcctgggcttccagggtcaaggagctgctg gttgactgttacaaacccacagaggccttcatctctggcctgctggacaagatccgggcc atgcagaagctgagcacaccccagaagaaaccagcattctcgaaattggaggactccttt gaggccctctccctgtacctgggggagctgggcatcccgctgcctgcagagctggaggag ttggaccacactgtgagcatgcagtacggcctgacccgggactcacctccctag >gi568815576r:31183087_31392344|GENSCAN_predicted_peptide_4|263_aa MLLAWVQAFLVSNMLLAEAYGSGGCFWDNGHLYREDQTSPAPGLRCLNWLDAQSGLASAP VSGAGNHSYCRNPDEDPRGPWCYVSGEAGVPEKRPCEDLRCPETTSQALPAFTTEIQEAS EGPGADEVQVFAPANALPARSEAAAVQPVIGISQRVRMNSKEKKDLGTLGYVLGITMMVI IIAIGAGIILGYSYKRGKDLKEQHDQKVCEREMQRITLPLSAFTNPTCEIVDEKTVVVHT SQTPVDPQEGTTPLMGQAGTPGA >gi568815576r:31183087_31392344|GENSCAN_predicted_CDS_4|792_bp atgctgttggcctgggtacaagcattcctcgtcagcaacatgctcctagcagaagcctat ggatctggaggctgtttctgggacaacggccacctgtaccgggaggaccagacctccccc gcgccgggcctccgctgcctcaactggctggacgcgcagagcgggctggcctcggccccc gtgtcgggggccggcaatcacagttactgccgaaacccggacgaggacccgcgcgggccc tggtgctacgtcagtggcgaggccggcgtccctgagaaacggccttgcgaggacctgcgc tgtccagagaccacctcccaggccctgccagccttcacgacagaaatccaggaagcgtct gaagggccaggtgcagatgaggtgcaggtgttcgctcctgccaacgccctgcccgctcgg agtgaggcggcagctgtgcagccagtgattgggatcagccagcgggtgcggatgaactcc aaggagaaaaaggacctgggaactctgggctacgtgctgggcattaccatgatggtgatc atcattgccatcggagctggcatcatcttgggctactcctacaagagggggaaggatttg aaagaacagcatgatcagaaagtatgtgagagggagatgcagcgaatcactctgcccttg tctgccttcaccaaccccacctgtgagattgtggatgagaagactgtcgtggtccacacc agccagactccagttgaccctcaggagggcaccaccccccttatgggccaggccgggact cctggggcctga >gi568815576r:31183087_31392344|GENSCAN_predicted_peptide_5|64_aa MAKIQTTNADKDVEQQELSFIAAFSSSFYSLKGPFVSLGDMKRFVTSNVSCYLDLKGKFA HVRL >gi568815576r:31183087_31392344|GENSCAN_predicted_CDS_5|195_bp atggccaaaatccagacaacaaatgctgacaaggatgtggagcaacaggaactttcattc attgctgcattttccagcagcttctacagtctgaaagggccctttgtctcccttggtgac atgaagaggtttgtcacttcaaatgtcagctgttacttggacttgaagggcaagtttgcc catgtccgcctgtga >gi568815576r:31183087_31392344|GENSCAN_predicted_peptide_6|641_aa MERVNDASCGPSGCYTYQVSRHSTEMLHNLNQQRKNGGRFCDVLLRVGDESFPAHRAVLA ACSEYFESVFSAQLGDGGAADGGPADVGGATAAPGGGAGGSRELEMHTISSKVFGDILDF AYTSRIVVRLESFPELMTAAKFLLMRSVIEICQEVIKQSNVQILVPPARADIMLFRPPGT SDLGFPLDMTNGAALAANSNGIAGSMQPEEEAARAAGAAIAGQASLPVLPGVDRLPMVAG PLSPQLLTSPFPSVASSAPPLTGKRGRGRPRKANLLDSMFGSPGGLREAGILPCGLCGKV FTDANRLRQHEAQHGVTSLQLGYIDLPPPRLGENGLPISEDPDGPRKRSRTRKQVACEIC GKIFRDVYHLNRHKLSHSGEKPYSCPVCGLRFKRKDRMSYHVRSHDGSVGKPYICQSCGK GFSRPDHLNGHIKQVHTSERPHKCQTCNASFATRDRLRSHLACHEDKVPCQVCGKYLRAA YMADHLKKHSEGPSNFCSICNREGQKCSHQDPIESSDSYGDLSDASDLKTPEKQSANGSF SCDMAVPKNKMESDGEKKYPCPECGSFFRSKSYLNKHIQKVHVRALGGPLGDLGPALGSP FSPQQNMSLLESFGFQIVQSAFASSLVDPEVDQQPMGPEGK >gi568815576r:31183087_31392344|GENSCAN_predicted_CDS_6|1926_bp atggagcgggtgaacgacgcttcgtgcggcccgtctggctgctacacataccaggtgagc agacacagcacggagatgctgcacaacctgaaccagcagcgcaaaaacggcgggcgcttc tgcgacgtgctcttgcgggtaggcgacgagagcttcccagcgcaccgcgccgtgctggcc gcctgcagcgagtactttgagtcggtgttcagcgcccagttgggcgacggcggagctgcg gacgggggtccggctgatgtagggggcgcgacggcagcaccaggcggcggggccgggggc agccgggagctggagatgcacactatcagctccaaggtatttggggacattctggacttc gcctacacttcccgcatcgtggtgcgcttggagagctttcccgaactcatgacggccgcc aagttcctgctgatgaggtcggttatcgagatctgccaggaagtcatcaaacagtccaac gtacagatcctggtaccccctgcccgcgccgatataatgctctttcgcccccctgggacc tcggacttgggcttccctttggacatgaccaacggggcagccttggcagccaacagcaat ggcatcgccggcagcatgcagccagaggaggaggcagctcgggcggctggtgcagccatt gcaggccaagcctctttgcctgtgttacctggggtggaccgcttgcccatggtggctgga cccctatccccccaactgctgacttccccattccccagtgtggcatccagtgcccctccc ctgactggcaagcgaggccggggccgcccaaggaaggccaacctgctggactcaatgttt gggtccccagggggcctgagggaggcaggcatccttccatgcggtctatgtggtaaggtg ttcactgatgccaaccggctccggcagcacgaggcccagcacggtgtcaccagcctccag ctgggctacatcgaccttcctcctccgaggctgggtgagaatgggctacccatctctgaa gaccccgacggcccccgaaagaggagccggaccaggaagcaggtggcttgtgagatctgc ggcaagatcttccgtgatgtgtatcatcttaaccggcacaagctgtcccactctggggag aagccctactcctgccctgtgtgtgggttgcggttcaagagaaaagaccgcatgtcctac catgtgcggtcccatgatgggtccgtgggcaagccttacatctgccagagctgtgggaaa ggcttctccaggcctgatcacttgaacggacatatcaagcaggtgcacacttctgagcgg cctcacaagtgtcagacctgcaatgcttcttttgccacccgagaccgtctgcgctcccac ctggcctgtcatgaagacaaggtgccctgccaggtgtgtgggaagtacttgcgggcagca tacatggcagaccacctgaagaagcacagcgaggggcccagcaacttctgcagtatctgt aaccgagaaggccagaaatgctcacatcaggatccgattgagagctctgactcctatggt gacctctcagatgccagcgacctgaagacgccagagaagcagagtgccaatggctctttc tcctgcgacatggcagtccccaaaaacaaaatggagtctgatggggagaagaagtaccca tgccctgaatgtgggagcttcttccgctctaagtcctacttgaacaaacacatccagaag gtgcatgtccgggctctcgggggccccctgggggacctgggccctgcccttggctcacct ttctctcctcagcagaacatgtctctcctcgagtcctttgggtttcagattgttcagtcg gcatttgcgtcatctttagtagatcctgaggttgaccagcagcccatggggcctgaaggg aaatga >gi568815576r:31183087_31392344|GENSCAN_predicted_peptide_7|121_aa MDTFLDTYTFPKLKQEEIDSLNRTIMSSKMKSVINSLLTKKSPGPDEVTAEFHQMYKDGI VLFLLKLFQKIEAEGLLPNSFCEPGIILTLKPGRDTPKIENFKPISLMNIDAEILNKILA N >gi568815576r:31183087_31392344|GENSCAN_predicted_CDS_7|366_bp atggatacattcctggacacatacaccttcccaaaactgaaacaagaagaaatagattcc ctgaacagaacaataatgagctccaaaatgaaatcagtaataaatagcctactaacaaag aaaagcccaggaccagatgaagtcacagccgaattccaccagatgtacaaagacgggata gtactatttcttctgaaactatttcaaaaaattgaggcagagggactcctccccaactca ttctgtgagcccggcatcatcctgacactaaaacctggcagagacacacccaaaatagaa aacttcaagccaatatccttgatgaacatcgatgcagaaatcctcaacaaaatacttgca aattga