GENSCAN 1.0 Date run: 7-Nov-116 Time: 16:20:43 Sequence gi568815584r:76925405_77127792 : 202388 bp : 49.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 7439 7598 160 1 1 114 27 60 0.065 2.06 1.02 Intr + 11883 11978 96 1 0 111 28 108 0.156 7.08 1.03 Intr + 14894 15030 137 0 2 99 75 32 0.090 3.29 1.04 Term + 22991 23053 63 1 0 67 41 62 0.029 -2.71 1.05 PlyA + 26615 26620 6 1.05 2.03 PlyA - 27504 27499 6 1.05 2.02 Term - 33958 33842 117 1 0 64 55 65 0.535 -0.66 2.01 Init - 35091 34984 108 0 0 38 105 107 0.985 7.62 2.00 Prom - 37505 37466 40 -1.36 3.00 Prom + 38115 38154 40 -7.06 3.01 Init + 43279 43513 235 0 1 107 43 263 0.917 22.00 3.02 Term + 43663 43742 80 2 2 68 40 139 0.999 5.03 3.03 PlyA + 43806 43811 6 1.05 4.05 PlyA - 44012 44007 6 1.05 4.04 Term - 49674 49441 234 0 0 92 44 174 0.985 9.72 4.03 Intr - 60560 60477 84 2 0 47 72 116 0.540 5.92 4.02 Intr - 61567 61438 130 0 1 95 96 7 0.462 2.90 4.01 Init - 62695 62679 17 1 2 77 77 16 0.388 -0.94 4.00 Prom - 69678 69639 40 -2.66 5.00 Prom + 71277 71316 40 -5.86 5.01 Init + 71559 71589 31 2 1 75 67 21 0.083 -1.40 5.02 Intr + 72004 72106 103 2 1 107 76 41 0.086 4.03 5.03 Intr + 74961 75075 115 0 1 43 90 41 0.042 0.25 5.04 Term + 85582 85743 162 0 0 49 43 134 0.208 2.94 5.05 PlyA + 90788 90793 6 1.05 6.08 PlyA - 96273 96268 6 1.05 6.07 Term - 102398 99998 2401 1 1 45 46 3261 0.242 302.98 6.06 Intr - 103154 102626 529 0 1 10 59 234 0.224 4.70 6.05 Intr - 107790 107661 130 0 1 123 31 78 0.501 5.87 6.04 Intr - 118686 118643 44 1 2 33 85 69 0.023 -0.94 6.03 Intr - 136434 136325 110 2 2 95 80 28 0.072 2.63 6.02 Intr - 144453 144286 168 2 0 42 61 102 0.115 1.96 6.01 Init - 147889 147747 143 1 2 74 65 56 0.218 1.60 6.00 Prom - 149262 149223 40 -2.26 7.00 Prom + 182985 183024 40 -4.06 7.01 Init + 183788 183797 10 1 1 77 99 2 0.706 0.83 7.02 Intr + 184408 184577 170 2 2 80 94 88 0.978 8.27 7.03 Intr + 188021 188755 735 1 0 112 75 147 0.411 7.44 7.04 Intr + 192929 193010 82 1 1 91 99 23 0.485 2.91 7.05 Intr + 198979 199152 174 2 0 61 15 119 0.242 1.81 7.06 Term + 199642 199934 293 2 2 53 28 206 0.337 6.61 7.07 PlyA + 201706 201711 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:76925405_77127792|GENSCAN_predicted_peptide_1|151_aa LGPLTGSSSLWTAICFCNWVICMHLSFGDAFVDTVLVEQLCRAGCQQESGPQRGPVQGAE GATPSWLEMQQLCLPLKDPTANQQPGPAPIQPPSSLNWAPNLGSLEYQSPITMPMIPFPD PSPAAGPQSGELQHRHSGTEPVLSNVDQVLP >gi568815584r:76925405_77127792|GENSCAN_predicted_CDS_1|456_bp cttgggccactcaccggctcttcttctctctggactgctatttgcttctgtaactgggtg atttgcatgcacttgtcgtttggggatgcatttgtggacaccgtcttggtggaacagttg tgccgagctgggtgtcagcaggagagtggcccccagagggggccggtgcagggggctgag ggtgccacgcccagctggctggagatgcagcagctctgcctgcccctgaaggacccaact gcgaatcagcagcctgggccggctcccatccagcccccctcctctctcaactgggctccc aacctgggaagtctggaataccagtctcctatcaccatgcccatgatcccctttccagat ccctccccagctgcggggccacagtcaggggagctacagcacaggcactccggcaccgag ccagtgctcagcaatgttgatcaggtgctgccctga >gi568815584r:76925405_77127792|GENSCAN_predicted_peptide_2|74_aa MDLNLQLEKPTTLEELIPYSAGSSIPDRWKLAAIFLAPKAEMAVASPTNPEWRTQHPVSS SVLDSLEQECQMWQ >gi568815584r:76925405_77127792|GENSCAN_predicted_CDS_2|225_bp atggatctcaacctacaactggaaaaacccactactctggaggagctgattccgtactct gctggatcttcaattcctgaccgctggaaattggctgcaattttcctggccccgaaagca gaaatggctgtggcctcacctacgaatcctgaatggcgaacccagcatcctgtctctagc tcagtgctggattccctggagcaggagtgccaaatgtggcagtga >gi568815584r:76925405_77127792|GENSCAN_predicted_peptide_3|104_aa MASISELACIYSALILHDNEVTVTEYKIKALIKAAGVNVEPFRPGLFAKAPANVNIRSLI CNVGAGGPAPAAGAAPAGAEEKKMEAKKEEFEDSDDDMGFGLSD >gi568815584r:76925405_77127792|GENSCAN_predicted_CDS_3|315_bp atggcctccatctccgagctcgcctgcatctactcagccctcattctgcacgacaatgag gtgactgtcacagagtataagatcaaggccctcattaaagcagctggtgtaaatgttgaa ccttttcggcctggcttgtttgcaaaggccccggccaatgtcaacattaggagcctcatc tgcaatgtaggggctggaggacctgctccagcagctggtgctgcaccagcaggagctgag gagaagaaaatggaagcaaagaaagaagaatttgaggactctgatgatgacatgggcttt ggtctttctgactaa >gi568815584r:76925405_77127792|GENSCAN_predicted_peptide_4|154_aa MEIQDWPALVLDVRRVVLGLSLRTPGGPFMETGLACRGEGRRVKSPLTKSGELQLKAESL GLPGGKVSLGCAGESPWPWLVVLSLGPSSPSASLLELSLSKALPQDGLPLEVQGRDVDSG YFKALRVFLRQLQIESLTPEEKARGHFPLDDLEL >gi568815584r:76925405_77127792|GENSCAN_predicted_CDS_4|465_bp atggaaattcaagattggcctgccctggtgctggacgtgaggagggtggttctagggctg agcttgaggaccccagggggtcctttcatggaaactggcctggcgtgcagaggggagggt cgcagagtcaagtcccctctcaccaagtctggggagctgcagctgaaagcagaatccctg ggccttcctggtggaaaagtctccctgggctgtgctggtgagagcccctggccctggctg gtggttctcagtcttggcccatcaagcccatcagcgtccctgttggagctttcactgagc aaggccctgccccaggatggtctccctctggaggtgcaggggcgggacgtggactctgga tattttaaagctctccgggtctttctgagacaattacagattgagagcttgacccctgag gaaaaggcccgtgggcatttccccctggatgacctggagctgtga >gi568815584r:76925405_77127792|GENSCAN_predicted_peptide_5|136_aa MVEEFSQVHTVVPPIGQTQPEARKHESSADAAHTDQPPGLQSGGGLKLAGIGMHHKSPGC SCDWIIYSSSLTSLTRNCHTKGKWGWHQVQEAVAETQCQQMIEHWGFTGDLHSGKRVQRQ RVGQKKRNCLQKACSL >gi568815584r:76925405_77127792|GENSCAN_predicted_CDS_5|411_bp atggttgaggaattcagccaagttcacacagtggtgccgcccattgggcaaacccagcca gaagccaggaagcatgagagttcagctgatgcagcccatacagatcagcctcctggactt cagagtgggggaggcctgaagctggctggcattggcatgcaccataaatctccaggatgc agctgtgactggattatttatagctccagtcttacatctctaactcgaaactgtcacaca aagggaaagtggggatggcaccaggttcaagaggccgtagcagagacccagtgccagcaa atgatagagcactggggctttactggggacttacattcagggaagagagttcagcggcag cgggttgggcagaagaaacgcaactgcttgcaaaaggcatgcagtttatag >gi568815584r:76925405_77127792|GENSCAN_predicted_peptide_6|1174_aa MGKKDSTALQGREKRDITEALKRRSIQWYDLEETRRGPSSRVCKNSGSNLSSGARILHCS TALGLHTHVSGADPSQSALPSEGSITCRSNAMDSSGLSSRALAWPPLGTLMCLRLPTSGP LLHKDLQAVVLKLEHASEAPATTAIQPDSNNEPPQAQGSNCASGVPGARRSGAIERPEEG PTQTFVFIFSKADLAGQRTNVSAASRGGPGAGQAWGRREPACVHSSPLLSCVLEAVAAAA PSGVQTQSPPAERPTLRRGEGPFSAATPSPGPPGKQRLQQDWTRAPGGTEPSGPRPRGPV GERTQLVGGSPGRGRDSPFRASTHQPLRGGGQEATAATTGHPRPLPSGRELPAQVCSTRK KAPELPSRRVNRSRRAGIMSAAQVSSSRRQSCYLCDLPRMPWAMIWDFSEPVCRGCVNYE GADRIEFVIETARQLKRAHGCFQDGRSPGPPPPVGVKTVALSAKEAAAAAAAAAAAAAAA QQQQQQQQQQQQQQQQQQQQQQQQQLNHVDGSSKPAVLAAPSGLERYGLSAAAAAAAAAA AAVEQRSRFEYPPPPVSLGSSSHTARLPNGLGGPNGFPKPTPEEGPPELNRQSPNSSSAA ASVASRRGTHGGLVTGLPNPGGGGGPQLTVPPNLLPQTLLNGPASAAVLPPPPPHALGSR GPPTPAPPGAPGGPACLGGTPGVSATSSSASSSTSSSVAEVGVGAGGKRPGSVSSTDQER ELKEKQRNAEALAELSESLRNRAEEWASKPKMVRDTLLTLAGCTPYEVRFKKDHSLLGRV FAFDAVSKPGMDYELKLFIEYPTGSGNVYSSASGVAKQMYQDCMKDFGRGLSSGFKYLEY EKKHGSGDWRLLGDLLPEAVRFFKEGVPGADMLPQPYLDASCPMLPTALVSLSRAPSAPP GTGALPPAAPSGRGAAASLRKRKASPEPPDSAEGALKLGEEQQRQQWMANQSEALKLTMS AGGFAAPGHAAGGPPPPPPPLGPHSNRTTPPESAPQNGPSPMAALMSVADTLGTAHSPKD GSSVHSTTASARRNSSSPVSPASVPGQRRLASRNGDLNLQVAPPPPSAHPGMDQVHPQNI PDSPMANSGPLCCTICHERLEDTHFVQCPSVPSHKFCFPCSRESIKAQGATGEVYCPSGE KCPLVGSNVPWAFMQGEIATILAGDVKVKKERDP >gi568815584r:76925405_77127792|GENSCAN_predicted_CDS_6|3525_bp atgggcaagaaggacagcactgctttgcagggaagggaaaagagagacatcactgaggca ctgaagaggagaagcattcagtggtatgaccttgaagagacaagaagaggaccaagttca agagtctgcaagaacagtgggagtaacctcagcagtggagcgaggattctgcactgttcc actgccctggggctgcacacccacgtgagcggggccgacccgtcccagagcgctttgccc tcagaaggcagcatcacctgcaggagcaatgccatggacagctctggcttgtcctccagg gccctggcatggccccccttgggtactctaatgtgcctgcgcctgcccacctcggggcct ttgttacacaaagacttgcaagcagtggttctgaaactcgaacatgcatcggaagcccct gcaacaactgccatccagccagacagcaacaatgaaccaccacaggcccagggctcaaac tgcgcctctggggttcctggcgcgcggcgatcaggggccatcgaaaggccagaggagggc ccaacgcagacgtttgttttcatcttctccaaggcggatctcgcaggccagagaaccaac gtgagcgccgccagccgcggcggcccgggcgccggccaggcctgggggcggcgggagcct gcgtgcgtgcactcctctcctctgctctcgtgcgtcctggaagcagtggccgcggcggct ccctccggggtgcaaacccagtcgccgccagcagaacggccgacgctgcggaggggagaa ggtcctttctcggctgccaccccctcccccggtcctccggggaagcagcggcttcagcaa gattggacccgggcaccgggtggcactgaaccctctggccctcgccccagggggcccgtc ggggagaggacgcagctcgtaggggggtccccggggagaggaagagacagcccctttcga gcttccacgcaccagccactccggggagggggccaagaggcaacggcggccaccaccggg caccctcgccccctcccctcgggccgggagcttccagcccaagtctgcagcaccaggaag aaggcgcctgagctcccctcgcgacgagtcaaccgcagtaggagggcaggcatcatgtcg gcggcgcaggtgtcctcgtcccggagacaatcttgctacctgtgcgacctgccccgcatg ccctgggccatgatctgggacttctcggaacccgtatgccgcggttgcgtcaactacgag ggcgctgatcgcatcgaattcgtgatcgagacagcgcgccagctgaagcgggcgcacggc tgcttccaggacggccgctcccccgggccgccgccgcccgtcggggtcaagacagtggcc ctgtcggctaaggaagcggcggcggcggcggcagcagcggcggccgccgccgccgccgcg caacagcaacagcaacagcagcagcagcagcagcaacagcagcagcagcagcagcagcag cagcagcaacaacagctcaaccacgttgatggttccagcaagcctgcggtgctggcggcc ccgtctggcctggagcgctacggcctaagcgctgccgccgccgccgccgccgccgccgcc gctgcggtggaacagcgcagccgcttcgagtacccgccaccgccggtgagcctgggaagc agcagccacaccgcgcgactgcccaacggcctggggggcccaaacggcttccccaaacca acaccagaggagggacccccagagctgaaccgtcagagccccaattcttcttcagcggcg gcgtcggtggcgtctcggcgtggaacgcacggtgggctggttacggggctgcccaacccg gggggtggcggaggcccccagctcaccgtgccccccaacctgctaccgcagacgctgctt aacggcccggccagcgctgcggtactccccccaccccctccccacgccctgggcagccgt gggcccccgacgcctgctcccccaggggctcctgggggccccgcttgtctcgggggtacc ccgggtgtatcggccacgtcgtcctccgcgtcgtcttcgacctcttcgtcggtggcagag gtgggcgtgggtgctggtggtaagaggcccggctcggtgtcgagcacagaccaggagcgc gagttgaaggagaagcagcgcaacgccgaggccctggccgagctgagcgagagcctgcgc aaccgcgccgaggagtgggccagcaagcccaagatggtccgcgacacgctgctcacgctg gcaggctgcacgccctacgaggttcgcttcaagaaggaccactcgctgctgggccgcgtt ttcgccttcgacgccgtctccaagcccggcatggactacgaattgaagctgttcattgag taccccacgggctcgggcaacgtgtactccagtgcatctggtgtggccaagcagatgtat caggactgcatgaaggacttcggccggggcctatcctcgggtttcaagtacctggagtac gaaaagaagcacggctccggggactggcgcctgcttggagacctgctccccgaagccgtg cgcttcttcaaggagggcgtgcccggcgccgacatgctgccccagccctacctggacgcc agctgtcccatgctgcccactgctctggtgagtctgagccgcgcccccagcgcacccccg gggaccggggccttgccgcccgccgcgccgtcgggccggggcgcagccgccagcctgcgc aagagaaaggcctctccggagcccccggactcagccgagggcgcgctgaagctgggcgag gaacagcagaggcagcagtggatggcgaaccagagcgaggcgctgaagctcaccatgtcc gccgggggcttcgcggcgccggggcacgcggcggggggtccgcctccgccgcccccacct ctgggaccccattccaaccggaccaccccacctgagtcagccccccagaacggtccgtcc cctatggccgctctcatgtcggtggcagatactctgggcacagcgcactcgcccaaggat ggcagttccgtgcactctaccactgcgtcggcgcggcgaaacagcagcagcccagtctcg ccggcctccgtgccggggcagcgccgcttggcatcacgtaacggggacctgaatttacag gtggcgcccccgccgcctagcgcccacccgggcatggaccaagtgcacccccaaaacatt ccggattcccccatggccaacagcggacccctctgctgcaccatttgccacgaacgtttg gaggatacgcatttcgttcagtgcccttccgtccccagccacaaattttgcttcccttgc tctagagagagtatcaaggcccagggggccaccggcgaggtgtattgccccagcggagag aaatgccccctagtcgggtcgaatgtaccttgggccttcatgcagggcgaaatcgcgact atcttagctggggatgttaaagtgaaaaaggagagagacccttga >gi568815584r:76925405_77127792|GENSCAN_predicted_peptide_7|487_aa MEPDGSSECLSSAEQMESEDMLSALGWSREDRPRQNSKTAKNAFPTLSPMVVMKNVLVKQ GSSSSQLQSWTVQPSFEVISAQPQLLFLHPPVPSPVSPCHTGEKKSDSRNYLPILNSYTK IAPHPGKRGLSLGPEEKGTSGVQKKICTERLGPSLSSSEPTKAGAVPSSPSTPAPPSAKL AEDSALQGVPSLVAGGSPQTLQPVSSSHVAKAPSLTFASPASPVCASDSTLHGLESNSPL SPLSANYSSPLWAAEHLCRSPDIFSEQRQSKHRRFQNTLVVLHKSGLLEITLKTKELIRQ NQATQLNDEEFELVTLKFNRSANLCKIYSCVTAMSTPFDKQGKICRSFNKSFLSTHVRGS VLETENEKRALPCWGRAQKPMGCGFQCLPSSNTGSQARGARAGRVERGGPTAPPPGGPEP RPRALAGAGEATRDGVAAGAQTDAPRCAAFIPRLGRWKGGPDTRRGLRNRLRDSVVFVVR SRHHCFL >gi568815584r:76925405_77127792|GENSCAN_predicted_CDS_7|1464_bp atggaaccagatgggagctcggaatgtctgagctctgcagagcagatggagtccgaggac atgctgagcgccttaggctggagcagagaagacaggccgaggcagaactccaaaactgca aagaatgccttccctaccctgtctcccatggtcgtcatgaagaatgtgcttgtcaaacag ggcagcagctcatcccagctccagtcgtggactgtccagccctcctttgaagtgatctca gcacagccacagctcttattccttcatccacctgtaccatctcctgtcagtccatgtcac actggtgagaaaaagtccgactccaggaactacttgcccattctgaattcttacaccaaa atagccccacatccaggcaaaaggggcctttcccttggcccagaagaaaaaggaacaagt ggagtgcagaagaaaatctgtactgagagacttgggcctagcttgtcttccagtgagcca accaaggctggtgctgtcccatccagtccctcgacgccagcaccacccagcgccaaactt gccgaggactcagctctgcagggtgtgccctctctggtggcaggtggaagtccacagact cttcagccggtatccagcagtcacgtggctaaagctcccagtctgaccttcgcttccccc gccagtcctgtctgcgcatcagacagcactctccatgggttagagagcaactctcccctt tcaccactgtccgctaattatagctcacctttatgggctgcagagcacctctgccgcagc ccagatatcttttcagagcagcggcagagcaaacataggcgctttcagaataccctagta gtcctacataaatctggtttgctggagatcactttgaaaaccaaggagttgattcgtcag aatcaggcaactcagctgaatgatgaagaatttgaattagttaccttgaagtttaatcgc agtgctaacctttgtaagatctatagttgtgttacagctatgagcaccccctttgacaaa caaggaaaaatctgccgttccttcaacaaatccttcctgagcacccacgtgcgaggctct gtgctagagactgagaatgaaaaacgggcgctcccctgctggggaagagctcagaagcca atgggatgtggatttcaatgtcttcccagctccaacactggcagccaagcgcggggcgcg cgcgccgggagagtggagcgcggaggcccgacggcgccccctcccggcgggcccgagcca cggccgcgggctctggcgggtgccggggaggccacgcgcgacggcgtcgcagccggagcc cagacagacgccccacgctgcgcggccttcatcccacggctggggcgatggaaaggtggt cccgacacgcgcaggggtctgaggaatcggcttcgggattctgtggtctttgtggtccgc agccggcaccactgcttcctgtaa