GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:22:38 Sequence gi568815588r:35539348_35741429 : 202082 bp : 43.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 13672 13838 167 0 2 92 84 159 0.592 14.66 1.02 Intr + 26676 26838 163 0 1 99 94 134 0.934 15.08 1.03 Intr + 28496 28620 125 1 2 45 60 52 0.195 -2.52 1.04 Intr + 29191 29356 166 1 1 59 94 55 0.231 3.16 1.05 Term + 29707 29823 117 0 0 131 36 139 0.983 11.54 1.06 PlyA + 30083 30088 6 1.05 2.03 PlyA - 31875 31870 6 1.05 2.02 Term - 39746 39592 155 0 2 76 42 73 0.171 -0.42 2.01 Init - 46135 45859 277 1 1 71 105 247 0.561 21.85 2.00 Prom - 53688 53649 40 -6.46 3.00 Prom + 55011 55050 40 -2.96 3.01 Init + 66221 66284 64 1 1 81 92 65 0.737 7.55 3.02 Term + 68231 69279 1049 0 2 108 48 1122 0.973 102.50 3.03 PlyA + 70111 70116 6 1.05 4.04 PlyA - 71488 71483 6 1.05 4.03 Term - 71629 71524 106 0 1 84 48 82 0.405 1.68 4.02 Intr - 79790 79676 115 0 1 109 43 80 0.335 5.11 4.01 Init - 86153 86102 52 2 1 76 53 23 0.475 -1.07 4.00 Prom - 89444 89405 40 -1.36 5.04 PlyA - 90966 90961 6 1.05 5.03 Term - 102327 99998 2330 1 2 46 53 3667 0.047 344.38 5.02 Intr - 105243 105109 135 1 0 67 78 103 0.839 7.74 5.01 Init - 114180 114072 109 0 1 103 41 85 0.712 5.68 5.00 Prom - 119058 119019 40 -6.26 6.00 Prom + 121912 121951 40 -5.36 6.01 Sngl + 127779 128504 726 2 0 79 47 681 0.634 59.33 6.02 PlyA + 129196 129201 6 1.05 7.02 PlyA - 129436 129431 6 1.05 7.01 Sngl - 136638 136369 270 0 0 60 48 170 0.941 5.48 7.00 Prom - 136899 136860 40 -5.46 8.02 PlyA - 136991 136986 6 1.05 8.01 Sngl - 139237 138608 630 1 0 86 37 274 0.992 18.30 8.00 Prom - 141594 141555 40 -4.66 9.00 Prom + 155038 155077 40 -4.16 9.01 Init + 158067 158258 192 2 0 52 47 156 0.552 6.87 9.02 Intr + 158277 158597 321 2 0 -46 60 226 0.109 2.46 9.03 Term + 162004 162102 99 0 0 47 32 106 0.084 -0.97 9.04 PlyA + 163870 163875 6 1.05 10.05 PlyA - 165236 165231 6 1.05 10.04 Term - 166057 165923 135 1 0 106 54 54 0.125 1.82 10.03 Intr - 171413 171398 16 1 1 82 75 0 0.022 -6.35 10.02 Intr - 172055 171902 154 0 1 111 24 118 0.409 6.83 10.01 Intr - 173489 173369 121 2 1 13 97 95 0.431 2.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 102082 99998 2085 1 0 85 53 3632 0.936 351.69 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:35539348_35741429|GENSCAN_predicted_peptide_1|245_aa VYLERLLTYAEIDICPANWKRIVLGAILLASKVWDDQAVWNVDYCQILKDITVEDMNELE RQFLELLQFNINVPSSVYAKYYFDLRSLAEANNLSFPLEPLSRERAHKLEPLEWKAHPVS QLSSIRGKMENPPSAGLPEGKCCETFASTDVRGDLSPPSFSVESLAGGLAAAVFAASGRA LLSVRLSSKPVLHASPARMVLFLPRLQAISRLCEDKYKDLRRSARKRSASADNLTLPRWS PAIIS >gi568815588r:35539348_35741429|GENSCAN_predicted_CDS_1|738_bp gtgtaccttgaaagacttttaacatacgcagagatagatatctgtccggccaactggaag cggattgttttaggggcgatcctgctggcctccaaggtgtgggatgaccaggctgtatgg aatgtggattactgccagatcctgaaagacatcacggtggaggacatgaacgagctagag cgacagtttcttgaattgctgcagttcaacatcaatgttccttccagtgtctatgccaag tattattttgatcttcgttctctggcagaagcgaacaacctgagctttcccttggagccc ctgagcagggagagggctcacaagcttgagcctttggaatggaaagcccacccagtctcc cagttatcatctattcgaggaaaaatggaaaaccctccctctgcaggacttcctgaaggc aaatgttgtgaaaccttcgcaagcacagatgtgagaggtgacttaagtcctccatccttt tctgtggagtccctagctggtggtctggctgcagctgtctttgccgccagcggccgagct ctcctctctgtgaggctcagctccaagcccgtgcttcatgccagccctgccaggatggtg ctgttcttgccccgcttgcaggccatctctcgcctctgcgaggacaagtacaaggaccta agaagatccgcgaggaagcgctcagccagtgcagacaacctgactctgccccggtggtcc ccagccatcatctcttaa >gi568815588r:35539348_35741429|GENSCAN_predicted_peptide_2|143_aa MGRKQHMQCPKAGSEPAGYRALCAMQDKQSLGGNKVRVEAGPCLVLGQCGFYPIDNGKSL KVSIGGDTVGSAFEEDGITCCEEKDPEGGSGGGFLGSNHISFKVTEHEINDCRTHECLQE EMMFKNHFIYISIPPAFLLLLKM >gi568815588r:35539348_35741429|GENSCAN_predicted_CDS_2|432_bp atgggcaggaagcagcacatgcaatgccctaaagctgggagtgaacctgcggggtaccgg gcactgtgtgcaatgcaggacaaacaaagcctgggcgggaacaaggtgagggtggaagca gggccctgtttggtcctgggacagtgtgggttctacccgatcgacaatggaaagtcactg aaggtctccattggaggtgacacagtcggatctgcatttgaggaagatggcatcacctgt tgtgaagagaaggacccagaaggcggcagcggtggaggttttcttggtagcaaccacatt agcttcaaggttactgaacacgagataaatgactgtagaacccatgagtgcctgcaggaa gaaatgatgttcaagaaccactttatctacatttctatacctccagctttcctccttttg ctcaagatgtga >gi568815588r:35539348_35741429|GENSCAN_predicted_peptide_3|370_aa MEGVDLLGFLIITLNCNVTMVGKLWFVLTMLLRMLVIVLAGRPVYQDEQERFVCNTLQPG CANVCYDVFSPVSHLRFWLIQGVCVLLPSAVFSVYVLHRGATLAALGPRRCPDPREPASG QRRCPRPFGERGGLQVPDFSAGYIIHLLLRTLLEAAFGALHYFLFGFLAPKKFPCTRPPC TGVVDCYVSRPTEKSLLMLFLWAVSALSFLLGLADLVCSLRRRMRRRPGPPTSPSIRKQS GASGHAEGRRTDEEGGREEEGAPAPPGARAGGEGAGSPRRTSRVSGHTKIPDEDESEVTS SASEKLGRQPRGRPHREAAQDPRGSGSEEQPSAAPSRLAAPPSCSSLQPPDPPASSSGAP HLRARKSEWV >gi568815588r:35539348_35741429|GENSCAN_predicted_CDS_3|1113_bp atggaaggcgtggacttgctagggtttctcatcatcacattaaactgcaacgtgaccatg gtgggaaagctctggttcgtcctcacgatgctgctgcggatgctggtgattgtcttggcg gggcgacccgtctaccaggacgagcaggagaggtttgtctgcaacacgctgcagccggga tgcgccaatgtttgctacgacgtcttctcccccgtgtctcacctgcggttctggctgatc cagggcgtgtgcgtcctcctcccctccgccgtcttcagcgtctatgtcctgcaccgagga gccacgctcgccgcgctgggcccccgccgctgccccgacccccgggagccggcctccggg cagagacgctgcccgcggccattcggggagcgcggcggcctccaggtgcccgacttttcg gccggctacatcatccacctcctcctccggaccctgctggaggcagccttcggggccttg cactactttctctttggattcctggccccgaagaagttcccttgcacgcgccctccgtgc acgggcgtggtggactgctacgtgtcgcggcccacagagaagtccctgctgatgctgttc ctctgggcggtcagcgcgctgtcttttctgctgggcctcgccgacctggtctgcagcctg cggcggcggatgcgcaggaggccgggaccccccacaagcccctccatccggaagcagagc ggagcctcaggccacgcggagggacgccggactgacgaggagggtgggcgggaggaagag ggggcaccggcgcccccgggtgcacgcgccggaggggagggggctggcagccccaggcgt acatccagggtgtcagggcacacgaagattccggatgaggatgagagtgaggtgacatcc tccgccagcgaaaagctgggcagacagccccggggcaggccccaccgagaggccgcccag gaccccaggggctcaggatccgaggagcagccctcagcagcccccagccgcctggccgcg cccccttcctgcagcagcctgcagccccctgacccgcctgccagctccagtggtgctccc cacctgagagccaggaagtctgagtgggtgtga >gi568815588r:35539348_35741429|GENSCAN_predicted_peptide_4|90_aa MDTKRETTDARAYLRMEGEDGPEYLEAEIEQIRQGAGGIWEDHLFIKAVGSHRLEWSVHS ALSHHSVEYEVEDSSSDHLPPLSTWSLTLL >gi568815588r:35539348_35741429|GENSCAN_predicted_CDS_4|273_bp atggacacaaagagagaaacaacagatgccagggcctacttgaggatggagggcgaggat gggcctgagtacctcgaagctgaaatagagcagatccggcagggggccgggggaatatgg gaagaccatctcttcatcaaagcagtggggagccacaggctggaatggtctgtccactca gctctcagccatcactcagtagagtatgaagttgaggactcgtcctcggaccacctgcct ccactttccacgtggtccctgactcttttatga >gi568815588r:35539348_35741429|GENSCAN_predicted_peptide_5|857_aa MGKTPEGDRAEKQKEPESLMTFLKICIYQPGKLPYFYQHWNLWQLPAADAFWRLALWKQG ANRRDSEFLHSTYKWEFRFAGAAFKPPERRKGVRKGPAQAAGRAAADAPLEGVRLSADPP PGRSIRETPDHPLYPSAALSWAPPPAPLCAPPGGRGHRAEAHSMEWGYLLEVTSLLAALA LLQRSSGAAAASAKELACQEITVPLCKGIGYNYTYMPNQFNHDTQDEAGLEVHQFWPLVE IQCSPDLKFFLCSMYTPICLEDYKKPLPPCRSVCERAKAGCAPLMRQYGFAWPDRMRCDR LPEQGNPDTLCMDYNRTDLTTAAPSPPRRLPPPPPGEQPPSGSGHGRPPGARPPHRGGGR GGGGGDAAAPPARGGGGGGKARPPGGGAAPCEPGCQCRAPMVSVSSERHPLYNRVKTGQI ANCALPCHNPFFSQDERAFTVFWIGLWSVLCFVSTFATVSTFLIDMERFKYPERPIIFLS ACYLFVSVGYLVRLVAGHEKVACSGGAPGAGGAGGAGGAAAGAGAAGAGAGGPGGRGEYE ELGAVEQHVRYETTGPALCTVVFLLVYFFGMASSIWWVILSLTWFLAAGMKWGNEAIAGY SQYFHLAAWLVPSVKSIAVLALSSVDGDPVAGICYVGNQSLDNLRGFVLAPLVIYLFIGT MFLLAGFVSLFRIRSVIKQQDGPTKTHKLEKLMIRLGLFTVLYTVPAAVVVACLFYEQHN RPRWEATHNCPCLRDLQPDQARRPDYAVFMLKYFMCLVVGITSGVWVWSGKTLESWRSLC TRCCWASKGAAVGGGAGATAAGGGGGPGGGGGGGPGGGGGPGGGGGSLYSDVSTGLTWRS GTASSVSYPKQMPLSQV >gi568815588r:35539348_35741429|GENSCAN_predicted_CDS_5|2574_bp atgggtaaaacccctgagggggacagggcagagaaacagaaggaacctgaatccttgatg acgtttttgaaaatctgtatctatcaacctggaaaactgccctacttttatcagcactgg aacctctggcaattacctgcagctgatgcgttctggagactcgccctctggaaacaaggt gccaaccgcagagatagtgaatttctacattcaacatacaagtgggaattccgatttgca ggagccgccttcaaacccccggagcggcggaaaggagtgcggaagggccctgcccaggct gcgggccgcgccgctgcggacgcgcccctagaaggggttagattatcggcagacccccct cccgggcgaagcataagggagaccccagatcacccgctgtacccgtctgcggctctctcg tgggccccccccccggcacccctctgcgcgccgcctggagggcgagggcaccgggccgag gcgcacagcatggagtggggttacctgttggaagtgacctcgctgctggccgccttggcg ctgctgcagcgctctagcggcgctgcggccgcctcggccaaggagctggcatgccaagag atcaccgtgccgctgtgtaagggcatcggctacaactacacctacatgcccaatcagttc aaccacgacacgcaagacgaggcgggcctggaggtgcaccagttctggccgctggtggag atccagtgctcgcccgatctcaagttcttcctgtgcagcatgtacacgcccatctgccta gaggactacaagaagccgctgccgccctgccgctcggtgtgcgagcgcgccaaggccggc tgcgcgccgctcatgcgccagtacggcttcgcctggcccgaccgcatgcgctgcgaccgg ctgcccgagcaaggcaaccctgacacgctgtgcatggactacaaccgcaccgacctaacc accgccgcgcccagcccgccgcgccgcctgccgccgccgccgcccggcgagcagccgcct tcgggcagcggccacggccgcccgccgggggccaggcccccgcaccgcggcggcggcagg ggcggtggcggcggggacgcggcggcgcccccagctcgcggcggcggcggtggcgggaag gcgcggccccctggcggcggcgcggctccctgcgagcccgggtgccagtgccgcgcgcct atggtgagcgtgtccagcgagcgccacccgctctacaaccgcgtcaagacaggccagatc gctaactgcgcgctgccctgccacaacccctttttcagccaggacgagcgcgccttcacc gtcttctggatcggcctgtggtcggtgctctgcttcgtgtccaccttcgccaccgtctcc accttccttatcgacatggagcgcttcaagtacccggagcggcccattatcttcctctcg gcctgctacctcttcgtgtcggtgggctacctagtgcgcctggtggcgggccacgagaag gtggcgtgcagcggtggcgcgccgggcgcggggggcgctgggggcgcgggcggcgcggcg gcgggcgcgggcgcggcgggcgcgggcgcgggcggcccgggcgggcgcggcgagtacgag gagctgggcgcggtggagcagcacgtgcgctacgagaccaccggccccgcgctgtgcacc gtggtcttcttgctggtctacttcttcggcatggccagctccatctggtgggtgatcttg tcgctcacatggttcctggcggccggtatgaagtggggcaacgaagccatcgccggctac tcgcagtacttccacctggccgcgtggcttgtgcccagcgtcaagtccatcgcggtgctg gcgctcagctcggtggacggcgacccggtggcgggcatctgctacgtgggcaaccagagc ctggacaacctgcgcggcttcgtgctggcgccgctggtcatctacctcttcatcggcacc atgttcctgctggccggcttcgtgtccctcttccgcatccgctcggtcatcaagcaacag gacggccccaccaagacgcacaagctggagaagctgatgatccgcctgggcctgttcacc gtgctctacaccgtgcccgccgcggtggtggtcgcctgcctcttctacgagcagcacaac cgcccgcgctgggaggccacgcacaactgcccgtgcctgcgggacctgcagcccgaccag gcacgcaggcccgactacgccgtcttcatgctcaagtacttcatgtgcctagtggtgggc atcacctcgggcgtgtgggtctggtccggcaagacgctggagtcctggcgctccctgtgc acccgctgctgctgggccagcaagggcgccgcggtgggcgggggcgcgggcgccacggcc gcggggggtggcggcgggccggggggcggcggcggcgggggacccggcggcggcgggggg ccgggcggcggcgggggctccctctacagcgacgtcagcactggcctgacgtggcggtcg ggcacggcgagctccgtgtcttatccaaagcagatgccattgtcccaggtctga >gi568815588r:35539348_35741429|GENSCAN_predicted_peptide_6|241_aa MNITTIITAITTTITITTIPTIITTITNPITTTTTITTTTTIITTITTTTIFTTATTIIT TTTIITTITTTTTIITTITTTTIIITITTIPHHHHHHHHHHHHHHHHHYHYHHHHHYHHH HHHHHHHHHHHHHHHHHYHHHHHHYHPPPSSPPSPSPPPSLPPPTITTTITITPPSLPPP SSSPPPSPSPPPSLPPPPSSPPSSLALPLPSPLHHHHDFLALSSQMSPPYLAHTRHLTNI C >gi568815588r:35539348_35741429|GENSCAN_predicted_CDS_6|726_bp atgaacatcactaccatcatcactgccatcaccaccaccatcaccatcactaccatcccc accatcatcaccaccatcaccaaccctatcaccaccaccaccaccatcaccaccaccacc accatcatcaccaccatcaccaccaccaccatcttcaccaccgccaccaccatcatcacc accaccaccatcatcaccaccatcaccaccaccaccaccatcatcaccaccatcaccact accaccatcatcatcaccatcactaccatcccccaccatcaccaccatcaccatcaccac caccaccatcatcaccaccatcactaccactaccatcatcatcaccactatcaccatcat caccaccaccaccatcaccaccatcaccaccatcatcaccaccatcaccactaccatcat catcaccatcactaccaccccccaccatcatcaccaccatcaccatcaccaccaccatca ctaccaccccccaccatcaccaccaccatcaccatcacaccaccatcactaccaccccca tcatcatcaccaccaccatcaccatcaccaccaccatcactaccacccccaccatcatca ccaccatcatcactagcactgccgctaccatcaccactgcaccatcatcatgacttcctg gcactatcatcacaaatgtcaccaccatacttggcacatactagacacttaacgaatatt tgctga >gi568815588r:35539348_35741429|GENSCAN_predicted_peptide_7|89_aa MEEAEKSPLCSNNKEKAEETENQPFLGSRQREVIGQNIAPRTGGSERQIKRIPSPAEKPM STNFHGNKCEGRKTSTVIDELLKTQFGQD >gi568815588r:35539348_35741429|GENSCAN_predicted_CDS_7|270_bp atggaagaagctgagaagtctccactctgctctaacaacaaggaaaaagctgaggaaact gaaaatcaacctttcttaggttctcggcaaagggaagtcatagggcaaaatatagccccc agaactggagggtcagaaaggcagataaagagaattccatcaccagcggagaaacccatg agcacaaacttccatgggaacaagtgtgagggtaggaaaacctcaactgtgattgatgaa ttgctgaagactcaatttggacaagactga >gi568815588r:35539348_35741429|GENSCAN_predicted_peptide_8|209_aa MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIRNQKRARIAKSILSQKNKAGGIMLPD FKLYYQSTVNKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPDKNKQWGKDSLFNK WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNIRPKTIKTLEENLGNTIQDIGMGKDF MSKTPKAMATKAKIDKWDLIKLKSFCTAK >gi568815588r:35539348_35741429|GENSCAN_predicted_CDS_8|630_bp atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatacggaaccaaaaaagagcc cgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacctgac ttcaaactatactaccagtctacagtaaacaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaactat ctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagacttaaacattagacctaaaaccata aaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaataa >gi568815588r:35539348_35741429|GENSCAN_predicted_peptide_9|203_aa MLRKARRKLIYQKANHYHKECRQMYRTEIRMARLARKAGNIYVPAEPKLAFVIRIRDING VSPKLLCSHQIFNGTFVKLNKASINMLRIVEPHIAWEYPNLKSVNEVIYKRGYGKINKKG ITLTDNTLIAVSLNKYGIICMEDLIHEIYTVGKCFKEASDFPCPSNYLLHEHLALTSDAA LYPGASADPHCLPPPEQPQAVVR >gi568815588r:35539348_35741429|GENSCAN_predicted_CDS_9|612_bp atgcttcgaaaggcaaggaggaaacttatctatcaaaaagcaaaccactatcacaaggaa tgtaggcagatgtacagaactgaaattcgaatggcgaggctggcaagaaaagctggcaac atctatgtacctgcagaacccaaattggcgtttgtcatcaggatcagagatatcaatggt gtgagcccaaagcttctttgctctcatcaaatcttcaatggaacctttgtgaagctcaac aaggcttcaattaacatgctgaggattgtagaaccacatattgcatgggagtacccaaat ctgaagtcagtaaatgaagtaatctacaagcgtggttatggcaaaatcaataagaaggga attactttgacagataacactttgattgctgtgtctcttaataaatatggcatcatctgc atggaggatctgattcatgagatctatactgttggaaagtgcttcaaagaagcaagtgac ttcccgtgcccttcaaattatcttctccacgagcacctggcgctgacttcggatgctgcc ctgtacccaggggcctctgctgatccccactgcctgccccctccggagcagccccaggcc gtggtgcggtaa >gi568815588r:35539348_35741429|GENSCAN_predicted_peptide_10|141_aa SSRKAEELDILKSTQLISLYCLGLDEDTCTFPWQELSASLGRHTLWDQVLNHSGRATEVQ RRKKSPGHNSGKAFGGTQALSGTLRDKWAVESYSVRKDVLLEVNDYPEKTSASENSWTWI HLTKSVFSEFEHECSVSPAMS >gi568815588r:35539348_35741429|GENSCAN_predicted_CDS_10|426_bp agttcaagaaaagctgaagaattggacatcctgaagtccactcagctgatttccttgtac tgcttgggcctggatgaggacacttgtacctttccatggcaggagttatcagccagcctc ggaaggcacacgctgtgggaccaggtgctaaaccattcaggaagggcaacagaagttcag cgaaggaagaagtctccggggcacaatagtggaaaagctttcggaggaactcaggccttg agtgggactctgagagataagtgggctgtggagagctacagtgtaaggaaggatgtcttg ctggaagtcaacgattaccctgagaaaacatctgcatctgaaaacagctggacgtggatt catctcaccaaaagtgttttttcagagttcgaacatgaatgcagtgtgtcaccagcaatg tcatga