GENSCAN 1.0	Date run: 4-Nov-116	Time: 04:13:35

Sequence gi568815588r:62713210_62916029 : 202820 bp : 41.92% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4643   4741   99  0  0   61   81    47 0.027   0.46
 1.02 Term +   9585   9799  215  1  2  119   39   120 0.045   6.61
 1.03 PlyA +  12689  12694    6                               1.05

 2.09 PlyA -  14835  14830    6                               1.05
 2.08 Term -  15862  15656  207  1  0   87   41   134 0.186   4.96
 2.07 Intr -  21372  21309   64  2  1  159   91    -4 0.057   4.80
 2.06 Intr -  22201  22097  105  0  0   68  105    59 0.854   4.01
 2.05 Intr -  22619  22576   44  2  2   62   52    64 0.882  -3.68
 2.04 Intr -  23448  23368   81  0  0  109   93    45 0.748   6.02
 2.03 Intr -  23931  23803  129  0  0   52   87    74 0.264   3.67
 2.02 Intr -  31932  31912   21  0  0  121   79    13 0.344   0.52
 2.01 Init -  33570  33430  141  0  0   95   91    68 0.269   7.98
 2.00 Prom -  44906  44867   40                              -6.55

 3.03 PlyA -  46488  46483    6                               1.05
 3.02 Term -  48324  48134  191  1  2   82   43   150 0.737   6.53
 3.01 Init -  51430  51391   40  1  1  101  110     9 0.407   4.81
 3.00 Prom -  53603  53564   40                              -5.45

 4.00 Prom +  58611  58650   40                              -6.05
 4.01 Init +  61440  61539  100  2  1   82   50    56 0.419   1.67
 4.02 Intr +  62275  62360   86  2  2   81   25    82 0.134  -0.18
 4.03 Intr +  68720  68862  143  1  2   83  113    22 0.645   2.43
 4.04 Intr +  69637  69938  302  1  2   42   50   190 0.597   6.05
 4.05 Intr +  70095  70281  187  1  1  -12   98   159 0.179   4.73
 4.06 Intr +  72864  72991  128  0  2   14   72    76 0.017  -2.00
 4.07 Term +  74063  74790  728  0  2   81   47   232 0.043  11.05
 4.08 PlyA +  75734  75739    6                               1.05

 5.00 Prom +  91050  91089   40                              -3.95
 5.01 Sngl +  91851  92663  813  2  0   95   48  1011 0.788  91.32
 5.02 PlyA +  94089  94094    6                               1.05

 6.08 PlyA -  94408  94403    6                               1.05
 6.07 Term - 101259  99998 1262  1  2  122   50  1154 0.961 105.75
 6.06 Intr - 102306 102218   89  2  2   90   -5    74 0.819  -3.00
 6.05 Intr - 102836 102438  399  1  0  -10   31   595 0.499  36.70
 6.04 Intr - 105122 104960  163  0  1  -53   99   173 0.521   2.41
 6.03 Intr - 105438 105332  107  2  2   77   83   112 0.789   8.54
 6.02 Intr - 105744 105541  204  2  0   73   80    70 0.752   2.09
 6.01 Init - 108657 108581   77  0  2   50   72    79 0.942   3.11
 6.00 Prom - 108862 108823   40                              -1.75

 7.00 Prom + 113709 113748   40                              -5.05
 7.01 Init + 116662 116713   52  0  1   37   99    70 0.860   4.37
 7.02 Intr + 131384 131460   77  0  2   78   96    10 0.044  -0.88
 7.03 Intr + 136414 136476   63  0  0   90  105    56 0.706   5.50
 7.04 Intr + 145466 145602  137  1  2   77   45    96 0.489   2.65
 7.05 Intr + 146770 146889  120  1  0  100   99    67 0.304   7.69
 7.06 Intr + 154543 154666  124  1  1    0   94    54 0.011  -3.13
 7.07 Intr + 156709 156896  188  0  2   83   83   104 0.058   6.97
 7.08 Intr + 170543 170750  208  2  1   19   66   113 0.008   0.36
 7.09 Intr + 175700 175821  122  1  2   85   60    84 0.165   3.67
 7.10 Intr + 183708 183829  122  0  2   80   84    63 0.608   4.32
 7.11 Term + 185589 185662   74  1  2   64   44    89 0.301  -0.81
 7.12 PlyA + 186698 186703    6                               1.05

 8.04 PlyA - 188961 188956    6                               1.05
 8.03 Term - 197883 197706  178  2  1   45   39   165 0.662   3.48
 8.02 Intr - 198600 198471  130  1  1   87   37    96 0.728   3.23
 8.01 Init - 200780 200744   37  2  1  100   82    18 0.388   2.27

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  20431  20251  181  0  1   55   33   198 0.868   7.10
S.002 Sngl +  73804  74790  987  0  0   60   47   295 0.881  19.19

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:62713210_62916029|GENSCAN_predicted_peptide_1|104_aa
XYTRSITLASASGEGLRLLPLMAEGKGRAGMSYGYLTFPSAPHAQAKQMIQKLLKRKVPQ
SDPPNSYQVVKRAEFAWQLFHWGNKSTWEGREVWKEDICGAPDT
>gi568815588r:62713210_62916029|GENSCAN_predicted_CDS_1|315_bp
nactatacaagaagcataacactggcatctgcatctggtgagggcctcaggctgcttcca
ctcatggcagaaggcaaagggagagcagggatgtcatatggttacctgactttcccgtca
gctccccatgcacaggccaagcagatgatacagaagttactgaagagaaaggtaccacag
tcagatccacctaattcttatcaagttgtaaagagggcagaatttgcttggcagctcttc
cactggggaaataaaagcacctgggaagggagagaagtgtggaaggaggacatttgtgga
gctccagacacctaa
>gi568815588r:62713210_62916029|GENSCAN_predicted_peptide_2|263_aa
MGQRVGEYPEKEKWGPELMQVPVWDGVPDPRHLRKTWKNLHKTSKVQVFLGAFPILICGL
PWPSPFRSPWQTLITNMAVIPVEASGAPESSTLPYSQTTGCIAGLPCPWLPVGIGQLEAP
ERDLLGVDEVHPSDTWTLRKNGKIGVLIGKQVSKSVRHPESLRNGRRVKHSTVRHLDLLL
SLILPSTFPPRPFYQNSCYMQRSGEKNSTMLLHMEPSRNLPETPLQLETKRQFTLLALRA
EMEMTKSQGHQILSLLLADLLSF
>gi568815588r:62713210_62916029|GENSCAN_predicted_CDS_2|792_bp
atggggcagagggttggagagtatcctgaaaaggagaaatgggggccagagctcatgcaa
gtcccagtgtgggatggggtgcctgatcccagacacctgaggaagacttggaaaaatttg
cacaagacatccaaagtgcaggtgttcctcggagcttttcccatattaatttgtggactt
ccctggccctcacccttccggagtccatggcagacattgatcaccaacatggcagtcatt
cctgttgaagccagtggggcccccgaatcctcaacattaccttatagtcagaccactggc
tgcattgctgggcttccttgcccttggcttcctgttggaattggccagttggaggcacca
gaaagagatctgcttggtgtggatgaagtccacccaagtgacacctggaccctgaggaag
aatggcaaaattggagtactgattgggaagcaggtaagcaaaagtgtgagacatccagaa
tcactcagaaatggcagacgtgtcaaacacagcacagtgaggcaccttgacctacttttg
tccttaatcttgccaagcacgtttccccctcggcccttttaccagaacagctgttacatg
cagcgctcaggagaaaaaaacagtacgatgctgcttcacatggaacccagcaggaatctg
ccagagacccctctgcagctggagacaaagagacaatttaccttgttggccctcagggca
gaaatggaaatgacaaagagtcaaggacatcagattttgagcctgctgctggctgatctt
cttagcttctag
>gi568815588r:62713210_62916029|GENSCAN_predicted_peptide_3|76_aa
MARTVHISLEENEEEKPRHRERKGPQTPQTGGYQTGVGAQVTCALIHAFHSDKRACKKKQ
PKKKEVVSDDSSNLLF
>gi568815588r:62713210_62916029|GENSCAN_predicted_CDS_3|231_bp
atggctaggacagtccatatctctcttgaggaaaatgaggaagagaaaccaaggcacaga
gagaggaagggacctcagactccacagacaggtggctaccagactggggttggggcccaa
gtcacctgtgccctgatccatgccttccactctgacaaacgtgcctgcaagaagaagcag
ccaaagaagaaagaggttgtttctgatgactcatcgaatttgctcttctaa
>gi568815588r:62713210_62916029|GENSCAN_predicted_peptide_4|557_aa
MRGKSAEDLLGKTFFFDKETNGETVPFFISKKITSAEPDILKSTWSENSTSVMVTKGSEA
LQKSKQSPNLISRNSRRLVRYKSITLTGGWQDGRIGTVLVCSFQRDQHRRAPTGRGGCRC
SFSRLKRSCLLALKRAVNLSAQSSSSAKRQSTSSSGSLTPMPPDWEIPPSRGQQTPHTGE
LWLASGRCPSGTKIPEEGTGSNLCCSAASAGHQHQRPKVDKPTKIRKNQRKKAENFKNQN
ASSPPKDHNSLPAREQNWTENEFDEPTEVSFRRLNQEEIKSLNRPITKSEIEAVINSLPT
KKAQDQTDSQLNSTRELEKTTLKFIWNQKRAHIAKTILSKKNKGGGIMLPDFKLYYKVTV
TKTAWYWYQNRDIDQWNRTEASEITPHIYNHLIFDKPDKNKQWGKDCLFNKWCWENWLAI
CRKLKLDPFLTPYTKINSRWIKDLNIRPKTIKTLEENLGNTIQDIGMGKAFMTKTPKAMT
TKAKIDEWDLIKLKSFCKAKETIIRVNRQPTEWEKICAIYPSDKGLISGIYKELKQIYKK
NTSNPIKKCVKNMNRHF
>gi568815588r:62713210_62916029|GENSCAN_predicted_CDS_4|1674_bp
atgaggggaaagtctgctgaggaccttctaggaaaaactttcttctttgataaagagaca
aatggagaaactgtccctttcttcatctcaaagaaaataacctcagcagaaccagatatt
ctgaagtctacgtggtcagaaaactccacttcagtcatggttacaaaaggctctgaggct
cttcagaaatctaaacagagtcccaatcttatatccaggaacagcagaagactggtgcga
tataaaagtattactctcactggtggctggcaagatggccgaataggaacagttctggtc
tgcagcttccagcgagatcaacacagaagagcacctacgggaaggggtggctgtcggtgc
agcttcagcagacttaaacgttcctgcctgctggctctgaagagagcagtcaatctctca
gcacagagctccagctctgctaagagacagagtacctcctcaagtggatccctgaccccc
atgcctcctgactgggagatacctcctagcaggggtcaacagacacctcatacaggagag
ctctggctggcatccggcaggtgcccctctgggacgaagattccagaggaaggaacaggc
agcaatctttgctgttctgcagcctccgctggtcaccaacatcaaagaccaaaggtagac
aaacccacgaagatcaggaaaaaccagcgcaaaaaggctgaaaatttcaaaaaccagaat
gcctcttctcctccaaaggatcacaactccttgccagcaagggaacaaaactggactgag
aatgagtttgacgaaccgacagaagtaagcttcagaagactaaaccaggaagaaatcaaa
tccctgaatagaccaataacaaagtctgaaattgaggcagtaattaatagcctaccaacc
aaaaaagcccaggaccagacggattcacagctgaattctaccagagaattggaaaaaact
actttaaagttcatatggaaccaaaaaagagcccacattgccaagacaatcctaagcaaa
aagaacaaaggtggaggcatcatgctacctgacttcaaactatactacaaggttacggta
accaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaacagag
gcctcagaaataacgccacacatctacaaccatctaatctttgacaaacctgacaaaaac
aagcaatggggaaaggattgcctatttaataaatggtgttgggaaaactggctagccata
tgcagaaaactgaaactggaccccttccttacaccttatacaaaaattaattcaagatgg
attaaagacttaaacataagacctaaaaccataaaaaccctagaagaaaacctaggcaat
accattcaggacataggcatgggcaaagccttcatgactaaaacaccgaaagcaatgaca
acaaaagccaaaattgatgaatgggatctaattaaactaaagagcttttgcaaggcaaaa
gaaactatcatcagagtgaacaggcaacctacagaatgggagaaaatttgtgcaatctat
ccatctgacaaagggctaatatccggaatctacaaagaacttaaacaaatttacaagaaa
aatacaagcaaccccatcaaaaagtgtgtgaagaatatgaacagacacttctga
>gi568815588r:62713210_62916029|GENSCAN_predicted_peptide_5|270_aa
MPRDNMASLIQRIARQACLTFRGSGGGRGASDRDAASGPEAPMQPGFPENLSKLKSLLTQ
LRAEDLNIAPRKATLQPLPPNLPPVTYMHIYETDGFSLGVFLLKSGTSIPLHDHPGMHGM
LKVLYGTVRISCMDKLDAGGGQRPRALPPEQQFEPPLQPREREAVRPGVLRSRAEYTEAS
GPCILTPHRDNLHQIDAVEGPAAFLDILAPPYDPDDGRDCHYYRVLEPVRPKEASSSACD
LPREVWLLETPQADDFWCEGEPYPGPKVFP
>gi568815588r:62713210_62916029|GENSCAN_predicted_CDS_5|813_bp
atgccccgagacaacatggcctccttgatccaacggatcgcccgccaggcttgcctcacc
ttccggggcagcgggggcggccgcggcgcttccgatcgcgacgcggcttctggcccggag
gcgccgatgcagccgggcttccccgagaacctgagcaagctgaagagcctcctgacccag
ctccgcgccgaggacttgaacatcgccccgcgcaaggccacactgcagccgctgccgccc
aacctgccgccagtcacctacatgcacatctacgagacggacggcttcagcctgggcgtg
ttcctgctcaagagcggcacgtccatcccgctgcacgaccacccgggcatgcacggcatg
ctcaaggtgctgtacggcaccgtgcgcatcagctgcatggacaagctagacgcgggcggc
gggcaacggccgcgggccttgccgcccgagcagcagttcgagccgccgctgcagccccgg
gagcgagaagccgtgcggccgggcgtgctgcgttcgcgggccgagtacaccgaggccagc
ggcccctgcatcctcacaccgcaccgggacaacctgcaccagatcgacgccgtggaaggg
cctgccgccttcctggacatcctggccccgccctacgacccggacgatggccgggactgc
cactattaccgggtgctggagccggtcaggcccaaggaggcctccagctcggcctgtgac
ctgcctcgagaggtgtggctcctggagaccccacaggccgatgacttctggtgcgaggga
gaaccctatccaggtcccaaggtcttcccttga
>gi568815588r:62713210_62916029|GENSCAN_predicted_peptide_6|766_aa
MILDEDYALSLYDQTDSLSLIKVDISPGPALRALLAGHSGVQELPLGVARGDRRRVWQQR
QSRGGPGNSPACRSAHVRREERSASSRLARACCRARDASRTPFRGVVLPMVSAGAAGPAR
AQAEWAGARERWPAAGYRLFWREGEKQRFSGRAEGETAPGSLRALTAVRWARRELAHGCS
GRGRLCEEQMMTAKAVDKIPVTLSGFVHQLSDNIYPVEDLAATSVTIFPNAELGGPFDQM
NGVAGGKPRVFAGPDLRGPEVVDGVQWVCGNRRRGLGLQVDGADCTGFEDWKWVQEEPAT
AETRARGQMRQVASKPAGKVTVKFGALGESCAPGEPSGGEAPGESVDGMINIDMTGEKRS
LDLPYPSSFAPVSAPRNQTFTYMGKFSIDPQYPGASCYPEGIINIVSAGILQGVTSPAST
TASSSVTSASPNPLATGPLGVCTMSQTQPDLDHLYSPPPPPPPYSGCAGDLYQDPSAFLS
AATTSTSSSLAYPPPPSYPSPKPATDPGLFPMIPDYPGFFPSQCQRDLHGTAGPDRKPFP
CPLDTLRVPPPLTPLSTIRNFTLGGPSAGVTGPGASGGSEGPRLPGSSSAAAAAAAAAAY
NPHHLPLRPILRPRKYPNRPSKTPVHERPYPCPAEGCDRRFSRSDELTRHIRIHTGHKPF
QCRICMRNFSRSDHLTTHIRTHTGEKPFACDYCGRKFARSDERKRHTKIHLRQKERKSSA
PSASVPAPSTASCSGGVQPGGTLCSSNSSSLGGGPLAPCSSRTRTP
>gi568815588r:62713210_62916029|GENSCAN_predicted_CDS_6|2301_bp
atgattttggatgaagattatgctttgagcctttatgatcagactgacagcctatcactt
attaaagtggacatcagccccggtcctgccctccgcgccctgctggccgggcactcgggc
gtccaggagctccccctcggggtggcgcgcggcgaccgccggcgggtttggcagcagcgc
cagtcgcggggcggcccgggcaactcgcccgcctgccggtccgcccacgtgcgcagagag
gagcgaagcgcgagcagtcgcctcgcccgcgcttgctgccgggcccgagatgcgagtcgg
actcccttccgaggcgtcgtcttgcccatggtcagcgcgggggccgcgggacccgccaga
gcgcaggcggagtgggctggcgcacgtgagagatggccggccgctggataccggctgttt
tggagggagggggagaagcagcgcttcagtggaagagcagagggagagaccgctcccggc
agcctcagggccctgaccgcggtgcgctgggcccggcgagagctggcgcacggctgcagc
ggtcgaggcaggttgtgcgaggagcaaatgatgaccgccaaggccgtagacaaaatccca
gtaactctcagtggttttgtgcaccagctgtctgacaacatctacccggtggaggacctc
gccgccacgtcggtgaccatctttcccaatgccgaactgggaggcccctttgaccagatg
aacggagtggccggaggtaagccgcgtgtcttcgcaggcccggacctgcgcggcccggag
gtggtggatggggtgcagtgggtgtgcgggaatcgcaggagaggattggggctccaggtg
gacggtgctgactgcactggctttgaagactggaagtgggtgcaggaggaacctgcgaca
gctgagaccagggcgcgcgggcagatgcgccaggtcgccagcaaaccggcgggcaaagtg
acggtaaagttcggggctctcggggagagctgcgcgcctggagagccgagcggcggagaa
gcgccaggggaaagcgtcgatggcatgatcaacattgacatgactggagagaagaggtcg
ttggatctcccatatcccagcagctttgctcccgtctctgcacctagaaaccagaccttc
acttacatgggcaagttctccattgaccctcagtaccctggtgccagctgctacccagaa
ggcataatcaatattgtgagtgcaggcatcttgcaaggggtcacttccccagcttcaacc
acagcctcatccagcgtcacctctgcctcccccaacccactggccacaggacccctgggt
gtgtgcaccatgtcccagacccagcctgacctggaccacctgtactctccgccaccgcct
cctcctccttattctggctgtgcaggagacctctaccaggacccttctgcgttcctgtca
gcagccaccacctccacctcttcctctctggcctacccaccacctccttcctatccatcc
cccaagccagccacggacccaggtctcttcccaatgatcccagactatcctggattcttt
ccatctcagtgccagagagacctacatggtacagctggcccagaccgtaagccctttccc
tgcccactggacaccctgcgggtgccccctccactcactccactctctacaatccgtaac
tttaccctggggggccccagtgctggggtgaccggaccaggggccagtggaggcagcgag
ggaccccggctgcctggtagcagctcagcagcagcagcagccgccgccgccgccgcctat
aacccacaccacctgccactgcggcccattctgaggcctcgcaagtaccccaacagaccc
agcaagacgccggtgcacgagaggccctacccgtgcccagcagaaggctgcgaccggcgg
ttctcccgctctgacgagctgacacggcacatccgaatccacactgggcataagcccttc
cagtgtcggatctgcatgcgcaacttcagccgcagtgaccacctcaccacccatatccgc
acccacaccggtgagaagcccttcgcctgtgactactgtggccgaaagtttgcccggagt
gatgagaggaagcgccacaccaagatccacctgagacagaaagagcggaaaagcagtgcc
ccctctgcatcggtgccagccccctctacagcctcctgctctgggggcgtgcagcctggg
ggtaccctgtgcagcagtaacagcagcagtcttggcggagggccgctcgccccttgctcc
tctcggacccggacaccttga
>gi568815588r:62713210_62916029|GENSCAN_predicted_peptide_7|428_aa
MKDECESMSLGTHSPERACHKGLCSPFGVSSAQSQDCSRARVKNRSPQTGTGQWPVRNKA
TQQEPDPNHLALTQQRWRRKRHISFVQGIRSGPNRRGCKVVSYELPPAFRKYMLQCPAGS
LQNLDLTKGEGDQLAKEAGSTLKAPRERSREKKSKQDITRLNMDGMQTAVWRRSAHLGES
CCRQVNRAQTQFCKHVDFMHVEGCSDWTHQTELAPYSLSCWQYFKAQESARFWRPHAKPE
GDFSILLCQPERKDLTPKALSISPVQWCIPSDHAVSKCDCSKVQVPAVKGPEEERTTLLT
TWQKYHVTLSSMENSRSPSLDLQVLCVLRAPGALVDSRHPPLTIKTTGPSLGLRFLPKPV
RCISLEVTSTKQLSLTTTLKTGQDVHPGSPFHVCTQTCTPLIITAGEDNSRHNSTAGEDK
IAGKAARL
>gi568815588r:62713210_62916029|GENSCAN_predicted_CDS_7|1287_bp
atgaaagatgaatgtgagtccatgtctcttggaacccatagtccagaaagagcctgccac
aaagggctctgttctccctttggtgtctcctcagcacagagtcaagactgctccagggca
agggttaagaacagaagtccacagactggtactggtcagtggcctgttaggaacaaggct
acacagcaggagccagatcccaaccacttggctttaacccagcagagatggagacggaaa
agacacatttccttcgttcaaggaattcgcagtggacctaatagaaggggctgtaaagtg
gtgtcctatgagctgcctccagccttcaggaagtacatgctccaatgcccagctggatcc
ctgcagaacttagaccttaccaaaggtgaaggggaccagctagctaaggaggcagggagt
acattaaaagcacctagagagagatccagagagaagaaatccaagcaagacatcaccagg
ttgaacatggatggcatgcagactgcagtgtggaggaggtcagctcaccttggggagtca
tgctgtagacaggtcaacagggcacagacccagttctgtaaacacgtggactttatgcat
gtggaaggatgctctgattggacccatcaaactgaattagctccttacagcctatcttgt
tggcagtatttcaaagctcaagaatctgctaggttttggcgtccacatgctaagcccgaa
ggggacttcagcattctgctttgtcagcctgaaaggaaggacctaactccaaaggctcta
agcataagcccagttcagtggtgtatccccagcgaccatgcagtgagcaagtgtgattgc
agtaaggtacaggtccctgcagtaaaaggtcctgaggaagagagaacaacacttctcacc
acctggcaaaaataccatgtgacactgtcctccatggaaaattccagaagcccctccctg
gaccttcaggtgctgtgtgtactgagagcacctggagcactggtggacagcaggcatcca
ccattgacaatcaagaccacgggaccttccctgggcctcaggttccttcctaaaccagtt
agatgcatcagcttagaggtcacttccacaaagcagctttctctgaccaccaccctcaaa
actggacaagatgttcaccccgggtcacccttccacgtgtgcacccaaacatgcacacca
ctcatcatcacagcaggtgaggacaacagcaggcacaactccacagcaggtgaggacaaa
atagcaggcaaagctgccagactgtag
>gi568815588r:62713210_62916029|GENSCAN_predicted_peptide_8|114_aa
MALMWQLIRYGVEQHSSNSSLWGIPVNKEPLSLADPQKWGLLIGWLVSDRKPQALSHCSC
QWRRPQEEASETGVNRTGLGHRQLHPAAKATSRGKDRGRGVSAAELLPKYGSTG
>gi568815588r:62713210_62916029|GENSCAN_predicted_CDS_8|345_bp
atggcactaatgtggcagctgatcagatatggggtagagcaacacagctccaactcctcg
ctctggggcattccagttaacaaagaacctttgtctctggcagatcctcagaaatggggc
cttctgataggctggcttgttagtgatcgaaagccccaggcactcagtcactgctcatgc
caatggagaaggcctcaggaagaagcttcagagactggagtgaaccggactggccttggt
caccggcagcttcatcctgctgcgaaggccaccagtagaggaaaagatagagggagagga
gtctctgcagcagagcttcttccaaaatatgggtccacaggctaa