GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:59:05 Sequence gi568815592r:33695503_33901163 : 205661 bp : 52.20% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 210 278 69 2 0 73 55 93 0.799 2.73 1.02 PlyA + 1041 1046 6 1.05 2.05 PlyA - 1289 1284 6 1.05 2.04 Term - 2248 2151 98 2 2 133 49 111 0.998 10.13 2.03 Intr - 5011 4942 70 1 1 41 95 82 0.852 3.45 2.02 Intr - 5918 5844 75 2 0 126 113 75 0.999 14.01 2.01 Init - 16184 16047 138 2 0 105 60 179 0.943 16.91 2.00 Prom - 21688 21649 40 -1.51 3.06 PlyA - 22321 22316 6 1.05 3.05 Term - 27685 27218 468 1 0 29 47 428 0.996 28.16 3.04 Intr - 30052 29939 114 1 0 -8 96 147 0.262 7.05 3.03 Intr - 31404 31161 244 2 1 109 90 185 0.920 18.83 3.02 Intr - 32798 32585 214 0 1 132 78 237 0.998 25.40 3.01 Init - 39974 39776 199 2 1 106 113 275 0.936 30.84 3.00 Prom - 58907 58868 40 -0.21 4.14 PlyA - 60352 60347 6 1.05 4.13 Term - 62657 62637 21 2 0 116 33 11 0.051 -3.01 4.12 Intr - 66237 65996 242 1 2 100 64 56 0.055 2.20 4.11 Intr - 75096 75006 91 0 1 116 53 8 0.029 0.17 4.10 Intr - 76409 76031 379 1 1 67 33 211 0.728 9.03 4.09 Intr - 81554 81452 103 0 1 100 117 150 0.989 18.83 4.08 Intr - 81737 81636 102 0 0 96 82 25 0.906 3.35 4.07 Intr - 82428 82207 222 1 0 84 -19 137 0.594 1.23 4.06 Intr - 82885 82740 146 0 2 95 78 184 0.999 18.54 4.05 Intr - 84677 84598 80 2 2 101 105 122 0.984 14.04 4.04 Intr - 85651 85575 77 2 2 106 97 27 0.996 5.13 4.03 Intr - 88925 88850 76 2 1 107 105 100 0.975 13.18 4.02 Intr - 91272 91232 41 1 2 87 105 47 0.870 4.73 4.01 Init - 93731 92879 853 2 1 115 60 836 0.494 76.44 4.00 Prom - 93993 93954 40 -4.11 5.06 PlyA - 94089 94084 6 1.05 5.05 Term - 98525 98416 110 0 2 76 51 52 0.384 -0.73 5.04 Intr - 99405 99272 134 2 2 92 61 20 0.429 0.50 5.03 Intr - 102760 102633 128 1 2 62 64 102 0.359 5.18 5.02 Intr - 103719 103603 117 0 0 101 105 63 0.999 10.47 5.01 Init - 105661 105545 117 1 0 72 110 210 0.963 19.80 5.00 Prom - 108540 108501 40 -0.71 6.11 PlyA - 109320 109315 6 1.05 6.10 Term - 110674 110592 83 2 2 88 54 33 0.032 -1.95 6.09 Intr - 112447 112399 49 1 1 108 40 54 0.058 1.34 6.08 Intr - 114377 114225 153 2 0 64 25 113 0.168 3.38 6.07 Intr - 114516 114484 33 0 0 76 64 49 0.356 0.20 6.06 Intr - 119793 119683 111 0 0 65 110 35 0.377 4.48 6.05 Intr - 124984 124843 142 0 1 67 31 140 0.236 7.06 6.04 Intr - 135589 135486 104 1 2 6 94 113 0.060 3.17 6.03 Intr - 136161 136045 117 0 0 44 43 87 0.273 0.97 6.02 Intr - 138420 138229 192 0 0 54 68 81 0.100 2.91 6.01 Init - 147167 147111 57 2 0 72 77 34 0.225 1.96 6.00 Prom - 149080 149041 40 -3.91 7.00 Prom + 150779 150818 40 -3.61 7.01 Init + 150934 151033 100 0 1 37 94 82 0.805 4.08 7.02 Intr + 153311 153522 212 0 2 62 64 109 0.527 5.06 7.03 Intr + 164297 164426 130 1 1 61 81 -8 0.243 -3.43 7.04 Intr + 165678 165808 131 1 2 52 103 74 0.634 6.22 7.05 Intr + 165814 165914 101 0 2 -4 87 138 0.534 3.91 7.06 Term + 171081 171087 7 0 1 135 54 0 0.042 -1.08 7.07 PlyA + 171480 171485 6 1.05 8.08 PlyA - 171612 171607 6 -0.45 8.07 Term - 172324 172025 300 1 0 118 38 94 0.601 2.96 8.06 Intr - 174837 174716 122 1 2 64 68 84 0.192 4.72 8.05 Intr - 180878 180756 123 0 0 129 13 79 0.238 5.56 8.04 Intr - 186968 186884 85 2 1 67 -2 111 0.001 -0.11 8.03 Intr - 190312 190222 91 0 1 102 68 47 0.178 4.60 8.02 Intr - 191652 191367 286 1 1 0 52 164 0.131 0.94 8.01 Intr - 205093 205045 49 1 1 80 109 70 0.573 7.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:33695503_33901163|GENSCAN_predicted_peptide_1|22_aa MTEQRKRRQRLGFVDVQNCISR >gi568815592r:33695503_33901163|GENSCAN_predicted_CDS_1|69_bp atgacggagcagcggaaacgcaggcaacgcctaggctttgtggatgtccagaactgcatt agccgctga >gi568815592r:33695503_33901163|GENSCAN_predicted_peptide_2|126_aa MAASRYRRFLKLCEEWPVDETKRGRDLGAYLRQRVAQAFREGENTQVAEPEACDQMYESL ARLHSNYYKHKYPRPRDTSFSGLSLEEYKLILSTDTLEELKEIDKGMWKKLQEKFAPKGP EEDHKA >gi568815592r:33695503_33901163|GENSCAN_predicted_CDS_2|381_bp atggcggccagccggtaccggcgttttcttaagctctgtgaggaatggccagtggacgag accaaacggggccgggacttgggcgcttacctgcgacagcgggtagcacaggcctttcgg gagggagagaatacccaggttgcagagcctgaggcctgtgatcagatgtacgagagctta gcgcgactccattcaaactactacaaacacaagtaccctcgccccagagacaccagcttc agtggcctgtcgttggaagagtacaagctgatcctgtccacagacaccttggaagagctt aaggaaatagataaaggcatgtggaagaaactgcaggagaagtttgcccccaagggtcct gaggaggatcataaggcctga >gi568815592r:33695503_33901163|GENSCAN_predicted_peptide_3|412_aa MVVQNSADAGDMRAGVQLEPFLHQVGGHMSVMKYDEHTVCKPLVSREQRFYESLPLAMKR FTPQYKGTVTVHLWKDSTGHLSLVANPVKESQEPFKVSTESAAVAIWQTLQQTTGSNGSD CTLAQWPHAQLARSPKESPAKALLRSEPHLNTPAFSLVEDTNGNQVERKSFNPWGLQCHQ AHLTRLCSEYPENKRHRILAIPLTCGPKGGATERHRGAEMGTRQHGDDASEEKKARHMRK CAQSTSACLGVRICGMQVYQTDKKYFLCKDKYYGRKLSVEGFRQALYQFLHNGSHLRREL LEPILHQLRALLSVIRSQSSYRFYSSSLLVIYDGQEPPERAPGSPHPHEAPQAAHGSSPG GLTKVDIRMIDFAHTTYKGYWNEHTTYDGPDPGYIFGLENLIRILQDIQEGE >gi568815592r:33695503_33901163|GENSCAN_predicted_CDS_3|1239_bp atggttgtgcaaaacagcgcagacgccggggacatgagggcaggcgtgcagctggagccc ttcctgcaccaggtcggggggcacatgagcgtgatgaagtatgacgagcatacggtgtgc aagcccctcgtctcccgggagcagaggttctatgaatccctgccgctggccatgaagcgg ttcaccccacagtacaaaggtaccgtcacagtgcacctctggaaagacagcacaggccat ctcagcttggttgccaacccagtgaaggagagccaggagcccttcaaggtctccacagag tcggcggcggtggccatatggcagacgctccagcagaccaccggcagcaatggcagcgac tgcacccttgcccagtggccgcatgcccagctggcacgctcacccaaggagagcccggcc aaggctcttctgaggtccgagccccacctcaacactccagccttctcgctggtggaagac accaacggaaaccaggttgagaggaagagcttcaacccgtggggcctgcaatgccaccag gcccacctgacccgcctgtgctccgagtacccagagaacaagcggcatcgtatccttgcc atccccctcacatgtggtcccaagggcggggcgacagagagacacagaggggcagagatg gggacccggcagcacggcgatgatgcatcggaggagaagaaggcccgccacatgaggaag tgtgcgcagagcacctcagcctgcctgggtgtgcgcatctgcggcatgcaggtttatcaa acagataagaagtactttctctgcaaagacaagtactatggaagaaaactctcagtggag gggttcagacaagccctctatcagttcctacataatggaagccacctccggagggagctc ctggagcccatcctgcaccagctccgggccctcctctctgtcattaggagccagagttca taccgcttctattccagctctctccttgtcatctatgatgggcaggaaccaccagaaaga gccccaggcagcccgcatcctcacgaggctccccaggcagcccacggtagctctcccggt ggtctcaccaaggttgacatccgcatgattgactttgctcataccacatacaagggctac tggaatgagcacaccacctacgatggaccagaccctggctatatttttggcctggaaaac ctcatcaggatcctgcaggatatccaagagggagaatga >gi568815592r:33695503_33901163|GENSCAN_predicted_peptide_4|810_aa MATDLPIMARGPARSAAPAGGSSSGCGARQGRAGGGVLAMAGLSDLELRRELQALGFQPG PITDTTRDVYRNKLRRLRGEARLRDEERLREEARPRGEERLREEARLREDAPLRARPAAA SPRAEPWLSQPASGSAYATPGAYGDIRPSAASWVGSRGLAYPARPAQLRRRASVRGSSEE DEDARTPDRATQGPGLAARRWWAASPAPARLPSSLLGPDPRPGLRATRAGPAGAARARPE VGRRLERWLSRLLLWASLGLLLVFLGILWVKMGKPSAPQEAEDNMKLLPVDCERKTDEFC QAKQKAALLELLHELYNFLAIQAGNFECGNPENLKSKCIPVMEAQEYIANVTSSSSAKFE AALTWILSSNKDVGIWLKGEDQSELVTTVDKVVCLESAHPRMGVGCRLSRALLTAVTNVL IFFWWIPVWTSFEQEAQQGLGLLLLPEKYKHLLESYRYLMKARRTQGAALKGGHQSYPLT SGAAFEQRPQGVSEGERPSLAFLWGLLILLKYRWRKLEEEEQAMYEMVKKIIDVVQDHYV DWEQDMERYPYVGILHVRDSLIPPQSRSCDSRPGLQPERRFRELGSPASGPRASHLLCAK IALLYCDHSPDAGLFNKNFSAAGVSVTPAAALGGGRGAVAAGCPLPATREQRPRRRPPAL PPAPQPAPWEAGSRSTPTGGLIKCEKPLQRVFPRPQGQSPQEGSQSCHAPWGLWAVVVTF PVHSLQSMGPETRLPWREGDRQSMKLSPTLERRGEPDTSLQPALVHKAPRAYYWGPQQKM RGLLLHLTLGGSHSERAAASPATEVVTYLR >gi568815592r:33695503_33901163|GENSCAN_predicted_CDS_4|2433_bp atggcgaccgaccttcccatcatggcgcgtggccccgcccgctccgccgcgcctgcggga gggagcagttccgggtgcggtgcgcgccaggggcgggcggggggcggcgtcctggccatg gccggcctgtcggacctggaactgcggcgggagctgcaggccctgggcttccagccagga cccatcaccgacaccacccgggatgtctaccgcaacaagctgcgccgcctgcggggcgag gcccggctgcgcgacgaggagcggctgcgggaggaggcccggccgcggggcgaggagcgg ttacgggaagaggcccggttacgcgaggatgcgccgctgcgcgcccggcccgccgcggcc tctccgcgggcggagccctggctctcccagccggcctcgggctcggcctacgcgacccct ggggcctacggtgatatccggccctccgcggcttcctgggtagggagccgcggcctcgcc tatcctgcccgcccggcgcaactcaggcgccgcgcctcggtccggggcagctccgaggag gacgaggacgcccggacgcccgacagggccacgcagggcccgggtctcgcggcccgccgc tggtgggcagcgtctcccgccccggcgcggctgccttcctccctcctcggtcccgacccg cgcccgggcctgcgggcgactcgagcgggccctgctggcgcggcgagggcccggcctgag gtggggcgccggctggagcgctggctctctcggcttctgctctgggccagcctagggcta ctgctcgtcttcctgggcatcctttgggtgaagatgggcaagccctcagcgccgcaggag gcggaggacaacatgaagttattgccagtggactgtgagagaaaaacagatgagttctgt caggccaagcagaaggcagccttgctggagctgctgcatgaactctacaatttcctggcc atccaagctggtaattttgagtgtggaaatccagagaatctaaaaagcaaatgcattcct gttatggaagcccaagaatatatagccaatgtgaccagcagctcctccgccaagtttgaa gccgcactgacctggatactgagcagtaacaaggacgtgggcatctggttgaaaggagaa gaccagtctgaattggtgacgactgtggacaaggtggtctgcctggaatctgcccacccc cgcatgggtgttggctgccgcctgagccgggccttgctcactgctgtcaccaacgtgctc atcttcttctggtggatcccagtgtggaccagcttcgagcaggaggcacagcaggggctt ggcctacttttactccctgaaaaatataaacatctcctggagagctacagatacctgatg aaagcaagaaggactcagggtgctgcactgaaaggagggcaccagtcttatcccctgaca tcgggggccgcctttgagcagagacctcagggcgtctctgaaggcgaaaggccaagcttg gcttttttgtgggggctcctaattctcctaaaatatcggtggcgaaagttagaagaggag gaacaagccatgtatgagatggtgaagaagattatagacgtggtccaggaccattacgtg gactgggagcaggacatggagcgctatccatatgtaggcatcctgcacgtgcgcgacagc ttgatccctccacagagccgaagctgcgactctagaccaggcctgcagccagaacgccga ttccgggagcttgggagccctgcgtcagggcccagagcctcgcacttgctgtgtgcgaag atcgccttgctttactgtgaccacagccccgacgcggggctgtttaacaagaacttcagc gcagccggcgtttctgttaccccggccgcggctcttggcggcgggagaggcgcagtggct gcaggctgccccctgccggccacaagggagcagcgtccgcgccgccgcccaccggccctc ccgccagctcctcagcctgctccctgggaggcaggctccagaagcaccccgacggggggc cttattaagtgcgagaagcctctccagcgagtcttccccaggccccaaggtcagtcaccc caagaaggcagccagtcctgccatgccccatggggcttgtgggctgttgtggtcaccttt cccgttcacagcctgcagagcatggggcctgagacaaggctgccctggagagaaggtgac aggcagagtatgaaactgtcccccactctggagaggaggggagagcctgatacatccctg cagcctgccctggtgcacaaagcccccagagcctactactgggggccccagcagaagatg aggggcttattgctgcatctgactcttggaggctcccattcagagagagcagctgcatcc ccagcgactgaggtagtcacatacctccggtaa >gi568815592r:33695503_33901163|GENSCAN_predicted_peptide_5|201_aa MVSRKAVAALLVVHVAAMLASQTEAFVPIFTYGELQRMQEKERNKGQKKSLSVWQRSGEE GPVDPAEPIREEENEMIKERRNKILGRLKQTTNKVKTGSKLSSVAGNDMCYGGKQSRKET RQVLEGSFISRPDDGTERKPLLFSSSQVMATLGRRWTDLGGPSCPSYQQLDTVSILHFHP GLATESSYQNKNSTAEQGTLI >gi568815592r:33695503_33901163|GENSCAN_predicted_CDS_5|606_bp atggtatcccgtaaggctgtggctgctctgctggtggtgcatgtagctgccatgctggcc tcccagacggaagccttcgtccccatcttcacctatggcgaactccagaggatgcaggaa aaggaacggaataaagggcaaaagaaatccctgagtgtatggcagaggtctggggaggaa ggtcctgtagaccctgcggagcccatcagggaagaagaaaacgaaatgatcaaggaaaga agaaacaagatcctaggacgactgaaacagaccacaaacaaggtaaagacaggaagcaaa ctcagcagtgtggcaggcaatgacatgtgctatggagggaaacagagccgcaaggagacg cggcaagttttagagggcagtttcataagcagacctgatgatggtactgagcgcaaacct ctgcttttctccagcagccaagtgatggccacgctggggagaaggtggacagatttggga ggcccctcctgcccaagttaccagcaattagatacagtgtccattctgcatttccaccca ggcttagccacagaaagtagttaccagaacaagaactccactgctgaacaaggcaccctc atttga >gi568815592r:33695503_33901163|GENSCAN_predicted_peptide_6|346_aa MMGEPPPQFTRSNSSTGPLETPSEMETRANTDGQGNCFGDNKKGWLGMYVGFTKEGFLEE VEKEAISRQLRPRICGKREKMQWQLYCLLKVPEPTCLGSIGIPAPAMVLPPGVPAAFSPA LWEETKDNHEEEELSLAQSAIHVPPWLESGHWGPEGSPPPGEPEATAVHSPVAAVSRHAQ PHLARELSKISYIARQLNTACRLNGQPGCVPPVEHPLLLAILRKEMGGSEVMTGGSRFGK QSLVPPYEKHPQSGLVEGDPCDCHPQLSTAEAACGPLFPFAAAGSPPNPKVISPADVGKS TTETVNEDVMAVPGKQYFQGARDAGVMCPITAARAYGWSSPEHPRA >gi568815592r:33695503_33901163|GENSCAN_predicted_CDS_6|1041_bp atgatgggagagccacctccccagttcactcgcagcaacagcagcactgggcctttggaa actcccagtgagatggagacacgggcaaatacagatggccaaggaaattgttttggtgac aacaagaaagggtggcttggaatgtatgttggcttcaccaaggaaggcttcctggaggag gtggagaaagaggcaatctcaagacagcttaggccgaggatttgtggcaagagagagaag atgcagtggcagctgtattgcctgttaaaagttcctgagcccacctgcctgggctctatc ggcatccctgcccctgccatggtcctgccccctggtgtacctgctgccttctccccagct ctatgggaagagaccaaggataaccacgaggaagaggagctcagcttggcgcagtcagcc attcacgtccccccgtggctggagagtgggcactggggtcctgaagggagtccacctcct ggggagcctgaggccactgctgttcattccccagtggctgctgtcagccggcacgcccag ccccatctggctcgagaactgtcaaagatctcctacatcgcacgccagctgaacacggcc tgcagactgaacggtcaaccaggatgtgtgccccctgtggaacaccccctgcttctggct atcttaagaaaagaaatggggggctcagaagtgatgacaggaggctctagatttggaaaa cagtctctcgttccaccttacgagaaacacccacagagtgggctggttgaaggagaccca tgtgactgccacccacagctgtcaacggcagaggctgcatgtggcccgctgtttccattt gcagctgctggcagccccccaaatccaaaagtcatcagcccagcagatgtggggaagtca acaactgagaccgttaatgaagatgttatggcggtacctggcaaacaatattttcaaggt gctagagatgcaggtgtgatgtgtcccatcacagcagcccgtgcttatggatggagctcc ccagagcatcccagagcttga >gi568815592r:33695503_33901163|GENSCAN_predicted_peptide_7|226_aa MVSLHTHKHGWDEKRDAPSADEDVHQLELSDTAGNLDCYCSSFWGLSTVSITPGAPSLAP LPGPVPRGHSSFFSLQGYSKLWMFPHRPQGLARYTCVAHQSVPEGRDTEPRSTKGPERRA PEQAAGARVAAGTVAWQGFQCQSLRDGGAILEALMQKWVAKRAAASSAIVKLQSLEASCS CWLQNYILPATETAESDNYLLNRKVAATVAGSKTMLLLPLSTPENS >gi568815592r:33695503_33901163|GENSCAN_predicted_CDS_7|681_bp atggtatcactccacacccacaagcatggctgggatgaaaagagagatgcgcccagtgct gacgaggatgtgcatcaactggaactctcagacactgcagggaacctcgactgctactgc tcctccttctggggcctgagcactgtctccatcaccccaggggccccttctttggccccg ctgccgggtcctgtgccccggggtcactcttcattcttcagccttcaaggctactccaag ctttggatgtttccacacaggcctcagggccttgccagatacacctgcgtggcccaccag agtgttccagagggcagagacacagagccacgttccaccaagggacccgagagaagggcc cctgagcaggctgctggggcgagggtggctgcaggcacagtggcgtggcaggggtttcag tgtcaaagcctccgtgacggcggagcaattcttgaagccttgatgcagaagtgggttgcc aagagagctgctgcctcttctgcaatcgtgaagctgcagagtctagaagcttcctgcagc tgttggctccagaattacatcctgcctgccacggaaactgctgagtcagataactacctg ctgaatcgaaaagttgctgctaccgttgctggttccaaaaccatgctgcttctgccactg tctacacctgaaaattcttga >gi568815592r:33695503_33901163|GENSCAN_predicted_peptide_8|351_aa GSQVIQTNCPMNEGYTGWTNHIKREPTTLKNVSSPPESCLNWRRRSQEVVTPLQKLGLQA CANVRFPVPSLLWWKLLEGRHYVSRFFAVTFQLSALMFLHPTGADGSSPAAGVVTHESLK FMEPMGCPCSEEPSWVSTRPGQFCKGDLIQVLPMLAVGARLMALQNGAFGVIFLASMPNA NHSCSEYFPDQPSEKQNPFALCRFFIPDHISGPPGHPAAFSLHREAQVGAEGHVQELLSC QPFSASRDSHLQVGCDRCPWPRQWSTQASSSMLGKGKPPILVMEVAMTMSGFQRQLCKNR NQQLPNIWPYLLCKPQGPGPFRGASGILWNQLLSGISDAIPGQLPLLCAIL >gi568815592r:33695503_33901163|GENSCAN_predicted_CDS_8|1056_bp ggcagccaagtcatccaaaccaactgcccaatgaacgagggctacacaggttggaccaac cacatcaagagagagccaactacgctgaagaacgtgtcatcacctcctgagagctgcctg aattggaggcgcaggtcccaggaagtagtgactcctcttcagaagctgggtctgcaagcc tgtgcaaatgtgaggtttccggttccctccctcctctggtggaagctgctagagggcaga cactacgtatctcgattctttgctgtcacctttcagctgtcagctctgatgtttcttcat ccaacgggtgctgatggaagctccccagctgctggggtggtcacccatgagtcactgaag tttatggagcccatgggctgtccctgctctgaggagcccagctgggtctccacaaggcct ggccagttctgcaaaggagacctcatccaggtgctccccatgctggctgtgggcgccagg ctgatggccctccagaacggtgcctttggagtgatctttttagcttcaatgcccaatgct aatcactcatgctctgagtacttcccagatcaaccttctgagaagcagaacccatttgcc ctgtgtcgtttcttcatcccagaccacatctcaggaccccctggccatcctgcagctttc tccctgcacagggaagcacaggtaggagccgagggccacgtccaagagctgctcagctgc cagcccttctcagcttcccgcgactcccacctccaggttggctgtgataggtgtccctgg ccacgtcagtggagcacccaggcctcaagcagcatgctggggaagggaaaaccccccatt ctggtgatggaggtggccatgaccatgtctggttttcaaaggcagctttgtaagaacaga aaccagcagctccccaacatctggccctatctgctgtgcaaaccacagggaccaggacct ttccggggtgccagtggcatcctctggaaccagcttctcagtggaatttcagatgctatt cctggtcaattgcctttgctatgtgctatcctttaa