GENSCAN 1.0	Date run: 7-Nov-116	Time: 02:52:08

Sequence gi568815597f:207221766_207459607 : 237842 bp : 38.21% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   6536   6737  202  2  1   62  106    39 0.213   1.47
 1.02 Intr +  10676  10837  162  1  0   97   47    98 0.847   5.85
 1.03 Term +  22141  22347  207  0  0   59   41   135 0.623   2.26
 1.04 PlyA +  23339  23344    6                               1.05

 2.06 PlyA -  23501  23496    6                               1.05
 2.05 Term -  34021  33382  640  0  1   43   42   251 0.435   8.69
 2.04 Intr -  34347  34149  199  1  1   52    2   141 0.348  -0.61
 2.03 Intr -  36821  36757   65  1  2   87   97    57 0.204   3.94
 2.02 Intr -  45844  45719  126  0  0   65   62    78 0.239   1.77
 2.01 Init -  51563  51490   74  2  2   75   71    42 0.225   1.79
 2.00 Prom -  52268  52229   40                              -6.35

 3.03 PlyA -  52298  52293    6                               1.05
 3.02 Term -  58474  58290  185  2  2  110   44   114 0.918   5.92
 3.01 Init -  59690  59612   79  2  1   91   87    25 0.395   3.97
 3.00 Prom -  62081  62042   40                              -4.55

 4.03 PlyA -  62481  62476    6                               1.05
 4.02 Term -  70414  70112  303  1  0   22   48   144 0.084  -2.01
 4.01 Init -  72749  72693   57  2  0   92  101    31 0.279   6.16
 4.00 Prom -  76372  76333   40                              -3.95

 5.00 Prom +  84893  84932   40                              -4.05
 5.01 Init +  85127  85296  170  1  2   44   81    88 0.590   2.75
 5.02 Intr +  86702  86806  105  2  0   97   86    88 0.367   7.91
 5.03 Term +  96988  97006   19  1  1   90   38    13 0.023  -6.69
 5.04 PlyA +  97515  97520    6                               1.05

 6.00 Prom +  98941  98980   40                              -3.95
 6.01 Init + 100001 100100  100  1  1   98   84   254 0.999  24.47
 6.02 Intr + 100617 100802  186  1  0  127   97   240 0.998  27.54
 6.03 Intr + 102794 102985  192  0  0   62   87   164 0.988  12.24
 6.04 Intr + 103857 103956  100  1  1   74  110    51 0.968   4.15
 6.05 Intr + 104987 105072   86  2  2  105   94    25 0.963   3.34
 6.06 Intr + 109343 109531  189  0  0   17   74   155 0.686   5.64
 6.07 Intr + 114928 115053  126  2  0  113   87    86 0.993  10.73
 6.08 Intr + 115264 115485  222  2  0   11  113   171 0.370   9.08
 6.09 Intr + 115513 115644  132  2  0   52   82    55 0.434   1.00
 6.10 Intr + 117632 117652   21  0  0  122  119    17 0.663   4.80
 6.11 Intr + 142859 142920   62  0  2  118   67    33 0.059   1.73
 6.12 Intr + 144822 144948  127  2  1   39   69   114 0.350   3.93
 6.13 Intr + 148769 148966  198  0  0   40   79   155 0.335   8.20
 6.14 Intr + 149047 149109   63  2  0   52   84    94 0.631   3.17
 6.15 Intr + 149606 149731  126  0  0   53   70    56 0.407   0.03
 6.16 Term + 185622 185728  107  1  2  -55   33   315 0.893   9.29
 6.17 PlyA + 185911 185916    6                               1.05

 7.06 PlyA - 185965 185960    6                               1.05
 7.05 Term - 193043 192767  277  1  1   36   44   130 0.085  -2.65
 7.04 Intr - 194100 193979  122  0  2   30   86    98 0.185   2.17
 7.03 Intr - 201499 201218  282  1  0   36   44   169 0.080   3.99
 7.02 Intr - 202192 202055  138  1  0    9   94   128 0.493   5.24
 7.01 Init - 203208 203077  132  0  0   71   12   146 0.501   5.49
 7.00 Prom - 229966 229927   40                              -3.25

 8.03 PlyA - 230666 230661    6                               1.05
 8.02 Term - 233086 232533  554  2  2   48   36   256 0.421   9.89
 8.01 Init - 234553 234412  142  1  1   59   89   125 0.524  10.04

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 124938 124814  125  1  2   26   54   189 0.887   6.87

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:207221766_207459607|GENSCAN_predicted_peptide_1|190_aa
XSVSETRDRKWKAVWVRICMLNLKMSPSSVTLAIKWLVLKISLWLVLKISLDQSTEHGRH
PEVPSCEWRLGYNGHGVLHESQCLCNDSIYSIDKRQEEALMDADIRCLGKSKDTEIWEDI
IKFAIRFPGLHTDTMKILVNSSLSTLMQCECKQGYALTGAANICRFAGCSSLASQRNGNP
NSPAKIVLSS

>gi568815597f:207221766_207459607|GENSCAN_predicted_CDS_1|573_bp
nnctctgtgtctgaaaccagagacagaaaatggaaagctgtctgggtaaggatctgtatg
ttgaacttgaaaatgtcaccatccagtgtgactctggctataaagtggttggtcctcaaa
atatcactttggttggtcctcaaaatatcacttgatcagagcacagaacatggcaggcac
ccagaggtgcccagttgtgagtggagacttggctataacgggcatggggtcttgcatgaa
tcccaatgtctatgcaatgatagtatttatagtattgacaaacgtcaagaggaggctctg
atggatgcggacatcagatgcttgggtaaatcaaaagatacagaaatatgggaggacatt
atcaagtttgcaattcgcttccccggattacacactgacactatgaaaatattagtgaat
tcctcactcagcactctaatgcagtgtgaatgcaagcaaggatatgctctgactggagca
gctaacatctgcaggtttgcaggttgttcatctctagcttctcaacgcaacgggaatcca
aactccccagccaaaatagtgctgagcagctaa

>gi568815597f:207221766_207459607|GENSCAN_predicted_peptide_2|367_aa
MENTLEVPQKSKNRTTIPSSNPTPGKCSEDLPERSITYLQEAEVFGSVHWFGVVSRMKTI
LYKMGVLNWQTSTSIQTIHENMVLQNELNLWNFEFERDDLGYLAEEISKEQGIQEEAEHK
SLENLQPDNAIGKKNKKQNDIFWGEIQASCRHLLNFGTWGTASKLLQLQLWLKGANVQLG
PLLQRVQAPSLGNFHVVVEPVGAQKSRIEVWEPSPRFQRLYGNHLDVQAEVCYRAEHSWK
TSTRALWKGNVWLEPLHRVPTGALHSRAVRRGPLFSRPQNSRSTDRLCYVPGKATGTQHQ
PIKATRRGAVPCKATGVELPKAMGAHLLHQHDLNVRYRVKRGHFGTLRFNDFPIGFQTCV
GPITPWF

>gi568815597f:207221766_207459607|GENSCAN_predicted_CDS_2|1104_bp
atggagaacactttagaggttcctcaaaaatctaaaaatagaaccaccataccatccagc
aatcctactcctggaaaatgctctgaagatcttccagaaagatctatcacatatttacaa
gaagctgaagtttttggcagtgttcattggtttggtgtggtgtcaaggatgaaaacaata
ctttataaaatgggtgtgttaaattggcaaacatccacaagcatccagaccatccatgaa
aacatggttttacaaaatgaactaaatctgtggaactttgaatttgagagagatgattta
gggtatctggcagaagaaatttctaaggagcaaggcattcaagaggaagcagagcataaa
agtttggaaaatttgcagcctgacaatgcaataggaaaaaaaaacaaaaaacaaaacgac
attttctggggagaaattcaagccagctgcagacatttgcttaactttgggacatggggc
actgcgtccaaacttcttcagctccagctgtggctgaaaggagccaatgtacagcttggg
ccattgcttcagagggtgcaagccccaagccttggcaacttccacgtggttgttgagcct
gtaggtgcacagaagtcaagaattgaggtttgggaaccttcacctagatttcagaggttg
tatggaaaccacctagatgtccaggcagaagtttgctacagggcagagcactcatggaaa
acctctactagggcactgtggaagggaaatgtgtggttggagcccctacacagagtcccc
actggggcactgcatagtagagctgtgagaagagggccattgttctccagaccccagaat
agtagatccactgacagattgtgctatgtgcctggaaaagccacaggcactcaacaccag
cccatcaaagcaacaaggaggggagctgtaccctgcaaagccacaggagtggagctgccc
aaggccatgggagcccacctcttgcatcagcatgacctgaatgtgagatatagagttaaa
agaggtcattttggaactttaaggtttaatgacttccctattggatttcagacttgtgtg
gggcctataaccccttggttttag

>gi568815597f:207221766_207459607|GENSCAN_predicted_peptide_3|87_aa
MEVPSEEVALKLRCEGWVGINQVREESSSHMVCGGGRAQLLWDFALKLRAALLQQKATQG
EIQLVLMGEAFRQALARGDSSIPVARP

>gi568815597f:207221766_207459607|GENSCAN_predicted_CDS_3|264_bp
atggaagtgccctctgaggaagtggcattgaagctaaggtgtgagggatgggtaggaatt
aaccaggtgagagaagaaagcagcagccacatggtgtgtggaggagggagagcacagtta
ctgtgggattttgcattaaaactcagagctgccctgttacagcagaaggcaacacagggt
gaaattcagctggtgctcatgggggaagcatttagacaagccctagccagaggggactct
tccatcccagtggccagaccttga

>gi568815597f:207221766_207459607|GENSCAN_predicted_peptide_4|119_aa
MGKYISQGYKVSGTLVGGNLCLCGFAGYNLPPGCFHGLVLNVCSFSRLSIQAVGGSTILG
SSRLWPSSHSSIGGAPVGTLCGGSDPTFPFDTALAEVLHESPTPAANFCLDIQALPYIL

>gi568815597f:207221766_207459607|GENSCAN_predicted_CDS_4|360_bp
atggggaagtatattagtcaagggtacaaagtttcaggtactctggttggtggaaatctc
tgcctctgtggctttgcagggtacaacctccctcctggctgctttcatgggctggtattg
aatgtctgcagcttttccaggctctcaatacaagctgtcggtggatctaccattctgggg
tctagcagactgtggccctcttctcacagctccataggtggtgctccagtaggaactctg
tgtgggggctctgaccccacatttcccttcgacactgccctagcagaggttctccatgag
agccccacccctgcagcaaacttctgcctggacatccaggcacttccatacatcctctga

>gi568815597f:207221766_207459607|GENSCAN_predicted_peptide_5|97_aa
MVLLLIEVLFDKEGFCHFSYPEKRQGALNGKIKIRVVLNSRKTDSERRRVGKANIEWKLL
ENTHLRNAEKYKKRKTRVQEARHPTIITKEARMLKFG

>gi568815597f:207221766_207459607|GENSCAN_predicted_CDS_5|294_bp
atggttttgcttctaattgaagtgctgtttgacaaagaaggtttttgccatttctcctat
ccagagaagaggcaaggggcattaaatggaaaaatcaaaatcagagtagttctcaactct
aggaaaactgacagtgaaagaagaagggtaggaaaggcaaacattgaatggaagctattg
gagaatacacaccttcgaaatgcggagaaatataagaaaaggaagacaagagttcaagaa
gccaggcatccaacaataataactaaagaggcaagaatgttgaaatttggctga

>gi568815597f:207221766_207459607|GENSCAN_predicted_peptide_6|678_aa
MTVARPSVPAALPLLGELPRLLLLVLLCLPAVWGDCGLPPDVPNAQPALEGRTSFPEDTV
ITYKCEESFVKIPGEKDSVICLKGSQWSDIEEFCNRSCEVPTRLNSASLKQPYITQNYFP
VGTVVEYECRPGYRREPSLSPKLTCLQNLKWSTAVEFCKKKSCPNPGEIRNGQIDVPGGI
LFGATISFSCNTGYKLFGSTSSFCLISGSSVQWSDPLPECREIYCPAPPQIDNGIIQGER
DHYGYRQSVTYACNKGFTMIGEHSIYCTVNNDEGEWSGPPPECRGKSLTSKVPPTVQKPT
TVNVPTTEVSPTSQKTTTKTTTPNAQGTLPTLQKPTRANDSATKSPAAAQTSFISKTLST
KTPSAAQNPMMTNASATQATLTAQKFTTAKVAFTQSPSAALTNGLKSTQRFPSAHITATR
STPVSRTTKHFHETTPNKGSGTTSGTTRLLSGTTPFKGYQLRSHYVYGPFLKGLRPKDQG
CWCADANPGVQRIENQELWCLRAVEDGCSSSRREIINFMGIKICGSVSGIAGFLVSLTSR
MKPQTLTVSVTVLKDDVSGVSSYWWVCGLADFRIEAADLHRMKLQTFVVSVTAHKGGADP
KELPASRTLRRGHSSALGQSMGPGVAEQGAAPIWEAQATQEPTEEEEEEEEEEEEEEEEE
EEEEEEEKEKKKKQNPKS

>gi568815597f:207221766_207459607|GENSCAN_predicted_CDS_6|2037_bp
atgaccgtcgcgcggccgagcgtgcccgcggcgctgcccctcctcggggagctgccccgg
ctgctgctgctggtgctgttgtgcctgccggccgtgtggggtgactgtggccttccccca
gatgtacctaatgcccagccagctttggaaggccgtacaagttttcccgaggatactgta
ataacgtacaaatgtgaagaaagctttgtgaaaattcctggcgagaaggactcagtgatc
tgccttaagggcagtcaatggtcagatattgaagagttctgcaatcgtagctgcgaggtg
ccaacaaggctaaattctgcatccctcaaacagccttatatcactcagaattattttcca
gtcggtactgttgtggaatatgagtgccgtccaggttacagaagagaaccttctctatca
ccaaaactaacttgccttcagaatttaaaatggtccacagcagtcgaattttgtaaaaag
aaatcatgccctaatccgggagaaatacgaaatggtcagattgatgtaccaggtggcata
ttatttggtgcaaccatctccttctcatgtaacacagggtacaaattatttggctcgact
tctagtttttgtcttatttcaggcagctctgtccagtggagtgacccgttgccagagtgc
agagaaatttattgtccagcaccaccacaaattgacaatggaataattcaaggggaacgt
gaccattatggatatagacagtctgtaacgtatgcatgtaataaaggattcaccatgatt
ggagagcactctatttattgtactgtgaataatgatgaaggagagtggagtggcccacca
cctgaatgcagaggaaaatctctaacttccaaggtcccaccaacagttcagaaacctacc
acagtaaatgttccaactacagaagtctcaccaacttctcagaaaaccaccacaaaaacc
accacaccaaatgctcaaggaaccctaccaactcttcagaaacccaccagagcaaatgat
tcagccaccaaatccccagcagcagctcagacatctttcatatcaaaaaccctatctaca
aagaccccttctgcagctcagaatcccatgatgacaaatgcttctgctacacaggccaca
ctaacagcccaaaaattcaccacagcaaaagttgcatttacgcagagtccttcagcagca
ctgactaatggtctcaagagtacacaaagattcccttctgctcatattacagcaacacgg
agtacacctgtttccaggacaaccaagcattttcatgaaacaaccccaaataaaggaagt
ggaaccacttcaggtactacccgtcttctatctggaactacacctttcaaaggctatcaa
ctcagatcacattatgtttatggccctttcctgaagggcttaaggcccaaggaccagggc
tgctggtgtgcagatgcaaatcctggagttcaaaggattgagaaccaggagctctggtgt
ctgagggcagtagaagatggatgttccagctcaagaagggaaatcatcaactttatggga
ataaaaatctgcggtagtgtgtctggaattgctgggttcttggtctcgctgacttcaaga
atgaagccgcagaccctcacggtgagtgttacagttcttaaagatgatgtgtctggagtt
tcttcttactggtgggtttgtggtctcgctgacttcaggattgaagctgcagaccttcac
agaatgaagctgcagaccttcgtggtgagtgttacagctcataagggtggtgcggaccca
aaggagctgcccgccagtcgcacgctgcggcgtgggcactcctcagcccttgggcagtcg
atgggaccaggcgtggcagagcagggggcagcacccatctgggaggctcaggccacgcag
gagcccaccgaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gaagaagaagaagaagaagaaaaagaaaaaaagaagaagcaaaacccaaagagctaa

>gi568815597f:207221766_207459607|GENSCAN_predicted_peptide_7|316_aa
MRKAVDRSSPSVEKALKQLEAHSTKKERTFDGRVRWAFLTALRENNAGEIPKTVSKRQSC
TDLVQILWEMRQAVFDLQTQGPNNERFTSQWVLALVDTGAHCCLLYGNPDKFPGKAAFTH
GYGGQSVKVKPAFLHLGIGCLAPHLYTVYVSPIPEYILGVNILHGLDLYTMAGEFRLPVH
VAKLAGVWWHDHSVAGQATDKTSQTRPNKAKSFLNPLSESSVFETSCMTSEGNGSMLQLE
SRTDRSWVPLPYENVSCLTCGVEATETCQLGKCQTAENNMSVITFVDPRAQAPTLCISPL
DFDTQPSVGCAGQKKE

>gi568815597f:207221766_207459607|GENSCAN_predicted_CDS_7|951_bp
atgaggaaggcagtggataggtcttccccgagtgtggagaaagcactaaagcagctggaa
gcacacagcaccaagaaggaacgcaccttcgatggcagagtcagatgggcgtttctgact
gcactgagggaaaataatgccggagagataccaaaaactgtgagtaaacggcaatcgtgt
actgatttggtgcagatactctgggagatgcggcaggctgtgtttgatctgcaaacccag
gggccaaataatgaacgctttacctctcagtgggtcctagcactggtagatactggtgca
cactgctgtcttctttatgggaacccggataagtttccgggcaaagctgctttcactcac
ggttatggtggccagtcggtgaaggtgaaacctgcgttcctgcatcttggcattggctgc
ttggccccccacctgtacactgtgtatgtctctcccatacctgaatatattctgggggtg
aacattttgcatggtctggacttatacaccatggccggagaattcagactcccagttcat
gtagcaaagctggctggagtgtggtggcatgatcatagtgtagcaggacaagccacggac
aaaacctctcagacacggccaaataaagccaaatccttccttaacccactgtctgagagc
tcggtttttgagacgtcttgcatgacctcagagggaaatggcagcatgctacagctggag
agtagaactgacaggagctgggtgcctctgccttatgagaatgtctcatgtctcacgtgt
ggtgtagaagccacagaaacctgccaactcggaaagtgtcagacagcagaaaataacatg
tcagtgataacttttgtggacccaagggctcaagccccaacactttgtataagtcccttg
gactttgacacacagccttcggttgggtgtgcagggcagaagaaggaatga

>gi568815597f:207221766_207459607|GENSCAN_predicted_peptide_8|231_aa
MMCGEHAVAVKVSSGEWAKHEPWETGGEDHEQRIQDRQLEKSGGVSFVGRFPVASGSADR
RPESDEHLQNPGGGEPGRRPRAPARRQPHRADQTAPLCGAASNSGGPSSRRGQWGQSGPG
GLLQWSLKASGRRTRVEHCPHRLAFGPLLCIAAQKLSLPGTRPHLRAPPPSSPRTPGATR
AKKTPSRPAAPMPRPKPPRRDAFRDPRASGSSGRARLEQQAAVSQERRALK

>gi568815597f:207221766_207459607|GENSCAN_predicted_CDS_8|696_bp
atgatgtgtggagagcatgcagtagcagtaaaagttagcagtggagagtgggcaaagcat
gagccatgggagacaggtggggaagaccatgagcagcgcatacaagacaggcagctggag
aaatcagggggtgtgagcttcgtgggtaggtttcccgtggcgagcggtagcgctgaccgc
cggccagagagcgatgagcacctgcagaatcctgggggcggggagcccggcaggagaccg
cgagcacccgcacgccgccagccacaccgggccgaccagacagcgcccctctgtggcgca
gcctccaattctggaggtcccagctcccggaggggccagtggggacaatcaggaccaggc
ggtttattgcagtggtccctcaaagctagcgggaggcggacacgcgtcgagcactgcccc
caccgtctcgcttttggccccctgctttgcatcgcggcacagaaactttccctgcccggg
acgcgtccccacctccgtgctccccctcccagctcaccgaggacccccggtgcgacgaga
gccaagaaaaccccgagcaggcccgcggcgcccatgccacggccgaagcccccgcggcgg
gatgcgttccgagacccgcgagcgtccggcagctctgggagggcaaggctggagcagcaa
gcagctgtgagccaggagaggcgggcccttaaatag