GENSCAN 1.0	Date run: 2-Nov-116	Time: 20:40:49

Sequence gi568815589r:34270802_34472841 : 202040 bp : 45.14% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 Intr -   1129   1008  122  1  2   84   99   142 0.566  14.19
 1.03 Intr -  15903  15816   88  2  1   83   76    62 0.566   4.47
 1.02 Intr -  19588  19373  216  0  0   70   79   132 0.264   8.12
 1.01 Init -  40545  39923  623  0  2   83   90   307 0.280  25.22
 1.00 Prom -  40993  40954   40                              -6.66

 2.00 Prom +  42873  42912   40                              -2.56
 2.01 Init +  43565  43575   11  1  2   68   77    15 0.802  -2.19
 2.02 Intr +  47626  48003  378  1  0   97   17   623 0.865  50.18
 2.03 Intr +  48049  48453  405  1  0   56   51   665 0.851  53.06
 2.04 Term +  48566  48854  289  2  1    0   49   456 0.801  27.75
 2.05 PlyA +  49516  49521    6                               1.05

 3.00 Prom +  53231  53270   40                              -6.46
 3.01 Init +  68239  68365  127  0  1   91   75   104 0.992   9.72
 3.02 Term +  72323  72639  317  0  2  112   49   382 0.999  31.80
 3.03 PlyA +  72878  72883    6                               1.05

 4.00 Prom +  82675  82714   40                              -3.26
 4.01 Init +  96820  96870   51  0  0   77   90    39 0.525   4.35
 4.02 Term +  97239  97322   84  2  0  103   42    65 0.517   1.05
 4.03 PlyA +  98555  98560    6                              -0.45

 5.20 PlyA -  98915  98910    6                               1.05
 5.19 Term - 102149  99998 2152  1  1   65   46  4217 0.269 400.57
 5.18 Intr - 105442 105256  187  2  1   53   71   127 0.402   6.25
 5.17 Intr - 107807 107749   59  1  2   78   93   -24 0.145  -4.37
 5.16 Intr - 108380 108279  102  1  0   74   40    94 0.333   2.49
 5.15 Intr - 108931 108845   87  0  0  109   97   121 0.999  14.19
 5.14 Intr - 109673 109536  138  1  0   49   80    99 0.865   4.78
 5.13 Intr - 110283 110096  188  0  2   20   63   189 0.120   8.09
 5.12 Intr - 112065 111953  113  1  2   56   65   127 0.195   7.30
 5.11 Intr - 114980 114836  145  2  1  124  116   147 0.967  20.96
 5.10 Intr - 126773 126698   76  1  1   57  131    52 0.038   5.92
 5.09 Intr - 130321 130212  110  1  2  111    9   299 0.060  23.38
 5.08 Intr - 130919 130865   55  1  1   66   83    55 0.904   1.78
 5.07 Intr - 131666 131586   81  1  0   64   89   106 0.865   7.05
 5.06 Intr - 132006 131904  103  1  1  105   75   194 0.999  19.03
 5.05 Intr - 134905 134831   75  2  0   75   66    66 0.826   2.59
 5.04 Intr - 135163 135064  100  1  1  134   92    76 0.882  12.28
 5.03 Intr - 138313 138224   90  1  0   54   84    40 0.372   0.39
 5.02 Intr - 143821 143775   47  2  2  101   95     8 0.361   0.93
 5.01 Init - 151454 151436   19  2  1  109   68    27 0.385   2.43
 5.00 Prom - 155348 155309   40                              -5.76

 6.04 PlyA - 156713 156708    6                               1.05
 6.03 Term - 158482 158390   93  1  0  109   39   106 0.498   5.63
 6.02 Intr - 167049 166872  178  2  1  -15   69   111 0.200  -1.18
 6.01 Init - 167535 167408  128  0  2   77   27   173 0.374   7.73
 6.00 Prom - 171229 171190   40                              -2.26

 7.03 PlyA - 171517 171512    6                               1.05
 7.02 Term - 174466 174383   84  1  0   63   48    82 0.386  -0.65
 7.01 Init - 187462 187403   60  1  0  103   81   115 0.687  13.58

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 130321 130163  159  1  0  111   44   332 0.874  29.04

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:34270802_34472841|GENSCAN_predicted_peptide_1|350_aa
MASWLYECLCEAELAQYYSHFTALGLQKIDELAKITMKDYSKLGVHDMNDRKRLFQLIKI
IKIMQEEDKAVSIPERHLQTSSLRIKSQELRSGPRRQLNFDSPADNKDRNASNDGFEMCS
LSDFSANEQKSTYLKVLEHMLPDDSQYHTKTGILNATAGDSYVQTEISTSLFSPNYLSAI
LGDCDIPIIQRISHVSGYNYGIPHSCIRGNATCFAYGQTGAGKTYTMIGTHENPGLYALA
AKDIFRQLEVSQPRKHLFVWISFYEIYCGQLYDLLNRRKRLFAREDSKHMVQIVGLQELQ
VDSVELLLEVILKGSKERSTGATGVNADSSRSHAVIQIQIKDSAKRTFGS

>gi568815589r:34270802_34472841|GENSCAN_predicted_CDS_1|1050_bp
atggcatcctggttatatgaatgtctttgtgaagctgaacttgcacagtattattctcat
ttcactgcccttggccttcagaaaatagatgaattagccaagattacaatgaaggactac
tccaaattaggagtccatgacatgaacgaccgcaaacgtctcttccaacttatcaaaatt
attaagattatgcaagaagaagataaagcagtcagtatcccagagcgtcatcttcagaca
agcagcctgcgcatcaaatctcaggaattaagatctggccctcgcagacagctgaatttt
gattctcctgctgacaataaagacagaaatgccagcaatgatgggtttgaaatgtgcagt
ttatcagatttctctgcaaatgaacagaagtccacttacctaaaagtgctagaacacatg
ctaccagatgattcccagtaccatacaaaaacaggaattctgaatgccacagctggtgat
tcctatgtgcaaacagaaatcagcacttcactcttttcaccaaattacctttctgcaata
ctgggggattgtgatattcccattattcaaagaatctctcatgtttcagggtataactat
ggaatccctcattcttgtatcagaggcaatgccacttgctttgcttatggacagacaggt
gctggaaagacctacaccatgataggaactcatgagaacccaggattgtatgctctagct
gccaaagatatcttcaggcaactagaagtgtcccagccaagaaagcacctctttgtgtgg
atcagcttctatgaaatttactgtggacagctttatgacctcctaaatagaagaaaaagg
ctctttgcaagagaagatagcaagcacatggtgcagatagtgggactgcaagagcttcag
gtggacagtgtggagctcctcttagaggtgatcttaaagggcagcaaggagcgcagcact
ggggccactggagttaatgcagactcctcccgctcccatgccgtcatccaaattcagatc
aaagattcagccaagaggacatttggcagn

>gi568815589r:34270802_34472841|GENSCAN_predicted_peptide_2|360_aa
MLASAFCLLAVALATEVKKPAATAAPGTAEKLSPKAATLAEHSAGLAFSLYQAMAKDQAV
ENILVSPVVVASSLGLVSLGGKATTASEAKAVLSAKQLSDEEVHAGVGEPLRSLSNSTAR
NVTWKLCSRLSKQHYNCEHSKINFHDKRSALQSIHEWAVQTTDGKLPKVTKDMECMDGAL
LVNTMFFKPHWNEKFHHKMVENRGFMVTRFYTVGVMVMHQTGLYNYYDNEKEKLQIVEMP
LAHKLSSLIILMPHHVEPLEALKSWLGLTEAIDKNKANLSRMPHKKDLYLTSVFHATAFE
LDTDGNSFDQDIYGSKELRSPKLFYSDHPFIFLVWDTQSGSLLFTGHLVRPKVDKMQDEF

>gi568815589r:34270802_34472841|GENSCAN_predicted_CDS_2|1083_bp
atgttggccagcgccttctgcctcctggcggtggccttggcgaccgaggtgaagaaacct
gcagccacagcagctcctggcaccgcagagaagctgagccccaaggcagccacgctggcc
gaacacagcgccggcctggccttcagcctgtaccaggccatggccaaggaccaggcggtg
gagaacatcctggtgtcgcccgtggtggtggcctcgtcgttggggctcgtgtcgctgggc
ggcaaggcgaccacggcgtcggaggccaaggcagtgctgagtgccaagcagctgagcgac
gaggaggtgcacgccggcgtgggcgagccgctgcgttcactcagcaactccaccgcgcgc
aacgtgacctggaagctgtgcagtcgcctcagcaagcagcactacaactgcgagcactcc
aagatcaatttccatgacaagcgcagtgcgctgcagtccatccacgagtgggccgtgcag
accaccgacggcaagctgcccaaggtcaccaaggacatggagtgcatggatggcgccctg
cttgtcaacaccatgttcttcaagccacactggaatgagaaattccaccacaagatggtg
gaaaaccgtggcttcatggtgactcggttctataccgtgggtgtcatggtgatgcaccag
acaggcctctacaactactatgacaatgagaaggaaaagctgcaaatcgtggagatgccc
ctggcccacaagctctccagcctcatcatcctcatgccccaccacgtggagcccctcgag
gccttaaaaagctggcttggcctgactgaggccattgacaagaacaaggcaaacttgtca
cgcatgccacacaagaaggacctgtacctgaccagcgtgttccacgccaccgcctttgag
ttggacacagacggcaactcctttgaccaggacatctatgggagcaaggagctgcgcagc
cccaagctgttctactccgaccaccccttcatcttcctggtgtgggacacccagagcggc
tccctgctgttcactgggcacctggtccggcctaaggttgacaagatgcaagacgagttt
tag

>gi568815589r:34270802_34472841|GENSCAN_predicted_peptide_3|147_aa
MALRACGLIIFRRCLIPKVDNNAIEFLLLQASDGIHHWTPPKGHVEPGEDDLETALRETQ
EEAGIEAGQLTIIEGFKRELNYVARNKPKTVIYWLAEVKDYDVEIRLSHEHQAYRWLGLE
EACQLAQFKEMKAALQEGHQFLCSIEA

>gi568815589r:34270802_34472841|GENSCAN_predicted_CDS_3|444_bp
atggccttgagagcatgtggcttgatcatcttccgaagatgcctcattcccaaagtggac
aacaatgcaattgagtttttactgctgcaggcatcagatggcattcatcactggactcct
cccaaaggccatgtggaaccaggagaggatgacttggaaacagccctgagggagacccaa
gaggaagcaggcatagaagcaggccagctgaccattattgaggggttcaaaagggaactc
aattatgtggccaggaacaagcctaaaacagtcatttactggctggcggaggtgaaggac
tatgacgtggagatccgcctctcccatgagcaccaagcctaccgctggctggggctggag
gaggcctgccagttggctcagttcaaggagatgaaggcagcgctccaagaaggacaccag
tttctttgctccatagaggcctga

>gi568815589r:34270802_34472841|GENSCAN_predicted_peptide_4|44_aa
MVSFDSMSHIQVTLMQEAQHYVEDAKTWGLHPLKPRPELYLGPF

>gi568815589r:34270802_34472841|GENSCAN_predicted_CDS_4|135_bp
atggtctcctttgactccatgtctcacatccaggtcacactgatgcaggaggctcaacac
tacgtggaagatgccaagacttggggcttgcaccctctgaagccaaggcctgagctgtat
cttggccccttttag

>gi568815589r:34270802_34472841|GENSCAN_predicted_peptide_5|1308_aa
MAVPLADLPQRPYPYKKSLSFPALLNCGPKELAVSSRIKYHQELPSRPQGPHDPAAASIS
DGDCDAREGESVAMNYKPSPLQVKLGKEGSYSSLQEPETLRGISEAKNVPEKQRELARKG
SLKNGSMGSPVNQQPKKNNVMARTRLVVPNKGYSSLDQSPDEKPLVALDTDSDDDFDMSR
YSSSGYSSAEQINQDLNIQLLKDGYRLDEIPDDEDLDLIPPKSVNPTAPTSIKEVYKDPP
LCAWEANKFLTPGLTHTMERHVDPEALQKMAKCAVQDYTYRGSISGHPYLPEKYWLSQEE
DKCSPNYLGSDWYNTWRMEPYNSSCCNKYTTYLPRLPKQLPRITPRCGCVDPLPGRLPFH
GYESACSGRHYCLRGMDYYASGAPCTDRRLRPWCREQPTVRKMSSLAGPPSPSVVGAVIP
GGYVSREMVTTLPEWLWPPRIPHHLRRCVPPYEHRPGMQCAVTTPPPSYYPYPNLRWDTS
HFKKSGGPQRNNYVIHPEFVSETYPDYRCWSPHHSLPKASLCSDTLAAQAPVPDAIKIAL
FAALVVFVVRVEQAAAPALGKTGLTMPCPSDGSWSHQLSPRAPFCYCIQAASPLMLQNPQ
EKSQAYPRRRRPGCYAYRQNPEAIAAAAMYTFLPDNFSPAKPKPSKDLKPLLGSAVLGLL
LVLAAVVAWCYYSVSLRKAERLRAELLDLKAGGFSIRNQKGEQVFRLAFRSGALDLDSCS
RDGALLGCSLTADGLPLHFFIQTVRPKDTVMCYRVRWEEAAPGRAVEHAMFLGDAAAHWY
GGAEMRTQHWPIRLDGQQEPQPFVTSDVYSSDAAFGGILERYWLSSRAAAIKVNDSVPFH
LGWNSTERSLRLQARYHDTPYKPPAGRAAAPELSYRVCVGSDVTSIHKYMVRRYFNKPSR
VPAPEAFRDPIWSTWALYGRAVDQDKVLRFAQQIRLHHFNSSHLEIDDMYTPAYGDFDFD
EVKFPNASDMFRRLRDAGFRVTLWVHPFVNYNSSRFGEGVERELFVREPTGRLPALVRWW
NGIGAVLDFTHPKARDWFQGHLRRLRSRYSVASFKFDAGEVSYLPRDFSTYRPLPDPSVW
SRRYTEMALPFFSLAEVRVGYQSQNISCFFRLVDRDSVWGYDLGLRSLIPAVLTVSMLGY
PFILPDMVGGNAVPQRTAGGDVPERELYIRWLEVAAFMPAMQFSIPPWRYDAEVVAIAQK
FAALRASLVAPLLLELAGEVTDTGDPIVRPLWWIAPGDETAHRIDSQFLIGDTLLVAPVL
EPGKQERDVYLPAGKWRSYKGELFDKTPVLLTDYPVDLDEIAYFTWAS

>gi568815589r:34270802_34472841|GENSCAN_predicted_CDS_5|3927_bp
atggccgtacctctggcagatctcccacaaagaccttatccctataagaagtctttgtct
ttccctgccctcctgaattgtggtccaaaggagctggcagtttcatccaggatcaaatac
catcaggaactgcccagtaggcctcaaggaccacatgacccagccgccgcctccatctct
gacggagactgtgacgcccgggagggtgagtcagtagccatgaattacaaaccatccccg
ctccaagtgaagctgggcaaagagggcagttacagttctctgcaggagcctgaaactcta
agaggcatctctgaagccaagaacgttccagagaagcagcgggagctggcccggaagggc
tccctgaagaatggcagcatgggtagccctgtcaaccagcaacccaagaagaacaatgtc
atggcccgaacaaggctggtcgtccccaataaaggctactcctcacttgaccagagccct
gatgagaagccactggtagcccttgacacggacagcgatgatgactttgacatgtctaga
tactcctcctccggctactcctctgctgagcagatcaaccaagatttgaacatccagctg
ctgaaggacggctaccggttagatgagatccccgacgacgaggacctagacctcatcccc
cccaagtccgtgaaccccacggctcccacctccatcaaggaggtctataaggacccaccc
ctgtgtgcctgggaagccaacaagtttctgactccgggtctgactcataccatggagcga
catgtggatcccgaagccctgcagaagatggccaaatgtgctgtacaggactacacttat
agggggtccatatcaggccacccctacttgcctgagaagtactggctttctcaagaggaa
gacaaatgcagcccaaactacctgggcagtgactggtacaacacatggaggatggaacct
tacaacagcagctgctgcaacaagtataccacctaccttcctcggctgcctaagcagctg
ccgcggatcacgccccgatgcgggtgcgtggacccgctgcccggccgcctgcccttccat
ggttacgaaagtgcttgctcgggccgccactactgtctgcgcgggatggactactacgcc
agcggggcgccctgcaccgaccgccgcctgcggccttggtgccgggagcaaccgactgta
agaaaaatgtcctctcttgctggtcccccgtcaccgtcggtagtgggtgcggtcattcct
gggggctacgtgagccgggaaatggtgaccactttaccagaatggctctggcctcctcgg
atcccgcaccacctgcggagatgtgtacctccctacgagcaccggcccggaatgcagtgt
gctgttacaactcccccgccgtcatactacccatatccgaaccttagatgggacacaagt
cacttcaagaagtctggtggtccccagagaaacaactatgttatccatcctgagtttgtg
tctgagacctatcccgactatcgttgctggagtccccatcacagcctcccaaaagcctct
ctgtgctcagacacccttgctgcacaagccccggtaccagatgccattaaaattgccctc
tttgctgccctcgtggtcttcgtggtccgcgtggagcaagcagcagcaccagcccttgga
aagacaggcctcaccatgccatgtccatctgatggaagctggtcccaccagctcagcccc
agggctcctttctgctactgcattcaggcagccagcccactaatgctccagaaccctcag
gagaagagccaggcctacccccgccgccgccggcctggctgctacgcataccgtcagaac
cccgaggccatcgcagccgcagctatgtacaccttcctgcccgacaacttctcacctgcc
aagcccaagccttccaaagacctgaagccgctgctgggctccgcggttctggggctgctg
cttgtgctggccgcggtggtggcctggtgctactacagcgtctccctacgcaaggcggag
cgacttcgcgcggagctgctggacctgaaagctggcggcttctccatccgcaatcagaag
ggagagcaggtcttccgcctggccttccgctccggcgcgctggaccttgactcctgcagc
cgcgatggcgccctgctgggctgctcgctcacggccgacgggctgccgctgcacttcttc
atccagactgtgcggcccaaggacacggtcatgtgctaccgcgtgcgctgggaggaggca
gcgccgggccgggccgtggagcacgccatgttcttgggcgacgcggcggcccactggtat
ggtggcgccgagatgaggacgcaacactggcccatccgcctggatggccagcaggagccc
cagccgttcgtcaccagcgatgtctactcctccgacgccgcgtttgggggcatcctcgag
cgctactggctatcttcgcgcgcggccgccatcaaagtcaatgactcagtgcccttccac
ctgggctggaacagcacggagcgctcgctgcggcttcaggcgcgctaccacgacacgccc
tacaagccacccgccggccgcgccgcagcgccagagctgagctaccgagtgtgcgtgggc
tcagacgtcacctccatccacaagtacatggtgcgtcgctacttcaacaagccgtcaagg
gtgccagcacccgaggccttccgagaccccatttggtccacatgggcgctgtacgggcgc
gccgtggaccaggacaaggtgctgcgttttgcccaacagatccgcctgcaccacttcaac
agcagccacctggaaatcgacgacatgtacacacctgcttatggcgacttcgacttcgat
gaggtcaaattccccaacgccagcgacatgttccgccgcctgcgcgacgccggcttccgc
gtcacgctctgggtgcacccttttgtcaactacaactcgtcgcgcttcggcgagggcgtg
gagcgcgagctgttcgtgcgcgaacccacgggccggttacctgcgctggtgcgctggtgg
aacggcatcggcgcggtgctagacttcacgcacccaaaggcccgcgactggttccaggga
cacctgcggcggctgcgctctcgctactccgtggcttccttcaagttcgacgcgggcgag
gtcagctacctgccgcgggacttcagcacctaccggccgctgccggaccccagcgtctgg
agccggcgctacactgagatggcgctgcccttcttctcgctggcggaggtgcgcgtaggc
taccagtcacagaacatctcctgcttcttccgcctggtggatcgcgactctgtgtggggc
tacgacctggggttgcgctcactcatccccgcggtgctcaccgtcagcatgctgggctac
ccattcatcctacccgatatggtgggcggcaacgccgtgccccagcggacagccggcggc
gatgtgcccgagcgcgagctctacattcgctggctggaagtggccgcctttatgccggcc
atgcagttctctatcccgccctggcgctacgacgcggaagtggtggccatcgcgcagaag
ttcgccgccctgcgggcctcgcttgtggcaccgctgttgcttgagctggcgggcgaggtc
accgacacgggtgaccctatcgtgcgccccctttggtggattgcgcccggcgacgagaca
gctcaccgtatcgactcgcagttccttattggggacacgctgcttgtggccccggtgctg
gagccaggcaagcaggagcgcgacgtctatttgcccgccggcaagtggcgcagctacaag
ggtgagcttttcgacaagacgccggtgctgctcaccgattacccggtcgacctggatgag
atcgcctactttacctgggcgtcctga

>gi568815589r:34270802_34472841|GENSCAN_predicted_peptide_6|132_aa
MRPHSSALVWSMGLGAVEQGVALIGEAQASLGRLMAAQEPMEGGGLKGSSNATKVGAQAE
EALRATEGCEDCQHAVTSQKHLLLICHQTHSVSFNLLNSAKKASGSLIEGTVKPKGMHYT
SKNLLVKEIDAK

>gi568815589r:34270802_34472841|GENSCAN_predicted_CDS_6|399_bp
atgcgcccgcactcctcagcccttgtgtggtcgatgggactgggcgccgtggagcagggg
gtggcgctcattggggaggcacaggcctcactggggaggctcatggccgcacaggagccc
atggagggcggtgggctgaagggctcctcaaatgccaccaaagtaggagcccaggcagag
gaggcgctgagagcgaccgagggctgtgaggactgccagcacgctgtcacctctcagaag
cacttactacttatctgccaccagacacacagtgtttcatttaatcttctcaactctgca
aagaaggcttctgggtcccttattgaaggaactgtcaagcccaaaggcatgcattatacc
tcaaagaacctcctggtcaaagaaattgatgctaaataa

>gi568815589r:34270802_34472841|GENSCAN_predicted_peptide_7|47_aa
MMEEIDRFQVPTAHSEMQPLNLQTGEEDTCSYETELQKNTAGSFKDE

>gi568815589r:34270802_34472841|GENSCAN_predicted_CDS_7|144_bp
atgatggaggagatcgaccggttccaggtgcccaccgcgcactcggagatgcagccgctg
aacttacagactggtgaagaagatacttgctcttatgaaacagaactacagaagaataca
gctggctcctttaaagatgagtaa