GENSCAN 1.0 Date run: 19-Jun-119 Time: 14:26:42 Sequence gi568815579r:53694052_53924174 : 230123 bp : 49.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.14 PlyA - 272 267 6 1.05 1.13 Term - 31174 31011 164 2 2 39 49 137 0.369 3.00 1.12 Intr - 74610 74488 123 1 0 40 71 103 0.067 4.36 1.11 Intr - 81757 81678 80 0 2 102 48 114 0.049 7.99 1.10 Intr - 94043 93836 208 0 1 71 84 89 0.395 4.94 1.09 Intr - 100085 100051 35 1 2 91 48 32 0.208 -2.63 1.08 Intr - 101978 101808 171 1 0 88 81 162 0.743 14.56 1.07 Intr - 104362 104192 171 0 0 114 77 168 0.999 17.36 1.06 Intr - 107346 107176 171 2 0 49 101 202 0.997 16.66 1.05 Intr - 110071 109901 171 0 0 90 89 59 0.877 5.26 1.04 Intr - 111399 111229 171 2 0 123 70 179 0.999 18.66 1.03 Intr - 113617 113444 174 0 0 22 105 215 0.613 15.65 1.02 Intr - 117237 115536 1702 1 1 95 87 1907 0.953 178.62 1.01 Init - 130123 129835 289 1 1 51 74 275 0.787 19.77 1.00 Prom - 134593 134554 40 -2.96 2.00 Prom + 137662 137701 40 -4.66 2.01 Init + 139299 139376 78 2 0 56 80 61 0.065 3.16 2.02 Intr + 172215 172366 152 2 2 30 101 185 0.554 12.96 2.03 Intr + 173809 173892 84 1 0 68 78 55 0.533 1.44 2.04 Intr + 175753 175787 35 1 2 34 105 49 0.209 -0.93 2.05 Term + 179477 180447 971 0 2 104 47 1581 0.195 147.82 2.06 PlyA + 182497 182502 6 1.05 3.00 Prom + 184612 184651 40 -6.06 3.01 Init + 188444 188613 170 1 2 104 81 147 0.750 13.26 3.02 Intr + 189112 189143 32 1 2 96 116 8 0.962 2.17 3.03 Intr + 190110 190192 83 1 2 59 86 112 0.968 7.46 3.04 Intr + 195587 195698 112 1 1 108 76 144 0.999 15.15 3.05 Intr + 195835 195966 132 2 0 92 81 219 0.999 22.22 3.06 Intr + 197623 197779 157 2 1 79 62 181 0.999 13.67 3.07 Intr + 198458 198592 135 2 0 84 81 278 0.999 26.38 3.08 Intr + 198937 199024 88 1 1 80 65 143 0.999 11.17 3.09 Intr + 199311 199340 30 2 0 139 105 13 0.967 6.43 3.10 Intr + 203908 204060 153 0 0 119 53 92 0.914 9.07 3.11 Intr + 204389 204577 189 1 0 36 107 407 0.997 37.18 3.12 Intr + 206182 206273 92 0 2 77 75 122 0.999 8.59 3.13 Intr + 206368 206430 63 1 0 108 99 101 0.999 11.03 3.14 Intr + 206560 206698 139 1 1 58 76 199 0.994 16.17 3.15 Intr + 209022 209102 81 2 0 84 110 114 0.998 13.03 3.16 Intr + 210584 210691 108 1 0 89 64 233 0.999 21.48 3.17 Intr + 212266 212406 141 0 0 107 77 131 0.955 14.45 3.18 Term + 212656 212844 189 0 0 109 33 373 0.992 31.35 3.19 PlyA + 213576 213581 6 1.05 4.00 Prom + 216465 216504 40 -13.24 4.01 Init + 218781 218976 196 2 1 51 92 311 0.983 24.89 4.02 Intr + 220449 220535 87 1 0 150 33 62 0.991 6.84 4.03 Intr + 221314 221454 141 2 0 130 88 216 0.999 26.12 4.04 Term + 223352 223389 38 0 2 66 47 39 0.418 -4.90 4.05 PlyA + 224600 224605 6 1.05 5.02 PlyA - 224709 224704 6 1.05 5.01 Sngl - 229945 229430 516 1 0 53 47 619 0.913 50.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 153234 153166 69 0 0 111 42 106 0.835 6.24 S.002 Sngl - 229030 228704 327 1 0 53 47 294 0.918 17.81 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:53694052_53924174|GENSCAN_predicted_peptide_1|1209_aa MLRTAGRDGLCRLSTYLEELEAVELKKFKLYLGTATELGEGKIPWGSMEKAGPLEMAQLL ITHFGPEEAWRLALSTFERINRKDLWERGQREDLVRDPQETYRDYVRRKFRLMEDRNARL GECVNLSHRYTRLLLVKEHSNPMQVQQQLLDTGRGHARTVGHQASPIKIETLFEPDEERP EPPRTVVMQGAAGIGKSMLAHKVMLDWADGKLFQGRFDYLFYINCREMNQSATECSMQDL IFSCWPEPSAPLQELIRVPERLLFIIDGFDELKPSFHDPQGPWCLCWEEKRPTELLLNSL IRKKLLPELSLLITTRPTALEKLHRLLEHPRHVEILGFSEAERKEYFYKYFHNAEQAGQV FNYVRDNEPLFTMCFVPLVCWVVCTCLQQQLEGGGLLRQTSRTTTAVYMLYLLSLMQPKP GAPRLQPPPNQRGLCSLAADGLWNQKILFEEQDLRKHGLDGEDVSAFLNMNIFQKDINCE RYYSFIHLSFQEFFAAMYYILDEGEGGAGPDQDVTRLLTEYAFSERSFLALTSRFLFGLL NEETRSHLEKSLCWKVSPHIKMDLLQWIQSKAQSDGSTLQQGSLEFFSCLYEIQEEEFIQ QALSHFQVIVVSNIASKMEHMVSSFCLKRCRSAQVLHLYGATYSADGEDRARCSAGAHTL LVQLRPERTVLLDAYSEHLAAALCTNPNLIELSLYRNALGSRGVKLLCQGLRHPNCKLQN LRLKRCRISSSACEDLSAALIANKNLTRMDLSGNGVGFPGMMLLCEGLRHPQCRLQMIQL RKCQLESGACQEMASVLGTNPHLVELDLTGNALEDLGLRLLCQGLRHPVCRLRTLWLKIC RLTAAACDELASTLSVNQSLRELDLSLNELGDLGVLLLCEGLRHPTCKLQTLRLGICRLG SAACEGLSVVLQANHNLRELDLSFNDLGDWGLWLLAEGLQHPACRLQKLWLDSCGLTAKA CENLYFTLGINQTLTDLYLTNNALGDTGVRLLCKRLSHPGCKLRVLWLFGMDLNKMTHSF PEPLQPDAVRDLYPRQFPAGNRNHRLFSSCRRPSSTASVDMGVTGDAQMSQHFPLGHQNS APHLRPTGWSSSTSLYTADEAPDVDIDQAFANKPAARAAGGAAAARQPSFAKALGSRGPQ TAGTRPLRHQDAQEAERKSNAYYVHLIFNQENWAGFLENSNELRAKCSLEFQYRDTQSRW EHPTSSLRC >gi568815579r:53694052_53924174|GENSCAN_predicted_CDS_1|3630_bp atgctacgaaccgcaggcagggacggcctctgtcgcctgtccacctacttggaagaactc gaggctgtggaactgaagaagttcaagttatacctggggaccgcgacagagctgggagaa ggcaagatcccctggggaagcatggagaaggccggtcccctggaaatggcccagctgctc atcacccacttcgggccagaggaggcctggaggttggctctcagcacctttgagcggata aacaggaaggacctgtgggagagaggacagagagaggacctggtgagggatccccaggaa acctacagggactatgtccgcaggaaattccggctcatggaagaccgcaatgcgcgccta ggggaatgtgtcaacctcagccaccggtacacccggctcctgctggtgaaggagcactca aaccccatgcaggtccagcagcagcttctggacacaggccggggacacgcgaggaccgtg ggacaccaggctagccccatcaagatagagaccctctttgagccagacgaggagcgcccc gagccaccgcgcaccgtggtcatgcaaggcgcggcagggataggcaagtccatgctggca cacaaggtgatgctggactgggcggacgggaagctcttccaaggcagatttgattatctc ttctacatcaactgcagggagatgaaccagagtgccacggaatgcagcatgcaagacctc atcttcagctgctggcctgagcccagcgcgcctctccaggagctcatccgagttcccgag cgcctccttttcatcatcgacggcttcgatgagctcaagccttctttccacgatcctcag ggaccctggtgcctctgctgggaggagaaacggcccacggagctgcttcttaacagctta attcggaagaagctgctccctgagctatctttgctcatcaccacacggcccacggctttg gagaagctccaccgtctgctggagcaccccaggcatgtggagatcctgggcttctctgag gcagaaaggaaggaatacttctacaagtatttccacaatgcagagcaggcgggccaagtc ttcaattacgtgagggacaacgagcctctcttcaccatgtgcttcgtccccctggtgtgc tgggtggtgtgtacctgcctccagcagcagctggagggtggggggctgttgagacagacg tccaggaccaccactgcagtgtacatgctctacctgctgagtctgatgcaacccaagccg ggggccccgcgcctccagcccccacccaaccagagagggttgtgctccttggcggcagat gggctctggaatcagaaaatcctatttgaggagcaggacctccggaagcacggcctagac ggggaagacgtctctgccttcctcaacatgaacatcttccagaaggacatcaactgtgag aggtactacagcttcatccacttgagtttccaggaattctttgcagctatgtactatatc ctggacgagggggagggcggggcaggcccagaccaggacgtgaccaggctgttgaccgag tacgcgttttctgaaaggagcttcctggcactcaccagccgcttcctgtttggactcctg aacgaggagaccaggagccacctggagaagagtctctgctggaaggtctcgccgcacatc aagatggacctgttgcagtggatccaaagcaaagctcagagcgacggctccaccctgcag cagggctccttggagttcttcagctgcttgtacgagatccaggaggaggagtttatccag caggccctgagccacttccaggtgatcgtggtcagcaacattgcctccaagatggagcac atggtctcctcgttctgtctgaagcgctgcaggagcgcccaggtgctgcacttgtatggc gccacctacagcgcggacggggaagaccgcgcgaggtgctccgcaggagcgcacacgctg ttggtgcagctcagaccagagaggaccgttctgctggacgcctacagtgaacatctggca gcggccctgtgcaccaatccaaacctgatagagctgtctctgtaccgaaatgccctgggc agccggggggtgaagctgctctgtcaaggactcagacaccccaactgcaaacttcagaac ctgaggctgaagaggtgccgcatctccagctcagcctgcgaggacctctctgcagctctc atagccaataagaatttgacaaggatggatctcagtggcaacggcgttggattcccaggc atgatgctgctttgcgagggcctgcggcatccccagtgcaggctgcagatgattcagttg aggaagtgtcagctggagtccggggcttgtcaggagatggcttctgtgctcggcaccaac ccacatctggttgagttggacctgacaggaaatgcactggaggatttgggcctgaggtta ctatgccagggactgaggcacccagtctgcagactacggactttgtggctgaagatctgc cgcctcactgctgctgcctgtgacgagctggcctcaactctcagtgtgaaccagagcctg agagagctggacctgagcctgaatgagctgggggacctcggggtgctgctgctgtgtgag ggcctcaggcatcccacgtgcaagctccagaccctgcggttgggcatctgccggctgggc tctgccgcctgtgagggtctttctgtggtgctccaggccaaccacaacctccgggagctg gacttgagtttcaacgacctgggagactggggcctgtggttgctggctgaggggctgcaa catcccgcctgcagactccagaaactgtggctggatagctgtggcctcacagccaaggct tgtgagaatctttacttcaccctggggatcaaccagaccttgaccgacctttacctgacc aacaacgccctaggggacacaggtgtccgactgctttgcaagcggctgagccatcctggc tgcaaactccgagtcctctggttatttgggatggacctgaataaaatgacccacagtttt ccggagccattacagccagacgctgtaagggacctgtacccaagacagtttccggctggg aatcgaaaccacaggctctttagttcctgcagaagaccgagctccacggcatccgttgat atgggcgtcaccggtgacgctcaaatgtcgcagcactttccacttggacatcagaatagt gctccacatttgaggcccacaggctggagcagctccaccagcctctacactgcagatgag gctcccgatgttgacattgaccaggcctttgcaaacaagccagcagctcgggcggcggga ggagccgcagccgccaggcagcccagcttcgccaaggctctcggcagccgcggcccgcag acagccggcacgcgccctctccgccaccaggatgcccaagaggcagaaagaaaaagcaat gcctactacgtgcaccttatatttaaccaagagaattgggctggatttctagaaaacagc aatgaacttagagccaagtgttcactcgagttccagtacagggacacacagtcccgatgg gagcatccaacatcgagtctacgatgttga >gi568815579r:53694052_53924174|GENSCAN_predicted_peptide_2|439_aa MSQTSRKALEQFPERIPSGTTRQIPQPPQLSALEGAQRRQRQTRPVSSASLSIRSRHQAP PSGRSQPGRARDDCRARPPTPPSPQTLPAAAVVFASAAVATGCLRSFSVDSSAKTAAMPV TVTRTTITTTTTSSSGLGSPMIVGSPRALTQPLGLLRLLQLVSTCVAFSLVASVGAWTGS MGNWSMFTWCFCFSVTLIILIVELCGLQARFPLSWRNFPITFACYAALFCLSASIIYPTT YVQFLSHGRSRDHAIAATFFSCIACVAYATEVAWTRARPGEITGYMATVPGLLKVLETFV ACIIFAFISDPNLYQHQPALEWCVAVYAICFILAAIAILLNLGECTNVLPIPFPSFLSGL ALLSVLLYATALVLWPLYQFDEKYGGQPRRSRDVSCSRSHAYYVCAWDRRLAVAILTAIN LLAYVADLVHSAHLVFVKV >gi568815579r:53694052_53924174|GENSCAN_predicted_CDS_2|1320_bp atgagccagacctccaggaaggcattagagcagttccccgagaggatccccagtggaact accaggcaaattccccagcctcctcagttgtcggccctggaaggtgcccagcggcggcag cgccagacgcgccccgttagctcagcgtcgctgagcatccgcagccgccaccaggccccg cccagcggccgcagccagccaggccgcgcccgggacgactgcagagcgcgccctccaacc ccacccagccctcagacccttccagctgccgctgtcgtctttgcttcagccgcagtcgcc actggctgcctgagatctttctccgtggattcctctgctaagaccgctgccatgccagtg acggtaacccgcaccaccatcacaaccaccacgacgtcatcttcgggcctggggtccccc atgatcgtggggtcccctcgggccctgacacagcccctgggtctccttcgcctgctgcag ctggtgtctacctgcgtggccttctcgctggtggctagcgtgggcgcctggacggggtcc atgggcaactggtccatgttcacctggtgcttctgcttctccgtgaccctgatcatcctc atcgtggagctgtgcgggctccaggcccgcttccccctgtcttggcgcaacttccccatc accttcgcctgctatgcggccctcttctgcctctcggcctccatcatctaccccaccacc tatgtccagttcctgtcccacggccgttcgcgggaccacgccatcgccgccaccttcttc tcctgcatcgcgtgtgtggcttacgccaccgaagtggcctggacccgggcccggcccggc gagatcactggctatatggccaccgtacccgggctgctgaaggtgctggagaccttcgtt gcctgcatcatcttcgcgttcatcagcgaccccaacctgtaccagcaccagccggccctg gagtggtgcgtggcggtgtacgccatctgcttcatcctagcggccatcgccatcctgctg aacctgggggagtgcaccaacgtgctacccatccccttccccagcttcctgtcggggctg gccttgctgtctgtcctcctctatgccaccgcccttgttctctggcccctctaccagttc gatgagaagtatggcggccagcctcggcgctcgagagatgtaagctgcagccgcagccat gcctactacgtgtgtgcctgggaccgccgactggctgtggccatcctgacggccatcaac ctactggcgtatgtggctgacctggtgcactctgcccacctggtttttgtcaaggtctaa >gi568815579r:53694052_53924174|GENSCAN_predicted_peptide_3|697_aa MAGLGPGVGDSEGGPRPLFCRKGALRQKVVHEVKSHKFTARFFKQPTFCSHCTDFIWGIG KQGLQCQVCSFVVHRRCHEFVTFECPGAGKGPQTDDPRNKHKFRLHSYSSPTFCDHCGSL LYGLVHQGMKCSCCEMNVHRRCVRSVPSLCGVDHTERRGRLQLEIRAPTADEIHVTVGEA RNLIPMDPNGLSDPYVKLKLIPDPRNLTKQKTRTVKATLNPVWNETFVFNLKPGDVERRL SVEVWDWDRTSRNDFMGAMSFGVSELLKAPVDGWYKLLNQEEGEYYNVPVADADNCSLLQ KFEACNYPLELYERVRMGPSSSPIPSPSPSPTDPKRCFFGASPGRLHISDFSFLMVLGKG SFGKVMLAERRGSDELYAIKILKKDVIVQDDDVDCTLVEKRVLALGGRGPGGRPHFLTQL HSTFQTPDRLYFVMEYVTGGDLMYHIQQLGKFKEPHAAFYAAEIAIGLFFLHNQGIIYRD LKLDNVMLDAEGHIKITDFGMCKENVFPGTTTRTFCGTPDYIAPEIIAYQPYGKSVDWWS FGVLLYEMLAGQPPFDGEDEEELFQAIMEQTVTYPKSLSREAVAICKGFLTKHPGKRLGS GPDGEPTIRAHGFFRWIDWERLERLEIPPPFRPRPCGRSGENFDKFFTRAAPALTPPDRL VLASIDQADFQGFTYVNPDFVHPDARSPTSPVPVPVM >gi568815579r:53694052_53924174|GENSCAN_predicted_CDS_3|2094_bp atggctggtctgggccccggcgtaggcgattcagaggggggaccccggcccctgttttgc agaaagggggccctgaggcagaaggtggtccacgaagtcaagagccacaagttcaccgct cgcttcttcaagcagcccaccttctgcagccactgcaccgacttcatctggggtatcgga aagcagggcctgcaatgtcaagtctgcagctttgtggttcatcgacgatgccacgaattt gtgaccttcgagtgtccaggcgctgggaagggcccccagacggacgacccccggaacaaa cacaagttccgcctgcatagctacagcagccccaccttctgcgaccactgtggctccctc ctctacgggcttgtgcaccagggcatgaaatgctcctgctgcgagatgaacgtgcaccgg cgctgtgtgcgtagcgtgccctccctgtgcggtgtggaccacaccgagcgccgcgggcgc ctgcagctggagatccgggctcccacagcagatgagatccacgtaactgttggcgaggcc cgtaacctaattcctatggaccccaatggtctctctgatccctatgtgaaactgaagctc atcccagaccctcggaacctgacgaaacagaagacccgaacggtgaaagccacgctaaac cctgtgtggaatgagacctttgtgttcaacctgaagccaggggatgtggagcgccggctc agcgtggaggtgtgggactgggaccggacctcccgcaacgacttcatgggggccatgtcc tttggcgtctcggagctgctcaaggcgcccgtggatggctggtacaagttactgaaccag gaggagggcgagtattacaatgtgccggtggccgatgctgacaactgcagcctcctccag aagtttgaggcttgtaactaccccctggaattgtatgagcgggtgcggatgggcccctct tcctctcccatcccctccccttcccctagtcccaccgaccccaagcgctgcttcttcggg gcgagtccaggacgcctgcacatctccgacttcagcttcctcatggttctaggaaaaggc agttttgggaaggtgatgctggccgagcgcaggggctctgatgagctctacgccatcaag atcttgaaaaaggacgtgatcgtccaggacgacgatgtggactgcacgctggtggagaaa cgtgtgctggcgctggggggccggggtcctggcggccggccccacttcctcacccagctc cactccaccttccagaccccggaccgcctgtatttcgtgatggagtacgtcaccggggga gacttgatgtaccacattcaacagctgggcaagtttaaggagccccatgcagcgttctac gcggcagaaatcgctatcggcctcttcttccttcacaatcagggcatcatctacagggac ctgaagctggacaatgtgatgctggatgctgagggacacatcaagatcactgactttggc atgtgtaaggagaacgtcttccccgggacgacaacccgcaccttctgcgggaccccggac tacatagccccggagatcattgcctaccagccctatgggaagtctgtcgattggtggtcc tttggagttctgctgtatgagatgttggcaggacagcctcccttcgatggggaggacgag gaggagctgtttcaggccatcatggaacaaactgtcacctaccccaagtcgctttcccgg gaagccgtggccatctgcaaggggttcctgaccaagcacccagggaagcgcctgggctca gggcctgatggggaacctaccatccgtgcacatggctttttccgctggattgactgggag cggctggaacgattggagatcccgcctcctttcagaccccgcccgtgtggccgcagcggc gagaactttgacaagttcttcacgcgggcggcgccagcgctgacccctccagaccgccta gtcctggccagcatcgaccaggccgatttccagggcttcacctacgtgaaccccgacttc gtgcacccggatgcccgcagccccaccagcccagtgcctgtgcccgtcatgtaa >gi568815579r:53694052_53924174|GENSCAN_predicted_peptide_4|153_aa MSHCSSRALTLLSSVFGACGLLLVGIAVSTDYWLYMEEGTVLPQNQTTEVKMALHAGLWR VCFFAGREKGRCVASEYFLEPEINLVTENTENILKTVRTATPFPMVSLFLVFTAFVISNI GHIRPQRTILAFVSGIFFILSVAVQIVEKGSSN >gi568815579r:53694052_53924174|GENSCAN_predicted_CDS_4|462_bp atgagtcactgcagcagccgcgccctgaccctgctgagcagcgtgtttggtgcgtgtggc ctgctcctggtaggcatcgcggtcagcactgactactggctgtacatggaagaaggcaca gtgctaccgcagaaccagaccaccgaggtcaagatggccctgcacgccggcctctggcga gtctgcttctttgcaggtcgggagaaaggtcgctgtgtggcctcagaatattttcttgaa ccggagatcaatttggtgacggaaaacacggagaatattctgaagacagtgcgcacggcc acccccttccccatggtcagcctcttcctcgtgttcacggccttcgtcatcagcaacatc ggccacatccgcccgcagaggaccattctggcttttgtctctggcatcttcttcatacta tcggtggctgtgcaaatcgttgaaaagggatcctccaactaa >gi568815579r:53694052_53924174|GENSCAN_predicted_peptide_5|171_aa MTRPGDNSTNTRPGDNSTNTRPGDNSTNDQTWGNSTNDQAWGNSTNDQAWGNSTNDQTWG NSTNDQTWGNCTNDQTWGNSTNDQTWGNSTNDQTWDNSTNDQAWDNSTNDQTWGNCTNDQ TWDNSTNDQTWGNSTNDQPGTTPPMTRPGTTPPMTRPGATPPMTSLGQLHQ >gi568815579r:53694052_53924174|GENSCAN_predicted_CDS_5|516_bp atgaccaggcctggggacaactccaccaataccagacctggggacaactccaccaatacc agacctggggacaactccaccaatgaccagacctggggcaactccaccaatgaccaggcc tggggcaactccaccaatgaccaggcctggggcaactccaccaatgaccagacctggggc aactccaccaatgaccagacctggggcaactgcaccaatgaccagacctggggcaactcc accaatgaccagacctggggcaactccaccaatgaccagacctgggacaactccaccaat gaccaggcctgggacaactccaccaatgaccagacctggggcaactgcaccaatgaccag acctgggacaactccaccaatgaccagacctggggcaactccaccaatgaccagcctggg acaactccaccaatgaccagacctgggacaactccaccaatgaccagacctggggcaact ccaccaatgaccagcctgggacaactccaccaatga