GENSCAN 1.0	Date run: 4-Nov-116	Time: 07:44:44

Sequence gi568815574f:6146269_6266592 : 120324 bp : 38.30% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -    596    591    6                               1.05
 1.01 Sngl -   3096   2083 1014  0  0   44   38   476 0.813  34.96
 1.00 Prom -  18922  18883   40                              -5.35

 2.00 Prom +  22248  22287   40                              -5.75
 2.01 Init +  23883  23955   73  2  1   80  108    31 0.842   5.58
 2.02 Term +  28886  28995  110  0  2   32   37   147 0.794   1.69
 2.03 PlyA +  29493  29498    6                               1.05

 3.05 PlyA -  35622  35617    6                               1.05
 3.04 Term -  48695  48507  189  2  0   43   42   165 0.214   3.87
 3.03 Intr -  59297  59118  180  2  0   19   94   110 0.342   3.84
 3.02 Intr -  59870  59746  125  0  2   38   37   117 0.234   0.98
 3.01 Init -  77973  77469  505  0  1   39   38   247 0.049   9.99
 3.00 Prom -  83143  83104   40                              -7.75

 4.00 Prom +  85448  85487   40                              -1.05
 4.01 Init +  91862  91965  104  1  2   56  103    22 0.144   0.27
 4.02 Intr +  97608  97780  173  0  2   72   81   129 0.632   9.26
 4.03 Intr +  98602  98762  161  2  2   96   45   155 0.746  10.79
 4.04 Intr +  99142  99516  375  0  0   44   62   303 0.727  17.59
 4.05 Intr +  99971 100518  548  1  2   43  -52   446 0.745  16.85
 4.06 Intr + 101069 101165   97  2  1   68   78   110 0.773   7.09
 4.07 Intr + 101294 101405  112  1  1  102   61   106 0.963   8.33
 4.08 Intr + 101507 101652  146  0  2   76  105   108 0.970  10.38
 4.09 Intr + 101759 101840   82  1  1   43   99    45 0.947  -0.61
 4.10 Intr + 102109 102224  116  2  2   49   44   140 0.884   4.95
 4.11 Term + 104137 104328  192  0  0   73   44   131 0.591   3.64
 4.12 PlyA + 105648 105653    6                              -1.75

 5.09 PlyA - 106112 106107    6                               1.05
 5.08 Term - 108938 108722  217  1  1  -10   47   219 0.599   3.63
 5.07 Intr - 109034 108961   74  2  2  124   43    64 0.525   2.79
 5.06 Intr - 116632 116561   72  1  0   38   88    67 0.495   0.38
 5.05 Intr - 117479 117334  146  0  2  129   59    67 0.793   6.98
 5.04 Intr - 117675 117566  110  2  2   81   99    68 0.754   6.21
 5.03 Intr - 118208 117953  256  0  1  108   61    33 0.667  -1.82
 5.02 Intr - 118790 118578  213  0  0  -17   44   197 0.326   2.56
 5.01 Intr - 119589 118997  593  2  2   78   44   365 0.091  22.41

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815574f:6146269_6266592|GENSCAN_predicted_peptide_1|337_aa
MVKGSIQQEEVTILNTYAPNTGAPRYIKQVLSDLQRDLDYQTIIMGDFNTPVSTLDRSTR
QKVNKDTQELNSTLHQADLIDIHRTLHPKSTEYTFFSALYHTYSKIDHIVGSKALLSKCK
RTEIITNCLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWAHNEMKAEIKMFFETNE
NKDTTYQNLWDTFKALCRGKFLALNAHKRKQEKSKIDTLTSQLKELEKQEQTHSKASRRQ
EITKIRAELKEIETQKTLQKINESRSWFFERINKIDRQLARLIKKKREKNQIDAIKNDQG
DITTDPTEIQTTIREYYKQLYANKLENLEEMDKFLDT

>gi568815574f:6146269_6266592|GENSCAN_predicted_CDS_1|1014_bp
atggtaaagggatcaattcaacaagaagaggtaactatcctaaatacatatgcacccaat
acaggagcacccagatacataaagcaagtcctgagtgacctacaaagagacttagactac
caaacaataataatgggagactttaacaccccagtgtcaacattggacagatcaacgaga
cagaaagttaacaaggatacccaggaattaaactcaacgctgcaccaagcggacctaata
gacatccacagaactctccaccccaaatcaacagaatatacatttttttcagcactgtac
cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaa
agaacagaaattataacaaactgtctctcagaccacagtgcaatcaaactagaactcagg
attaagaaactcactcaaaaccgctcaactacatggaaactgaacaacctgctcctgaat
gactactgggcacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag
aacaaagacacaacataccagaatctctgggacacattcaaagcattgtgtagagggaaa
tttctagcactaaatgcccacaagagaaagcaggaaaaatccaaaattgacaccctaaca
tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa
gaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa
attaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagacagctagca
agactaataaagaaaaaaagagagaagaatcaaatagatgcaataaaaaatgatcaaggg
gatatcaccaccgaccccacagaaatacaaactaccatcagagaatactacaaacaactc
tacgcaaataaactagaaaatctagaagaaatggataaattcctcgacacatag

>gi568815574f:6146269_6266592|GENSCAN_predicted_peptide_2|60_aa
MAPASAPSKGLRKISVMAKCERGAIIKLLDPLKTYSKTCSDFGKHREPVDAQGAVEPLVI

>gi568815574f:6146269_6266592|GENSCAN_predicted_CDS_2|183_bp
atggcaccagcatctgcccctagcaagggcctcaggaagatttcagtcatggcaaaatgt
gaaaggggagcaatcatcaaactactagatcccctgaaaacatactccaaaacctgctct
gactttggcaagcacagggaaccagtggatgcccagggagctgtggaacccctggtgatc
taa

>gi568815574f:6146269_6266592|GENSCAN_predicted_peptide_3|332_aa
MYGKAWVPTKRPAPQADALQNTFDGAVWKGSMGLDTPQRVPIMALPSGAVETELQPSRPQ
SGRSIDSLHPELGKATDTQLQPMRVATGATSCKAIWVELPKAFRTHPLHQCTLDVGHGFK
DYFGALRIKVCPAGFQTSGGPIAPFYWLFSLFWNGNAYSMPVPPLHLARVVWSGEALWTL
VEREHRNFEALNSVLSCYSRKENQTKLSGQILDDVSIKPWARGEHTAFEKNYPLQAACIT
EETLGSKQPTAIPRYYIEGIGCASETCWLQDLVPCILALAKRCQHKAQAVASEEANPKPW
QLKRDIGPVGAEKLRIEVWEPPPRFQRMYGNT

>gi568815574f:6146269_6266592|GENSCAN_predicted_CDS_3|999_bp
atgtatggaaaagcctgggttcccacaaagaggcctgctccacaggcagatgcccttcag
aatacatttgatggggcagtgtggaagggaagtatggggttggacaccccacagagagtc
cccattatggcactgcctagtggagctgtggaaacagagctgcagccttctagaccccaa
agtggtagatccattgacagcttgcaccctgagcttggaaaagccacagacactcaactt
caacccatgagagtagcaacaggggctacatcatgcaaagccatatgggtagagctgccc
aaggcctttcgaacccaccccctgcatcagtgtaccctagatgtgggacatggattcaag
gattattttggagctttaagaattaaagtctgccctgctgggtttcagacttctgggggg
cctattgcccctttctattggctattttctctattttggaatgggaatgcctactcaatg
cctgtaccaccattgcatcttgcaagagtggtgtggagtggagaggctctctggacactg
gtggagagagaacacagaaattttgaagcattaaactcggtgctttcctgttacagcaga
aaggaaaaccagactaaactcagtggacagatcctagatgatgtttctataaaaccctgg
gccagaggagaacacactgcatttgaaaaaaattatccacttcaggcagcatgcataact
gaggagaccttgggttctaaacaaccaacagctatacccaggtactacatcgagggcatt
gggtgtgcctctgagacctgttggcttcaggacttggtgccctgcatcctagccttggct
aaaagatgccaacataaagctcaggctgttgcttcagaagaagcaaaccctaagccttgg
cagctaaaacgtgatattgggcctgtgggtgcagaaaagttaagaattgaggtttgggaa
cctccacctagatttcagaggatgtatggaaacacctga

>gi568815574f:6146269_6266592|GENSCAN_predicted_peptide_4|701_aa
MPWQAGIISCGSQVYSGSIVSSSWVLIAAHCVRNMSTSGVPNNLHLVKESPDDSVGKPPD
ACNSARAQRMWSRDISQTFGLSRSDVDPLFLSAPMPPRSPMPIPCCQPSGIGSCKDMALS
QKPRPQGHSRSLPPSEGPPASPLMAWMAALAVGLRTTAGSTAHRSSEARGRGPVLPEPHQ
QAPQRLLRVREPLGRQGSAQQRARRPTMANPGGWPLVCPGHRTRGPLECSLEYSILREEA
WYSEPVFASTCESTCQGSSLQCRRIPPLHKQSLRGRLEARRMRPEGSLTYRVPERLRQGS
CGVGRAAQALVCASAKEGTAFRMEAVQEGAAGVESEQAALGEEAVLLLDDIMAEVEVVAE
EEGLVERREEAQRAQQAVPGPGPMTPESALEELLAVQVELEPVNAQARKAFSRQREKMER
RRKPHLDRRGAVIQSVPGFWANVVSFSVFLRPFYLFDLNQIANHPQMSALITDEDEDMLS
YMVSLEVEEEKHRVHLCKIMLFFRSNPYFQNKVITKEYLVNITEYRASHSTPIEWYPDYE
VEAYRRRHHNSSLNFFNWFSDHNFAGSNKIAEILCKDLWRNPLQYYKRMKPPEEGTETSE
SMRSKAENHSVQSVALDSTAVNPTGSHPTNPMRLGSLNTSEALERESCPPLTTVPLTHKH
RRNRCSKQIILKTVGTLWPPQAALYPTVCMSKTLWSSTVSL

>gi568815574f:6146269_6266592|GENSCAN_predicted_CDS_4|2106_bp
atgccttggcaggctgggatcatcagctgtggcagtcaggtctacagcgggtccatagtt
agcagctcatgggttctcatagctgcccactgtgtcaggaacatgtcaacatctggtgtg
ccaaacaatctacatctggtcaaggagtctccagatgattcagtgggcaagcctcctgac
gcctgcaattctgcaagagcacagagaatgtggagcagggatatctcccagacatttggc
ctgtcacgctccgatgttgatcctctgtttctgtctgcccccatgcccccacgaagcccg
atgcccatcccctgctgccagccatctggaatcggcagctgcaaggatatggctctgtcc
cagaagcccaggccacaggggcattcacggagtctacctccaagtgaaggacctccagcg
agtccattgatggcctggatggccgcactcgccgtggggctcaggaccacagcagggtca
actgcgcataggagctcagaagccagaggccgaggccctgtgcttccagagccccaccag
caggcaccgcagcggctgctgcgggtgcgggagcctctgggtcgtcaaggcagtgcacaa
cagcgtgcgcgcaggccgacaatggccaaccctggcggctggcctctggtgtgcccaggg
cacaggactagaggccctttggaatgctccttggagtacagcatcctcagggaggaagca
tggtactcggagcctgtatttgcctcgacctgcgagagtacatgccagggttctagccta
cagtgcagacgaattccacctctgcacaagcagtcccttagggggcgcctggaagcccgg
cgcatgcgccctgagggctcgctgacctaccgggtgccagagaggctgcggcagggttcc
tgtggcgtgggtcgggcagcacaggccttggtgtgtgcgagtgccaaggagggcaccgcc
ttcaggatggaggctgtgcaggagggggcggccggggtggagagtgagcaggcggctttg
ggggaggaggcggtgctgctgttggatgacataatggcggaggtggaggtggtggcggag
gaggagggcctcgtggagcggcgggaggaggcccagcgggcacagcaggctgtgcctggc
cctgggcccatgaccccagagtctgcactggaggagctgctggccgttcaggtggagctg
gagccggttaatgcccaagccaggaaggccttttctcggcagcgggaaaagatggagcgg
aggcgcaagccccacctagaccgcagaggcgccgtcatccagagcgtccctggcttctgg
gccaatgttgtatccttctcagtgtttcttcggcctttctatctctttgacctaaatcag
attgcaaaccacccccagatgtcagccctgatcactgacgaagatgaagacatgctgagc
tacatggtcagcctggaggtggaagaagagaagcatcgtgttcatctctgcaagatcatg
ttgttctttcggagtaacccctacttccagaataaagtgattaccaaggaatatctggtg
aacatcacagaatacagggcttctcattccactccaattgagtggtatccggattatgaa
gtggaggcctatcgccgcagacaccacaacagcagccttaacttcttcaactggttctct
gaccacaacttcgcaggatctaacaagattgctgagatcctatgtaaggacctgtggcgc
aatcccctgcaatactacaagaggatgaagccacctgaagagggaacagagacgtcagaa
tcaatgagaagtaaagctgaaaatcattcagttcagtctgtggcacttgattccacggct
gtcaaccccaccggcagtcatcccaccaaccccatgagattgggctccctgaatacctct
gaagctctggagcgggagtcttgtcctcctctgactaccgtccccctgacccacaaacac
aggagaaacaggtgttctaagcaaattattctgaaaacagtcggaaccctttggccccct
caagctgccctctatcctactgtgtgcatgtcaaagacactgtggtccagtacggtatcc
ctatag

>gi568815574f:6146269_6266592|GENSCAN_predicted_peptide_5|560_aa
XSGCGACWWGSASPGPGLWLPSSCAQLSLLGTGALWPVRDLRVQRRSSGNLGPRRCGTRF
SAGRRPWVLQGAGLLGSGPPEPTGAGHGLGWAGLRRPRVCGSTQEKTVFRLEAMLERTAG
VQSKEAALEEEAVLKVEDIMAEVEVVVEVEPDVGWQKEGQRAQPGPGPSTPGPSMDSLEV
LHLELGSVNAPGHRASPPSLGARGREGEPGPALTRENRSAKVPLRTAQIRRTRFPGNVPG
GRGICMPLPAIEPPLLSVPVSSRLTPETQAATTCPLFLRPVYGQPRPSWPVSTHSKNHHS
CGVASSPDRDRGPTMKGDWPNVWEMALFHIVCVLAKLQGVTRLAHPILWRVLAQRWRNSA
IPVTGGRMISFHPNLYFHSEIIMKEHCVGILGYRVSHSTAVQRFWDHEGQASSCRQYTSY
LSSFSCLAEHDCPGFGRIAELVHLTRNLEDHTGLQCEAMFYFLQDTGRLAPLQSSGSRVM
QVKARVSRSWLSDCEGPGYGTIAEVGQLWGIMAKDLLRHSLASEELALNQNLTCHDQFAQ
STRSSARACGSIFCSTTQGS

>gi568815574f:6146269_6266592|GENSCAN_predicted_CDS_5|1683_bp
nncagcggttgcggtgcctgttggtggggctctgcaagcccagggccggggctctggctc
ccgagctcctgtgcgcagttgagcctgctggggaccggagccctttggccagtgcgggat
ctgcgggtccagcggaggtcctcaggaaacctgggtccacgtaggtgtgggaccaggttc
tcagcagggcgacgcccgtgggtcttgcagggagcgggtctgctggggagcgggccccca
gagcctacgggtgcggggcatgggctgggctgggctgggctgcgcaggcccagggtctgt
gggagcacccaggagaaaaccgtgttcaggctggaggcaatgctggagaggacggccggg
gtacagagcaaggaggcggccttggaagaggaggcggtgctgaaggtggaagacatcatg
gctgaggtggaggtggtggttgaggtggagcccgacgtggggtggcagaaggagggccag
cgggcacagcctggccctggaccgagcacaccggggccgtcaatggactcgctggaggtc
cttcacttggagctgggctccgtgaatgccccaggccacagagcatctccgccttccctg
ggagcacgtggtagggaaggggagccagggccagcactgacaagggagaatcgcagcgcc
aaggtccctttgcgcacagcccaaattcgaaggacacgtttccctgggaacgtccctgga
ggacggggaatctgtatgccattaccagccattgaaccacccctgctctcggtgcctgtt
tccagcaggctcaccccagaaacacaagctgcaaccacctgcccactttttctgcgtccc
gtctatggtcagcccaggccgtcttggccggtgtccacccactccaaaaaccaccacagt
tgtggcgttgcctcctcgccagacagagatagagggccaacaatgaagggtgactggcca
aatgtctgggagatggccctgttccacattgtctgtgttcttgcgaaattgcaaggcgtc
acgaggcttgcccacccaatcctctggagagttcttgcgcagaggtggaggaactcagcc
atcccggttaccggtggcaggatgatttcctttcatcccaacctttatttccacagtgaa
atcatcatgaaggagcactgtgttggcatcctcggctacagggtgtctcattccactgca
gtccagcggttctgggatcacgaaggtcaagcctccagctgcaggcagtacacctcctac
ctgagctcattcagctgtttggctgaacatgactgcccgggttttggcaggattgctgag
ttagtccacctcacacggaatctggaggaccacactgggctccagtgtgaggcaatgttt
tattttcttcaggatacagggcgactggctccactgcagtccagtggttctagggtcatg
caggtgaaagcccgagtttcccgcagctggttgtctgactgtgagggcccaggttacggc
acgattgctgaggtggggcagctatggggcatcatggcaaaggaccttcttcgacattcc
ttggcatcggaggaattggctttgaaccagaacctgacctgtcacgaccaatttgcccag
tccaccagatcatcagccagggcctgtggctctatattctgcagcactacccaagggagt
tag