GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:52:43 Sequence gi568815589r_40381710 : 201908 bp : 38.20% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2664004 2664041 38 2 2 86 76 33 0.213 1.43 1.02 Intr + 2675759 2675815 57 1 0 70 100 49 0.213 1.38 1.03 Term + 2676209 2676443 235 1 1 41 38 220 0.314 7.11 1.04 PlyA + 2676808 2676813 6 1.05 2.00 Prom + 2683117 2683156 40 -3.05 2.01 Init + 2694026 2694112 87 0 0 21 88 176 0.982 11.59 2.02 Term + 2697137 2697208 72 0 0 107 37 15 0.234 -4.77 2.03 PlyA + 2698364 2698369 6 1.05 3.03 PlyA - 2698392 2698387 6 1.05 3.02 Term - 2708465 2708287 179 2 2 50 46 187 0.550 7.57 3.01 Init - 2711545 2711287 259 0 1 35 68 144 0.346 4.45 3.00 Prom - 2712973 2712934 40 -5.15 4.00 Prom + 2713311 2713350 40 -6.75 4.01 Sngl + 2714087 2715523 1437 0 0 37 43 417 0.978 27.62 4.02 PlyA + 2715543 2715548 6 1.05 5.07 PlyA - 2719184 2719179 6 1.05 5.06 Term - 2728516 2728157 360 0 0 34 47 201 0.809 4.15 5.05 Intr - 2729019 2728801 219 2 0 55 80 246 0.938 18.08 5.04 Intr - 2729348 2729230 119 2 2 65 78 112 0.656 7.16 5.03 Intr - 2729694 2729557 138 0 0 131 53 142 0.991 14.51 5.02 Intr - 2730246 2730123 124 2 1 67 105 109 0.990 9.74 5.01 Init - 2730379 2730332 48 0 0 74 60 58 0.413 2.60 5.00 Prom - 2732973 2732934 40 -11.64 6.00 Prom + 2733254 2733293 40 -8.05 6.01 Init + 2734064 2734214 151 0 1 59 33 167 0.773 8.57 6.02 Intr + 2734269 2734432 164 0 2 85 37 38 0.236 -2.83 6.03 Intr + 2737734 2738005 272 1 2 121 70 161 0.027 13.02 6.04 Intr + 2740759 2740929 171 0 0 14 65 139 0.004 2.34 6.05 Intr + 2745520 2745668 149 0 2 104 43 61 0.353 2.16 6.06 Intr + 2746108 2746304 197 1 2 48 37 174 0.891 6.51 6.07 Intr + 2747259 2747366 108 1 0 69 98 105 0.985 9.16 6.08 Intr + 2749708 2749860 153 2 0 97 113 25 0.838 5.25 6.09 Intr + 2752911 2753010 100 1 1 67 68 29 0.422 -2.44 6.10 Term + 2757459 2757574 116 0 2 98 55 83 0.367 3.75 6.11 PlyA + 2758137 2758142 6 1.05 7.00 Prom + 2759147 2759186 40 -5.55 7.01 Sngl + 2762472 2763698 1227 1 0 42 49 398 0.864 26.85 7.02 PlyA + 2764909 2764914 6 1.05 8.10 PlyA - 2765197 2765192 6 1.05 8.09 Term - 2777702 2777587 116 2 2 93 49 60 0.051 0.35 8.08 Intr - 2783732 2783663 70 1 1 74 78 49 0.032 0.34 8.07 Intr - 2789046 2788963 84 2 0 111 97 25 0.171 4.80 8.06 Intr - 2810091 2810007 85 1 1 62 65 48 0.120 -1.20 8.05 Intr - 2811280 2811042 239 0 2 66 80 169 0.721 9.39 8.04 Intr - 2812442 2812403 40 0 1 80 68 41 0.585 -1.39 8.03 Intr - 2818051 2817998 54 2 0 84 82 49 0.267 1.08 8.02 Intr - 2818611 2818511 101 2 2 52 76 76 0.365 0.79 8.01 Intr - 2820318 2820169 150 2 0 66 63 80 0.433 2.74 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 2743298 2743416 119 0 2 78 61 117 0.847 7.72 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r_40381710|GENSCAN_predicted_peptide_1|109_aa MNSHVLSEGPGGSILNATDSVSNIFNGGQTGHLDETGAVTKLISDLANRVCSATDLCKYC AVEPFCDRSLLLTNRRMQTETCAAFTADAQQPAYTDKANENPLIVDHLK >gi568815589r_40381710|GENSCAN_predicted_CDS_1|330_bp atgaattcccatgtgttgtcagagggacctggtgggagcattttgaatgctactgattct gtgagtaatattttcaacggaggtcagacaggccatcttgatgaaactggggcagtcaca aaactgattagtgatctggcaaacagggtctgctctgctactgatctgtgtaaatattgt gctgtagagcctttctgtgacaggtcattgctgctaacaaaccgcagaatgcaaacagag acctgtgctgcgttcactgcagatgcacagcagccagcttacactgacaaagcaaatgaa aatcctcttattgtcgatcatctcaagtaa >gi568815589r_40381710|GENSCAN_predicted_peptide_2|52_aa MKTKKEDEEEDEDEETEEEGKENKEEEQEMVSSAGLFNMFVFRGSSIGTGGS >gi568815589r_40381710|GENSCAN_predicted_CDS_2|159_bp atgaaaacaaagaaagaggatgaggaggaagatgaggatgaggagactgaggaggaaggc aaggagaataaggaagaggagcaagagatggtgtcttctgctggactatttaatatgttt gtatttagaggcagctccatagggactggaggatcctag >gi568815589r_40381710|GENSCAN_predicted_peptide_3|145_aa MSRDGSRFRLKKQSGHDLPQLLCSRSKPPCLPSTTRGKQPTDATVIATTPSPGNSVILDS LQSAVTGHSLSGDESLHSSVLGTQGPGVKLLTFLVVVTAHKGSVDPKSVQQDLLQTAKTQ TLHTTETDPSRLSLSLVAAYFYSFT >gi568815589r_40381710|GENSCAN_predicted_CDS_3|438_bp atgagtagggatggatccaggttccgcctaaaaaagcagtctggccatgatctgccacag ctgctgtgctcccggtctaaacctccctgtctccccagcaccaccaggggaaaacagcca actgatgccacagtgatagcaaccaccccttcccctgggaactcagtcatcttagacagt ctccagtctgctgtcactggccacagcctgagtggcgatgagagtctgcacagctctgtg cttggcacccaaggccctggagtgaagctgctgaccttcctggtggttgtcacagctcat aaaggcagtgtggacccaaagagtgtgcagcaagatttactgcaaacagcaaaaacacaa accctccacaccacggaaacggacccaagccggttatcactgtcccttgtggcagcctac ttttattccttcacctga >gi568815589r_40381710|GENSCAN_predicted_peptide_4|478_aa MNIDAKILNKILAKRIQEHIKKLIHHDQVGFITGMQGWFNIRKSINVFQHINRAKDKNHM IISIDAEKAFDRIQQPFMLKTLNTLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKT GTRQGCPLSPLLFNTVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSA QNLLKLISNFSKVSEYKINVQKSQALLYTNNTQTESQIMSELPFTIASKRIKYLGIQLTR DVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMT FFTELEKTTLKFIWNQKRAHITKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRD IDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDHFLTP YTKINSRWIKELNVIPKTIKTLEENLGITIQDIGMGKDFMSTVLKKRISSSEFHIWPN >gi568815589r_40381710|GENSCAN_predicted_CDS_4|1437_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaacgaatccaggagcacatc aaaaagcttatccaccatgatcaagtgggcttcatcactgggatgcaaggctggttcaat atacgcaaatcaataaatgtattccagcatataaacagagccaaagacaaaaaccacatg attatctcaatagatgcagaaaaagcctttgacagaattcaacaacccttcatgctaaaa actctcaatacattaggtattgatgggacatatttcaaaataataagagctatctatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacagggatgccctctctcaccactcctattcaacacagttttggaagttctg gccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtc aaattgtctctgtttgcagacgacatgattgtatatctagaaaaccccattgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcagaatataaaatcaatgta caaaaatcacaagcattgttatacaccaacaacacacaaacagagagccaaatcatgagt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggat acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatg gccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatgact ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccac atcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttc aaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagat atagatcaatggaacagaacagagccctcagaaataacgccgcatatctacaactatctg atctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatgg tgctgggaaaactggctagccatatgtagaaagctgaaactggatcacttccttacacct tatacaaaaatcaattcaagatggattaaagagttaaacgttatacctaaaaccataaaa accctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttcatg tcaacagtcttaaagaaaagaatttccagctcagaatttcatatctggccaaactaa >gi568815589r_40381710|GENSCAN_predicted_peptide_5|335_aa MAEGLRGNAREAQGMGVFGLGSVAHMLLNKTFGSYLGVNLGFGFGVTMGVHVAGRISGAH MNAAVSFTNCALGRVPWRKFPVYVLGQFLGSFLAAATIYRLFYMAILHFSGGELMVTGPV ATAGIFATYLPDHMTLWRGFLNEEWLTGMLQLCLFAITDREKNPALPGTHALVIGILVVI IRVSHGMNTGYAINPSRDLPPPHLHLHCWLGQTGLQSWPEVDEGAVLGISPLSWPLPLAC SNGENLWWVPVVAPLLGASLGGIIYLVFIGSTIPREPLKLEDSVAYEDHGITVLPKMGSH EPMISPLTLISMSPANRSSVHPAPPLHESMALEHF >gi568815589r_40381710|GENSCAN_predicted_CDS_5|1008_bp atggcagagggtcttcgaggcaacgccagggaggcccagggcatgggggtattcggcctt ggttccgtggcccatatgcttctaaataaaacatttgggagctaccttggtgtcaacttg ggttttggcttcggagtcaccatgggagtgcacgtggcaggccgcatctctggagcccac atgaacgcagctgtgagcttcactaactgtgcactgggccgtgtgccctggaggaagttt ccagtctatgtgctggggcagttcctgggctccttcctggcggctgccaccatctacaga ctcttctacatggccattctccacttttcgggtggagagctgatggtgaccggtcccgtt gctacagctggcatttttgccacctaccttcctgatcacatgacattgtggcggggcttc ctgaatgaggagtggctgaccgggatgctccagctgtgtctcttcgccatcacggaccgg gagaaaaacccagcactgccaggaacacacgcgctggtgataggcatcctcgtggtcatc atcagggtgtcccatggcatgaacacaggatatgccatcaatccatcccgggacctgccc cccccgcatcttcaccttcattgctggttggggcaaacaggtcttcagtcttggcccgag gtggatgagggtgctgtcctgggcatcagccccctcagctggcctctgcctcttgcctgc agcaatggggagaacttgtggtgggtgccagtggtggcaccacttctgggtgcctctcta ggtggcatcatctacctggtcttcattggctccaccatcccacgggagccactgaaattg gaggactctgtggcatatgaagaccatgggataactgtattgcccaagatgggatctcat gaacccatgatctctccccttaccctcatctctatgagccctgccaacagatcttcagtc caccctgccccacccttacatgaatccatggccctagagcacttctaa >gi568815589r_40381710|GENSCAN_predicted_peptide_6|526_aa MSSSGNTGLHSTLHGNSQLEWQQLYSLAQIGTLQPAKVPTALSGRSLGPRRPGWLQPLLS LISECGLAHPTLPHSQEASSKGNTPGSCNTDQSTSLVLAAINGAKAPSSEARNGQCDPWG QNGEGVMEGHGRAPLTSCCPPTHHHDICAHELSQELSLHLPLIFLAPYFLDLCYHGPGDH FGATGGPDTVARAHGHPWEEKQSIEQLLCSVMSIAPAERVGTVGHVEEVNGLEEGVGGSQ RKGNLGQEKKAGGGEVSKAGIWLVPLEPSPDALPKITSLIWPAVPWRPSSEAGLCEVRGG VLGKASKAPVKEPQLDRGMGLGAQRRGSSGTEVQSGEALGASGSPRGLLEPRPDWVSNNG AGSLGFQQLPIVDKIRTIAQAVYGAKDIELCPEAQVKIDRYTQQGFGNLPICMAKTDLSL SHQPDKKGVPRDFILPISDVRASIGAGFIYPLVGTQFTQQIHSLGQCCIRHSARGYRKPQ PKEAFSASGAPLTSLLGWTLESRKLLEHVWGISDVTRYECRVGLLD >gi568815589r_40381710|GENSCAN_predicted_CDS_6|1581_bp atgagcagctcaggcaacactggcctgcattccaccctccatggcaacagtcagctggag tggcagcagctatacagccttgcacagattggcactctgcagccagccaaagtccccacc gccctgagtggccgctctcttggcccccgtaggcccggatggctccagcccctcctctcc ctgatcagtgaatgtggcctagcccatccaactctcccccacagccaggaggccagctcc aagggcaacacacctggaagctgcaacacagatcagtccacttcccttgtcctggctgca ataaatggggccaaggctcccagctcagaggcaaggaatggacaatgtgacccatggggt cagaatggggagggggtcatggaaggccatggcagagccccactcacttcgtgctgccca cccactcaccatcatgacatatgtgctcatgaactcagccaggaactctcgctccatctt cctctcatcttcctcgcaccatatttcctggatctttgctatcacggaccaggagaccat tttggagccacgggtggacctgatacagtggcccgagcccatggacacccgtgggaagaa aagcaatcaattgagcaattactatgcagtgtgatgagcattgccccagctgaaagagtg ggcaccgtggggcatgtggaggaagtgaatggcctagaggagggggtaggaggctcccag agaaagggaaatttaggtcaagaaaagaaggctggagggggagaggtcagtaaggcaggg atttggctggtgccacttgaaccaagtccagatgcactgcccaaaataacatccctcatc tggccagcagtgccgtggagaccgagttctgaagcaggcctttgtgaggtcagaggtgga gttctgggaaaggccagcaaggcccccgtaaaggagccacagctggatcggggcatgggg ctcggggcccagcggcgtgggagctccgggaccgaggtgcagtcaggagaggccctgggg gccagcggcagtccaagaggacttttggagccaaggcctgactgggtctcaaacaatggg gcaggcagtttggggtttcagcagcttccaattgtggacaagataaggaccattgcccag gctgtctatggagccaaagatatcgaactctgtcctgaggcacaagtcaaaatagatcgt tacactcagcagggttttggaaatttgcccatctgcatggcaaagaccgatctttctctg tctcaccaacctgacaaaaaaggtgtgccaagggacttcatcttacctatcagtgatgtc cgggccagcataggtgctgggttcatttaccctttggtcggaacgcagttcacccaacaa attcattcactcggccaatgctgcataaggcactcagctcggggctacaggaagccccaa cctaaagaagctttctctgcttcaggtgcaccactgacatcattgctcggctggactctg gaaagcaggaaattgctggagcatgtttggggcatcagtgatgtcacccgctacgaatgc cgggtggggttactggactga >gi568815589r_40381710|GENSCAN_predicted_peptide_7|408_aa MNIDAKILNKILAKRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRTKDKNHM IISMDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKT GTRQGCPLSPLLFNIVMEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSA QNLLKLISNFSKVSGYKINVQKSKAFLYTTNKQTESQIMSELPFTIASKRIKYLGIQLTR DVKDLFKENFKPLLKEIKEDTNKWKNIPCSWVGRINLVKKAILPKVIYTFNAIPIKLPMP FFTELEKTTLKFIWNQKRARITKSILSQKNNAGGITLPDFKLYYKATVTKTAWYWYQNRV IDQWNRTEPSEITPHIYNYLIFDKPEKNKQWEKDPLFNKWCWENWLAI >gi568815589r_40381710|GENSCAN_predicted_CDS_7|1227_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaacgaatccagcagcacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaat atacgcaaatcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatg attatctcaatggatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaa actctcaataaattaggtattgatgggacttatttcaaaataataagagctatctatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacagggatgccctctctcaccactcttattcaacatagtgatggaagttctg gccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtc aaactgtccctgtttgcagatgacatgattgtatatctagaaaaccccattgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgta caaaaatcaaaagcattcttatacaccaccaacaaacaaacagagagccaaatcatgagt gaactcccattcacaattgcttcaaaaagaataaaatacctaggaattcaacttacaagg gatgtgaaggacctcttcaaggagaacttcaaaccactgctcaaggaaataaaagaggat acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatctcgtgaaaaag gccatactgcccaaggtaatttacacattcaatgccatccccatcaagctaccaatgcct ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgc atcaccaagtcaatcctaagccaaaagaacaatgctggaggcatcacactacctgacttc aaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagtt atagatcaatggaacagaacagagccctcagaaataacgccacatatctacaactatctg atctttgacaaacctgagaaaaacaagcaatgggaaaaggatcccctatttaataaatgg tgctgggaaaactggctagccatatga >gi568815589r_40381710|GENSCAN_predicted_peptide_8|312_aa ASKRSKSPLEDATKREFQNCSVKRKVQHCEMNAHITKKFLRLLLSGFYVKASKRSKCPLA DSAKSRFYKKSVSKLPYRKKCSTLLYKKNVSKQLYQKKRSTMIYKKSVSKLLYEVATKLS KCPLGDSTKRVFPTCSIKRKVQICELNAHITKKFLRMPLSSTYVKIFPFPTKASKLSKYP LADFTKSVSKGSMKWSQSASNVHLQIRQKECFKPALSKERFNSDSKGSQCPLADSTKRVF KPAQSKEMFNPASKLSKCPLADSTKRVFQNCSIENVTERVFENYTIEGKVQLSEMNAHIT KNLSECFCLIFM >gi568815589r_40381710|GENSCAN_predicted_CDS_8|939_bp gcctcaaaacgatccaaaagtccacttgaagatgctacaaaaagagagtttcaaaactgc tcagtcaaaagaaaggttcaacactgcgagatgaatgcacacatcacaaagaagtttctc agattgcttctgtctggattttatgtgaaggcctcaaagcgttccaaatgtccacttgca gattctgcaaaaagcagattctacaaaaagagtgtttcaaaactgccctatcgaaagaaa tgttcaaccctattatacaaaaagaatgtttcaaaacagctctatcaaaagaaacgttca acaatgatctacaaaaagagtgtttcaaaactgctctatgaagtggccacaaagctctcc aaatgtccacttggagattctacaaaacgagtgtttccaacctgctctatcaaaagaaag gttcagatctgtgagctgaatgcacacatcacaaagaagtttctgagaatgcctctgtct agtacttatgtgaagatattcccgtttccaacaaaggcctcaaaactgtccaaatatcca cttgcagattttacaaagagtgtttcaaaaggctctatgaaatggtctcaaagcgcttca aatgtccatttgcagattcggcaaaaagagtgtttcaaacctgccctatcaaaagaaaga ttcaactctgactcaaagggctcccaatgtccacttgcagattctacaaaaagagttttt aaacctgctcaatcaaaagaaatgttcaacccggcctcaaagctctccaaatgtccactt gcagattctacaaaaagagtgtttcaaaactgctcaatcgaaaatgttacagaaagagtg tttgaaaactacacaatcgaaggaaaggttcaactctctgagatgaatgcacacatcaca aagaatttgtcagaatgcttctgtctaatatttatgtga