GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:23:08 Sequence gi568815592r:73273111_73473263 : 200153 bp : 44.22% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 11393 11432 40 -1.26 1.01 Init + 18852 19451 600 2 0 89 65 330 0.543 26.09 1.02 Term + 19734 20042 309 2 0 27 38 188 0.836 2.76 1.03 PlyA + 20217 20222 6 1.05 2.04 PlyA - 21409 21404 6 1.05 2.03 Term - 26333 26038 296 0 2 61 34 173 0.884 4.67 2.02 Intr - 36618 36442 177 1 0 78 105 104 0.914 10.99 2.01 Init - 38343 38181 163 0 1 70 72 71 0.799 3.60 2.00 Prom - 39289 39250 40 -4.96 3.00 Prom + 40990 41029 40 -2.46 3.01 Init + 49736 49843 108 1 0 52 94 168 0.970 14.02 3.02 Term + 50007 50168 162 2 0 13 42 177 0.816 3.54 3.03 PlyA + 51015 51020 6 1.05 4.06 PlyA - 54790 54785 6 1.05 4.05 Term - 68399 68301 99 2 0 68 47 53 0.003 -2.67 4.04 Intr - 80639 80536 104 0 2 94 64 39 0.403 1.99 4.03 Intr - 80922 80743 180 1 0 101 78 331 0.915 33.24 4.02 Intr - 81160 81004 157 1 1 3 109 181 0.551 11.28 4.01 Init - 81263 81204 60 2 0 78 80 55 0.571 2.95 4.00 Prom - 84666 84627 40 -4.76 5.00 Prom + 85544 85583 40 -8.26 5.01 Init + 89285 89349 65 1 2 51 61 54 0.804 -0.37 5.02 Intr + 89607 89788 182 0 2 116 109 139 0.940 18.31 5.03 Intr + 89985 90164 180 1 0 82 102 154 0.462 16.04 5.04 Term + 90446 90750 305 0 2 92 49 260 0.962 17.83 5.05 PlyA + 90896 90901 6 1.05 6.05 PlyA - 91801 91796 6 1.05 6.04 Term - 95753 95674 80 0 2 86 47 51 0.837 -1.27 6.03 Intr - 96275 96096 180 0 0 60 78 143 0.843 10.34 6.02 Intr - 96583 96493 91 1 1 53 85 103 0.532 6.07 6.01 Init - 96849 96610 240 0 0 88 35 245 0.433 15.07 6.00 Prom - 107482 107443 40 -2.46 7.00 Prom + 108570 108609 40 -2.16 7.01 Init + 121796 122038 243 1 0 73 31 99 0.034 0.16 7.02 Intr + 127124 127253 130 1 1 98 111 72 0.691 10.77 7.03 Intr + 131580 131661 82 1 1 66 90 49 0.761 1.60 7.04 Intr + 134850 134991 142 0 1 95 17 83 0.744 2.26 7.05 Intr + 136138 136238 101 0 2 107 98 68 0.563 8.61 7.06 Term + 149426 149528 103 2 1 42 50 66 0.090 -4.05 7.07 PlyA + 149532 149537 6 1.05 8.05 PlyA - 150483 150478 6 1.05 8.04 Term - 166082 165985 98 0 2 78 48 71 0.882 0.23 8.03 Intr - 167335 167099 237 2 0 70 63 70 0.493 0.29 8.02 Intr - 172637 172418 220 2 1 76 116 135 0.989 12.97 8.01 Init - 179071 178415 657 1 0 91 105 450 0.592 41.87 8.00 Prom - 180964 180925 40 -9.75 9.00 Prom + 183247 183286 40 -3.36 9.01 Init + 186402 186568 167 2 2 78 86 86 0.697 6.55 9.02 Intr + 188744 188961 218 2 2 28 85 237 0.617 15.55 9.03 Intr + 193099 193298 200 2 2 79 110 44 0.893 4.77 9.04 Intr + 193379 193496 118 1 1 81 79 17 0.256 0.14 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 70101 70171 71 1 2 110 42 54 0.830 1.10 S.002 Intr - 122061 121957 105 1 0 54 36 149 0.802 6.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:73273111_73473263|GENSCAN_predicted_peptide_1|302_aa MAGYDLTTRITHLLDRHLVFPLLEFLSVKEIYKEKELLQGKLDLLSDANMVDFAMDAYKN LYSDDIPHALKENRTTVVAQLKQLQAETELIVKMFEDPETTRQMRSTRDGRMLFDYLADK HGFRQEYLDTFYTYAKFQYECGNYSGAAEYLYFFRVLVPATDRNAVSSLWGKLASEILKQ NWDADMEDLTWLKETIDNNSINFDLDGAKKKLRECESVLVNDFLVACLKDFIDNAHLFIF ETFCHIHQCISINMLADKLNMTPDETERWIVNVIRNARLDAKIDSKLGHVVMVTMYSHPI SK >gi568815592r:73273111_73473263|GENSCAN_predicted_CDS_1|909_bp atggcggggtacgacttaactactcgcatcacgcaccttttggatcggcatctagtcttt ccgctccttgagtttctctctgtaaaggagatatataaagaaaaggaattattacaaggt aaattggaccttcttagtgatgccaacatggtagactttgctatggatgcatacaaaaac ctttattctgatgatattcctcatgctttgaaagagaatagaaccacagttgttgcacaa ctgaaacagcttcaggcagaaacagaactaattgtgaaaatgtttgaagatccagaaacg acaaggcaaatgcggtcaaccagggatggtaggatgctctttgactacctggcggacaag catggttttaggcaggagtatttagatacattctacacatatgcaaaattccagtatgaa tgtgggaattactcaggagcagcagaatatctttattttttcagagtgttggttccagca acagacagaaatgctgtaagttcactctggggaaagctggcctctgaaatcttaaagcag aattgggatgcagacatggaagaccttacatggttaaaagagaccatagataataattct attaactttgacttagatggggctaagaaaaagctgagagaatgtgaatcagtgcttgtg aacgacttcttggtggcttgtcttaaggatttcattgacaatgcccatctcttcatattt gagactttctgtcacatccaccagtgtatcagcattaacatgttggcagacaaattgaac atgactccagatgaaactgaaaggtggattgtaaatgtgattagaaatgcaagactggat gccaagattgattctaaattaggtcatgtggttatggtaacaatgtactctcaccctatc agcaagtga >gi568815592r:73273111_73473263|GENSCAN_predicted_peptide_2|211_aa MDKFLDTYTLPRLNQEKTESLNRPITGSEIEPIINSLPTKKVQDQKDSQPNSTRGQKREM LSAFQRLFRVLFVIETVSEYGVLIFIYGWPFLQTLAMLLIGTVSFHLWIRRNRAPQSAAP LEENVNRAVSNKAAHGTVRLTWLLVVGGVSPFSRGGCCAHPRGCAPENGQTPPGILLLKE KHQLSPERALDPRRPSPVLVQLRAVGGIWWL >gi568815592r:73273111_73473263|GENSCAN_predicted_CDS_2|636_bp atggataaattcctggacacatacaccctcccaagactaaaccaggaaaaaactgaatcc ctgaatagaccaataacaggctctgaaattgagccaataattaatagcctaccaaccaaa aaagtccaggaccagaaggattcacagccgaattctaccagaggccagaaacgcgaaatg ctgtcggccttccagaggctgttccgagtcttatttgtgattgaaacagtctcagaatac ggagtcctgatcttcatttatggatggccctttctccagacccttgcgatgctcctgatc gggacagtgtctttccacctttggatccgcagaaatcgagctccgcagtcagcggctccg ttggaagagaatgtgaaccgcgctgtctccaacaaagctgctcacggaacagtgcgttta acttggctgctggtcgtcggaggcgtcagtcccttcagcaggggcggctgctgtgcccac ccccggggctgtgcgccagagaacggtcagactccgccagggattcttctcctgaaagaa aagcaccaactatctcccgagcgagctctcgacccacgaaggccctcccccgtgctggtc cagctgcgtgctgtcggtggcatctggtggctctag >gi568815592r:73273111_73473263|GENSCAN_predicted_peptide_3|89_aa MWYEIKAQVHNIHLCKDKHGKTGLQLQTTNKGLFVQDRIVQWIVTMHKDSTSHGGFIIKK GKVFPVVKGSSVVLKDSSPTTMCARFKNV >gi568815592r:73273111_73473263|GENSCAN_predicted_CDS_3|270_bp atgtggtatgagatcaaggcccaggtacacaacatccacctgtgcaaagacaaacatggc aagactgggctgcagctgcagaccaccaacaaggggctctttgtgcaggacaggatagtc cagtggattgtcaccatgcacaaggacagcacaagccatggtggcttcatcatcaagaag ggaaaggtcttccctgtggtcaaagggagctctgtggtcctcaaggactcttcaccaacc accatgtgtgccaggttcaagaacgtttaa >gi568815592r:73273111_73473263|GENSCAN_predicted_peptide_4|199_aa MDSGSWPGPGAGPGRGRGHQAWFALESGLGCEGHKMGTLPARRHIPPWVKVPEDLKDPEV FQVQTRLLKAIFGPDGSRIPYIEQVSKAMLELKALESSDLTEVVVYGSYLYKLRTKWMLQ SMAEWHRQRQERAPWASPGVPLASGPSQDPFGNGGPRMPAVGLGDPGTQHHYCYAHFISE KTEVEEVKGPNTVILKVAP >gi568815592r:73273111_73473263|GENSCAN_predicted_CDS_4|600_bp atggactcagggtcctggcccgggccgggggcggggcctggccgggggcggggccaccag gcctggtttgcgctggagagtggtcttggttgtgagggtcataagatgggaactctcccg gcacgtagacatatcccgccgtgggtgaaagttcccgaagacctgaaagatccagaggtg ttccaggtccagacgcggctgctgaaagccattttcggcccggacggatctcgaatccct tacatcgagcaggtgagcaaggccatgctcgagctgaaggctctggagtcttcagacctc accgaggtcgtggtttacggctcctatttgtacaagctccggaccaagtggatgctccag tccatggctgagtggcaccgccagcgccaggagcgagcaccttgggcgtcaccaggggtg cccttggcatctgggccttcccaagacccctttgggaatgggggccccagaatgccggcc gtcggtcttggagacccagggacacaacatcactattgttatgcccattttatcagtgag aaaactgaagttgaggaagttaagggtcctaataccgtcattctcaaagtggccccttga >gi568815592r:73273111_73473263|GENSCAN_predicted_peptide_5|243_aa MVKTPAGHQRIPGAQSPVPPTGTGRSMDAPRRFPTLVQLMQPKAMPVEVLGHLPKRFSWF HSEFLKNPKVVRLEVWLVEKIFGRGGERIPHVQGMSQILIHVNRLDPNGEAEILVFGRPS YQEDTIKMIMNLADYHRQLQAKGSGKALAQDVATQKAETQRSSIEVREAGTQRSVEVREA GTQRSVEVQEVGTQGSPVEVQEAGTQQSLQAANKSGTQRSPEAASKAVTQRFREDARDPV TRL >gi568815592r:73273111_73473263|GENSCAN_predicted_CDS_5|732_bp atggttaagactcccgctgggcaccagcggattcctggtgcccagagcccggttcctcct accgggaccggccgcagcatggacgctcccaggcggtttccgacgctcgtgcaactgatg cagccaaaagcaatgccagtggaggtgctcggtcacctccctaagcggttctcctggttc cactctgagttcctgaagaatccgaaggtagttcgccttgaggtttggctggtggaaaag atcttcggccggggcggagaacgcatcccgcacgtccagggtatgtcccaaatcttgatt cacgtgaatcgattggaccctaacggcgaggctgagatcttggtatttgggaggccttct taccaggaggacacaatcaagatgatcatgaacctggctgactatcaccgccagctccag gcgaaaggctcaggaaaggccctcgcccaggatgtcgccactcagaaggccgagacccag cggtcttcaatagaagtccgggaggccgggacgcagcgttcggtggaggtccgggaggcc gggacccagcgttcggtggaagtccaggaggtcgggacacagggttctccggtggaggtg caggaggccgggacccagcagtctctccaggctgccaacaagtcggggacccagcgatcc cccgaagctgccagcaaggcagtgacccagcggtttcgcgaggatgcccgggacccagtt actagattatga >gi568815592r:73273111_73473263|GENSCAN_predicted_peptide_6|196_aa MPGRRLHPLTPTRCEAPPEGAERVEYGKREPELRAWVWKRSCESRLERAPRLLVPVWSMM LVPLSPSGANRLRPTPWSSCIRIRPWWFPVQELRDPLVFYLEAWLADELFGPDRAIIPEM EWTSQALLTVDIVDSGNLVEITVFGRPRVQNRVKSMLLCLAWFHREHRARAEKMKHLEKN LKAHASDPHSPQDPVA >gi568815592r:73273111_73473263|GENSCAN_predicted_CDS_6|591_bp atgccggggcgtcgcttgcacccgctcaccccgacgcggtgcgaggcgccgccggaaggg gcggagcgggtcgaatatggtaaaagagagcccgagcttcgcgcctgggtctggaagagg tcttgcgaaagccgcctcgagcgcgctccgcggctgctggtcccagtatggtcgatgatg ctggtgccgctgagtcccagcggggcaaacagactccggcccactccctggagcagctgc attcgcatccggccctggtggtttccggtgcaggaactgagagaccctttggtgttctac ctagaggcatggctggcagacgagctctttggcccagaccgagccataattccagaaatg gagtggacgagccaggccctgctgacagtggacatagtggactcagggaacctagtcgaa atcaccgttttcgggcggccccgtgtacagaatcgggtgaagagcatgctcctgtgcctg gcatggtttcaccgagaacatcgtgcccgagctgagaagatgaaacaccttgagaagaac ttgaaggcccatgcatcagacccccactctccccaggatcctgttgcttaa >gi568815592r:73273111_73473263|GENSCAN_predicted_peptide_7|266_aa MSHHGGAPKASTWVVASRRSSTVSRAPERRPAEELNRTGPEGYSVGRGGRWRGTSRPPEA VAAGHEELPLCFALKSHFVGAIIQEQPESLVKIFGSKAMQTKAKAVIDNFVKKLEENYNS ECGIDLPPIKKNFYKESTATSAMSKVEADSWSVCVYGGGNRDEQIEELKKGVDIIIATPG RLNDLQMSNFVNLKNITYLVLDEADKMLDMGFEPQIMKILLDVRPDRQTVMTSQASDEAT ALADVLIIALRDSEQSHTQNFGPLKM >gi568815592r:73273111_73473263|GENSCAN_predicted_CDS_7|801_bp atgtcccaccacggaggagctcccaaggcctctacgtgggtcgttgctagtcggcgaagc tcgacagtgtcccgagcgccagagaggaggccggcggaggagttgaatcgaacaggtcct gagggatatagtgtcggcagaggtggtcgctggagaggcacctctaggcccccggaggcc gtggccgctggtcacgaggaactgccgctgtgttttgctttgaagagccactttgttggc gcgataatacaagaacaaccagaatcattagtcaaaatttttggcagcaaggcaatgcaa acgaaagcaaaagcagtgatagacaattttgttaaaaagctagaagaaaattacaattca gaatgcggaattgatttaccaccaattaagaaaaacttttataaagagtccactgccaca agtgccatgtcaaaagtagaagcagatagttggagtgtttgtgtatatggtggtggaaat agagatgaacaaatagaagagcttaaaaaaggtgtagatatcataattgcaactcccgga agattgaatgatctgcaaatgagtaacttcgtcaatctgaagaatataacctacttggtt ttagatgaagcagacaagatgttggacatgggatttgaaccccagataatgaagattttg ttagatgtgcgcccagataggcagacagttatgaccagtcaagcctcagatgaggctaca gcgctggctgacgtcttaattatagccttgagagactctgagcagagccacacccagaat tttggcccactgaaaatgtga >gi568815592r:73273111_73473263|GENSCAN_predicted_peptide_8|403_aa MQPWHGKAMQRASEAGATAPKASARNARGAPMDPTESPAAPEAALPKAGKFGPARKSGSR QKKSAPDTQERPPVRATGARAKKAPQRAQDTQPSDATSAPGAEGLEPPAAREPALSRAGS CRQRGARCSTKPRPPPGPWDVPSPGLPVSAPILVRRDAAPGASKLRAVLEKLKLSRDDIS TAAGMVKGVVDHLLLRLKCDSAFRGVGLLNTGSYYEHVKISAPNEFDVMFKLEVPRIQLE EYSNTRAYYFVKFKRNPKENPLSQFLEGEILSASKMLSKFRKIIKEEINDIKDTDVIMKR KRGGSPAVTLLISEKISVDITLALESKSSWPASTQEGLRIQNWLSAKVRKQLRLKPFYLV PKHAKEGNGFQDLYRTFGTYYYWIPGKLKETIKQMNQVINEYC >gi568815592r:73273111_73473263|GENSCAN_predicted_CDS_8|1212_bp atgcagccttggcacggaaaggccatgcagagagcttccgaggccggagccactgccccc aaggcttccgcacggaatgccaggggcgccccgatggatcccaccgagtctccggctgcc cccgaggccgccctgcctaaggcgggaaagttcggccccgccaggaagtcgggatcccgg cagaaaaagagcgccccggacacccaggagaggccgcccgtccgcgcaactggggcccgc gccaaaaaggcccctcagcgcgcccaggacacgcagccgtctgacgccaccagcgcccct ggggcagaggggctggagcctcctgcggctcgggagccggctctttccagggctggttct tgccgccagaggggcgcgcgctgctccacgaagccaagacctccgcccgggccctgggac gtgcccagccccggcctgccggtctcggcccccattctcgtacggagggatgcggcgcct ggggcctcgaagctccgggcggttttggagaagttgaagctcagccgcgatgatatctcc acggcggcggggatggtgaaaggggttgtggaccacctgctgctcagactgaagtgcgac tccgcgttcagaggcgtcgggctgctgaacaccgggagctactatgagcacgtgaagatt tctgcacctaatgaatttgatgtcatgtttaaactggaagtccccagaattcaactagaa gaatattccaacactcgtgcatattactttgtgaaatttaaaagaaatccgaaagaaaat cctctgagtcagtttttagaaggtgaaatattatcagcttctaagatgctgtcaaagttt aggaaaatcattaaggaagaaattaacgacattaaagatacagatgtcatcatgaagagg aaaagaggagggagccctgctgtaacacttcttattagtgaaaaaatatctgtggatata accctggctttggaatcaaaaagtagctggcctgctagcacccaagaaggcctgcgcatt caaaactggctttcagcaaaagttaggaagcaactacgactaaagccattttaccttgta cccaagcatgcaaaggaaggaaatggtttccaagatttatataggacttttggcacctac tactattggattccagggaaattgaaggagaccataaaacaaatgaaccaagttataaat gagtattgctga >gi568815592r:73273111_73473263|GENSCAN_predicted_peptide_9|235_aa MSQQRVMCFPLQRAYERTCITKIPIPEKQMFIALGMECNPCGKPLNGRMRGACPCGMFYF RGCGRWVAVSFTKQQFPLARLSSDSAAPRTPHFDVIVIGGGHAGTEAATAAARCGSRTLL LTHRVDTIGQMSCNPSFGGIGKGHLMREVDALDGLCSRICDQSGVHYKVLNRRKGPAVWG LRAQIDRKLYKQNMQKEILNTPLLTVQEGAVEDLILTEPEPEHTGKCRVSGVVLX >gi568815592r:73273111_73473263|GENSCAN_predicted_CDS_9|705_bp atgagccaacagcgggtgatgtgcttccccctgcagagagcctatgaacggacgtgcatc accaagattcctatcccagaaaagcagatgttcatagctctgggaatggaatgcaaccct tgtggaaagcctctaaacggacgcatgaggggcgcctgtccctgtggcatgttctacttc cgaggctgtggccgttgggtcgcggtttccttcaccaagcagcaatttccgttggcacgg ttgagcagtgacagcgcggcgccccggactccgcacttcgacgtgatagtcattggtgga ggacatgccgggactgaggcagccaccgccgccgctcggtgcggctctcggactctgctc ctcactcaccgcgtggacacgatcggtcagatgtcatgtaatccttcctttggtggcatc ggaaagggacatttaatgagggaagtagatgccttggatggcctgtgttctcgcatctgt gaccagtctggtgtacattataaagtattaaaccggcgtaagggaccagctgtgtggggt ctgagagctcagattgataggaaactctataaacagaacatgcagaaagaaatcttgaat acaccactgcttactgttcaggagggagctgtagaagatcttattcttacagaaccagag cctgaacacactgggaaatgccgtgtcagtggggttgttttggnn