GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:50:27 Sequence gi568815582f:29563729_29764928 : 201200 bp : 48.61% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 1778 1623 156 0 0 87 110 128 0.979 14.88 1.02 Intr - 2521 2358 164 0 2 88 38 74 0.666 2.02 1.01 Init - 10051 10044 8 1 2 106 68 23 0.411 1.15 1.00 Prom - 18043 18004 40 -5.66 2.00 Prom + 21781 21820 40 -7.36 2.01 Init + 31257 31412 156 2 0 94 78 310 0.865 30.11 2.02 Term + 36973 37203 231 0 0 58 44 148 0.489 3.87 2.03 PlyA + 39909 39914 6 1.05 3.00 Prom + 40850 40889 40 -5.36 3.01 Init + 42044 42143 100 1 1 72 46 152 0.943 7.82 3.02 Term + 43468 43697 230 2 2 39 46 206 0.533 8.29 3.03 PlyA + 43943 43948 6 1.05 4.02 PlyA - 45227 45222 6 1.05 4.01 Sngl - 49912 49247 666 1 0 82 44 1286 0.928 119.78 4.00 Prom - 54528 54489 40 -5.56 5.10 PlyA - 54964 54959 6 -1.75 5.09 Term - 55200 55057 144 0 0 85 44 158 0.993 9.01 5.08 Intr - 59145 58990 156 0 0 108 63 195 0.418 19.21 5.07 Intr - 65201 65032 170 0 2 73 -8 178 0.641 6.37 5.06 Intr - 65700 65606 95 2 2 71 52 44 0.373 -1.29 5.05 Intr - 68420 68098 323 2 2 35 46 144 0.303 -0.44 5.04 Intr - 69677 69615 63 2 0 97 121 79 0.762 11.01 5.03 Intr - 72600 72482 119 1 2 102 68 96 0.992 9.18 5.02 Intr - 72802 72663 140 0 2 59 92 102 0.814 7.81 5.01 Init - 78514 78507 8 1 2 114 91 0 0.860 3.40 5.00 Prom - 87096 87057 40 -6.76 6.00 Prom + 96876 96915 40 -5.56 6.01 Sngl + 100001 101203 1203 1 0 73 42 968 0.996 84.55 6.02 PlyA + 101765 101770 6 1.05 7.00 Prom + 108459 108498 40 -7.16 7.01 Init + 115470 115482 13 2 1 109 98 9 0.147 3.97 7.02 Intr + 130936 131471 536 2 2 129 48 743 0.926 67.04 7.03 Intr + 133217 133399 183 1 0 74 94 203 0.646 19.48 7.04 Term + 133471 133683 213 0 0 111 37 310 0.936 25.43 7.05 PlyA + 133893 133898 6 1.05 8.08 PlyA - 138908 138903 6 1.05 8.07 Term - 144496 144378 119 2 2 84 48 86 0.812 2.90 8.06 Intr - 149027 148949 79 2 1 51 99 36 0.045 0.12 8.05 Intr - 180210 180068 143 1 2 68 46 87 0.157 2.57 8.04 Intr - 180819 180726 94 0 1 69 93 12 0.186 -0.66 8.03 Intr - 181223 180885 339 2 0 109 80 234 0.083 20.37 8.02 Intr - 185506 185303 204 1 0 4 71 165 0.272 5.80 8.01 Init - 185549 185529 21 2 0 60 113 56 0.294 3.71 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 197383 197483 101 2 2 83 54 95 0.915 3.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:29563729_29764928|GENSCAN_predicted_peptide_1|110_aa MRRTDSASADPGNLKYSSSRDRGGSSSYGLQPSNSAVVSRQRHDDTRVHADIQNDEKGGY SVNGGSGENTYGRKSLGQELRVNNVTSPEFTSVQHGSRALATKDMRKSQX >gi568815582f:29563729_29764928|GENSCAN_predicted_CDS_1|330_bp atgcgcagaactgatagtgcatcagctgacccaggtaatttaaaatattcttcatccaga gatagaggtggttcttcctcttacggactgcaaccttcaaattcagctgtggtgtctcgg caaaggcacgatgataccagagtccacgctgacatacagaatgacgaaaagggtggctac agtgtcaatggaggatctggggaaaatacttatggtcggaagtcgttggggcaagagctg agggttaacaatgtgaccagccctgagttcaccagtgttcagcatggcagtcgtgcttta gccaccaaagacatgaggaaatcacaggnn >gi568815582f:29563729_29764928|GENSCAN_predicted_peptide_2|128_aa MAKRRRPEKRESPPEPAAADTPLRPGAEEEAEEEEEKEEEEEEEEEAAAEGGTAPTISTD RQPPPSPPTDSPRHLHRQTAPTISTGRQPPPSPLTDSPHLLHRQTAPTIFTDRQPPPSPP IDSAHLLH >gi568815582f:29563729_29764928|GENSCAN_predicted_CDS_2|387_bp atggccaagcgccgccgcccagagaagcgcgagtcgccgcccgaaccggccgctgccgac accccgctccggcccggggctgaggaggaagccgaggaggaggaagagaaggaggaggag gaggaggaggaggaggaggcggcggcggagggcgggacagcccccaccatctccaccgac agacagcccccaccttctccaccgacagacagcccccgccatcttcaccgacagacagcc cccaccatctccaccggcagacagcccccaccatctccactgacagacagcccccacctt ctccaccgacagacagcccccaccatcttcaccgacagacagcccccaccatctccacca atagacagcgcccaccttctccactga >gi568815582f:29563729_29764928|GENSCAN_predicted_peptide_3|109_aa MCPTLDTPLQGVALTLAWALGGLRSPLFCVKAAAGSHASPNCDRETGGGYFCFQGMEELS AGHFPNTYALGELTETEPGYLLHDAVSPSGPSSESTSLEVEAPEAHSIF >gi568815582f:29563729_29764928|GENSCAN_predicted_CDS_3|330_bp atgtgccctaccttggacacccctctacagggggtggctctcaccctggcctgggccctc gggggtctgcgcagcccactcttctgtgtgaaggcggctgcaggctcccatgcttcaccc aactgcgacagggaaacaggagggggctacttctgctttcaaggcatggaggagctgtct gcaggccacttcccgaacacctatgcgcttggggaactgactgagactgagccggggtat cttctccacgatgcggtgagcccatcaggcccttcttctgagagcacctccctggaggtg gaagcccctgaggcccacagcatcttctga >gi568815582f:29563729_29764928|GENSCAN_predicted_peptide_4|221_aa MAGAGPKRRALAAPVAEEKEEAREKMLASKRADGAAPAGEGEGVTLQRNITLLNGVAIIV GAIIGSGIFVTPTGVLKEAGSPGLALVMWAACGVFSIVGALCYAELGTTISKSGGDYAYM LDVYGSLPAFLKLWIELLVIRPSSQYIVALVFATYLLKPLFPSCPVPEEAAKLMACHCVR EYGARAGGWEVVVPGSLHPCITTPLAPTSLHFCVPGPSRQF >gi568815582f:29563729_29764928|GENSCAN_predicted_CDS_4|666_bp atggcgggtgcgggcccgaagcggcgggcgctagcggccccggtggccgaggagaaggaa gaggcgcgggagaagatgctggcctccaagcgcgcggacggcgcggcgccggcaggcgag ggcgagggcgtgaccctgcagcggaacatcacgctactcaacggcgtggccatcatcgtg ggcgccatcatcggctcgggcatcttcgtgacgcccacgggcgtgcttaaggaggcaggc tcgccggggctggcgctggtgatgtgggccgcgtgcggcgtcttctccatcgtgggcgcg ctctgctacgcggagctgggcaccaccatctccaaatcgggcggcgactacgcctacatg ctggacgtctacggctcgctgcccgccttcctcaagctctggatcgagctgctcgtcatc cggccttcatcgcagtacatcgtggccctggtcttcgccacctacctgctcaagccgctc ttccccagctgcccggtgcccgaggaggcagccaagctcatggcctgccactgcgtgcgt gagtacggggcgcgggccgggggttgggaggtcgtggtccctgggtccctgcatccttgc atcactacgcccctggctcccacatctctacatttttgcgttcctggaccgtcccggcag ttctag >gi568815582f:29563729_29764928|GENSCAN_predicted_peptide_5|405_aa MPRRWAGGGQSEWDRQSPPFTEPTFGLSLQPYSNPLRVNSYNPNVDNASGISGGPLENHY RLKQFHFHWGAVNEGGSEHTVDGHVYPAELGAHHQMLQRLVDVLLEVKHKLSTRFPASSA QLSIRFPASSTARKDGGDKSVAFLQDQLRKGFEGEGEIPKKCSLFIQQQTKEFQGRRSQR KPQMCGRMRLQPVLTMVRAGPDGAGASPSSRFKEPFDLVTKHCSCGILFTTPARSTCPGS GQGDIAPHLGPYQCRPVTMHSAPSTQAQPQCGPGDPGQTLPEWRPRGFAFDPLTIARSFG KNCPQRDARAAMRPFDPSALLPTCWDYWTYVGSLTTPPLTESVTWIIQKEPVEVAPSQLS AFRTLLFSALGEEKKMTVNNYRPLQPLMNWKVWASFQATNEGTRS >gi568815582f:29563729_29764928|GENSCAN_predicted_CDS_5|1218_bp atgcccaggcgctgggctggcggggggcagagtgaatgggacagacagagcccacctttc acggagcccacctttggcttgtcacttcaaccatacagcaaccccttgcgagtgaactca tataaccctaatgtagacaatgcctcaggaattagtggcgggcccttggaaaaccactac agactgaagcaatttcacttccactggggagcagtgaacgaggggggctcagagcacaca gtggacggccacgtgtaccccgcagagctcggggcccatcatcagatgctgcagaggctg gtggacgtcttgctggaagtaaaacataagctctccacacgcttccctgcatccagcgcc cagctctccatacgcttccctgcatccagcacagctcgaaaagatggcggggacaagagt gtcgcctttctccaagatcagctgagaaaaggctttgagggagaaggagaaattccaaag aaatgtagtttatttattcagcagcaaacaaaagaattccagggcagaaggagtcaacgc aagcctcagatgtgcggacggatgaggcttcaacctgtgttaacaatggtgcgggctggt ccagatggagctggggcgagcccttcctcgcggttcaaggagccgtttgatctagtcaca aagcactgctcctgcggcatcctgttcaccactcccgccaggagcacctgcccaggctca gggcaaggggacatcgccccacacctcggcccctaccagtgccgacctgtcactatgcac tcagctcccagcacgcaggcccagccccagtgtggccccggtgaccctgggcagacactc ccagaatggagacctcgaggcttcgcctttgaccctctcaccatagccagaagttttggc aaaaactgcccacagagggacgcgcgggcggccatgcgccccttcgacccctccgctctg ctgcccacctgctgggattactggacctatgtgggctcgctcaccaccccgccgctgacc gagtcggtcacctggatcatccagaaggagcccgttgaagtggccccaagccaactctct gcatttcgtactctcctgttttctgcacttggtgaagagaagaagatgacggtgaacaac tatcgtccacttcaacccctgatgaactggaaggtctgggcgtccttccaggccactaat gagggcacaagatcctag >gi568815582f:29563729_29764928|GENSCAN_predicted_peptide_6|400_aa MATLLLLLGVLVVSPDALGSTTAVQTPTSGEPLVSTSEPLSSKMYTTSITSDPKADSTGD QTSALPPSTSINEGSPLWTSIGASTGSPLPEPTTYQEVSIKMSSVPQETPHATSHPAVPI TANSLGSHTVTGGTITTNSPETSSRTSGAPVTTAASSLETSRGTSGPPLTMATVSLETSK GTSGPPVTMATDSLETSTGTTGPPVTMTTGSLEPSSGASGPQVSSVKLSTMMSPTTSTNA STVPFRNPDENSRGMLPVAVLVALLAVIVLVALLLLWRRRQKRRTGALVLSRGGKRNGVV DAWAGPAQVPEEGAVTVTVGGSGGDKGSGFPDGEGSSRRPTLTTFFGRRKSRQGSLAMEE LKSGSGPSLKGEEEPLVASEDGAVDAPAPDEPEGGDGAAP >gi568815582f:29563729_29764928|GENSCAN_predicted_CDS_6|1203_bp atggccacgcttctccttctccttggggtgctggtggtaagcccagacgctctggggagc acaacagcagtgcagacacccacctccggagagcctttggtctctactagcgagcccctg agctcaaagatgtacaccacttcaataacaagtgaccctaaggccgacagcactggggac cagacctcagccctacctccctcaacttccatcaatgagggatcccctctttggacttcc attggtgccagcactggttcccctttacctgagccaacaacctaccaggaagtttccatc aagatgtcatcagtgccccaggaaacccctcatgcaaccagtcatcctgctgttcccata acagcaaactctctaggatcccacaccgtgacaggtggaaccataacaacgaactctcca gaaacctccagtaggaccagtggagcccctgttaccacggcagctagctctctggagacc tccagaggcacctctggaccccctcttaccatggcaactgtctctctggagacttccaaa ggcacctctggaccccctgttaccatggcaactgactctctggagacctccactgggacc actggaccccctgttaccatgacaactggctctctggagccctccagcggggccagtgga ccccaggtctctagcgtaaaactatctacaatgatgtctccaacgacctccaccaacgca agcactgtgcccttccggaacccagatgagaactcacgaggcatgctgccagtggctgtg cttgtggccctgctggcggtcatagtcctcgtggctctgctcctgctgtggcgccggcgg cagaagcggcggactggggccctcgtgctgagcagaggcggcaagcgtaacggggtggtg gacgcctgggctgggccagcccaggtccctgaggagggggccgtgacagtgaccgtggga gggtccgggggcgacaagggctctgggttccccgatggggaggggtctagccgtcggccc acgctcaccactttctttggcagacggaagtctcgccagggctccctggcgatggaggag ctgaagtctgggtcaggccccagcctcaaaggggaggaggagccactggtggccagtgag gatggggctgtggacgccccagctcctgatgagcccgaagggggagacggggctgcccct taa >gi568815582f:29563729_29764928|GENSCAN_predicted_peptide_7|314_aa MDAEGLALLLPPVTLAALVDSWLREDCPGLNYAALVSGAGPSQAALWAKSPGVLAGQPFF DAIFTQLNCQVSWFLPEGSKLVPVARVAEVRGPAHCLLLGERVALNTLARCSGIASAAAA AVEAARGAGWTGHVAGTRKTTPGFRLVEKYGLLVGGAASHRYDLGGLVMVKDNHVVAAGG VEKCQVLGPVLTLVLPAAQQAVRAARQAADFTLKVEVECSSLQEAVQAAEAGADLVLLDN FKPEELHPTATVLKAQFPSVAVEASGGITLDNLPQFCGPHIDVISMGMLTQAAPALDFSL KLFAKEVAPVPKIH >gi568815582f:29563729_29764928|GENSCAN_predicted_CDS_7|945_bp atggacgctgaaggcctggcgctgctgctgccgcccgtcaccctggcagccctggtggac agctggctccgagaggactgcccagggctcaactacgcagccttggtcagcggggcaggc ccctcgcaggcggcgctgtgggccaaatcccctggggtactggcagggcagcctttcttc gatgccatatttacccaactcaactgccaagtctcctggttcctccccgagggatcgaag ctggtgccggtggccagagtggccgaggtccggggccctgcccactgcctgctgctgggg gaacgggtggccctcaacacgctggcccgctgcagtggcattgccagtgctgccgccgct gcagtggaggccgccaggggggccggctggactgggcacgtggcaggcacgaggaagacc acgccaggcttccggctggtggagaagtatgggctcctggtgggcggggccgcctcgcac cgctacgacctgggagggctggtgatggtgaaggataaccatgtggtggccgccggtggc gtggagaagtgccaggtgctgggcccagtcctcacccttgtcctcccggctgcccagcag gcggtgcgggcggccagacaggcggctgacttcactctgaaggtggaagtggaatgcagc agcctgcaggaggccgtgcaggcagctgaggctggtgccgaccttgtcctgctggacaac ttcaagccagaggagctgcaccccacggccaccgtgctgaaggcccagttcccgagtgtg gctgtggaagccagtgggggcatcaccctggacaacctcccccagttctgcgggccgcac atagacgtcatctccatggggatgctgacccaggcggccccagcccttgatttctccctc aagctgtttgccaaagaggtggctccagtgcccaaaatccactag >gi568815582f:29563729_29764928|GENSCAN_predicted_peptide_8|332_aa MKLRTLTAACPEFVPSDVRTCSEFLPSGGFVVSLASGVKLQTFTMLQLIKAVQTQDKQQQ DLLERSKEQSFLCGRMPLTPEPPSGRVEGPPAWEAAPWPSLPCGPCIPIMLVLATLAALF ILTTAVLAERLFRRALRPDPSHRAPTLVWRPGGELWIEPMGTARERSEDWYGSAVPLLTD RAPEPPTQQLGPPDRTGGPSPEHLLGAPALGGEAPRHRPVFTEATNTKENGSCGAEQPRC GDDSWQEMTGIQFPRATNPVLLGHSPTKFTENINNDPYYPFYRFAQGEAKSPSGIHPREI ENVCPHQNQYMNFHSSIIHNSQNVETTHMSIN >gi568815582f:29563729_29764928|GENSCAN_predicted_CDS_8|999_bp atgaagctgcggaccctcacggcggcatgtccggagtttgttccttctgatgttcggacg tgttccgagtttcttccttctggtgggttcgtggtctcgctggcctctggcgtgaagctg cagaccttcacgatgttacagctcataaaggcagtgcagacccaggataagcagcagcaa gatttattggaaagatcaaaagaacaaagcttcctatgtggaaggatgccgttgactcca gagccgccctctgggcgcgtggaggggccccccgcatgggaagcagccccatggccctca ctgccctgtgggccctgcatccccatcatgctggtcctggccaccctggctgcgctcttc atcctcaccaccgctgtgttggctgaacgcctgttccgccgtgctctccgcccagacccc agccaccgtgcacccaccctggtgtggcgcccaggaggagagctgtggattgagcccatg ggcaccgcccgagagcgctctgaggactggtatggctctgcggtccccctgctgacagat cgggcccctgagcctcccacccagcaacttgggcccccagaccgtactggaggtcccagc ccggagcaccttctgggggccccagccctgggaggggaggccccccgccacaggcctgtg tttaccgaagccaccaataccaaagagaacgggtcctgcggtgctgaacagcctcggtgt ggcgatgacagctggcaggagatgacaggaatccagtttcccagagccacaaatcctgtt ctccttggccactcacccactaaatttacagagaatataaacaatgatccttattatccc ttttaccggtttgcacagggagaagccaaaagcccatctggtatacacccaagagaaatt gaaaacgtgtgcccacaccaaaaccagtacatgaattttcatagcagcataattcataat agccaaaacgtagaaacaacccacatgtccatcaactga