GENSCAN 1.0	Date run: 6-Nov-116	Time: 23:20:47

Sequence gi568815575f:23567571_23786332 : 218762 bp : 43.10% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3306   3318   13  2  1  102  127    -2 0.740   5.21
 1.02 Term +   5050   5240  191  2  2   76   38   104 0.501   1.81
 1.03 PlyA +   5322   5327    6                               1.05

 2.00 Prom +   8892   8931   40                              -3.06
 2.01 Sngl +  24664  25191  528  0  0   60   42   195 0.657   8.16
 2.02 PlyA +  26087  26092    6                               1.05

 3.00 Prom +  37847  37886   40                              -4.26
 3.01 Init +  41840  41987  148  1  1   52   82   108 0.648   6.85
 3.02 Intr +  58238  58339  102  0  0   70   95    37 0.227   2.75
 3.03 Intr +  67831  67923   93  2  0  102   77    23 0.583   2.54
 3.04 Term +  69159  69319  161  1  2   71   54    73 0.447   0.30
 3.05 PlyA +  70918  70923    6                               1.05

 4.02 PlyA -  74770  74765    6                               1.05
 4.01 Sngl -  79936  78935 1002  1  0   86   37   285 0.770  20.15
 4.00 Prom -  81835  81796   40                              -4.96

 5.03 PlyA -  82004  81999    6                               1.05
 5.02 Term -  83145  82858  288  0  0 -104   54   344 0.630   7.18
 5.01 Init -  84671  84411  261  2  0   63   50    98 0.593   0.57
 5.00 Prom -  92145  92106   40                              -4.36

 6.00 Prom +  94617  94656   40                              -5.66
 6.01 Init +  96950  97148  199  1  1  105   82   131 0.978  11.59
 6.02 Intr +  99939 100241  303  1  0   14   39   346 0.910  18.86
 6.03 Intr + 103959 104076  118  1  1   36   78   141 0.883   7.42
 6.04 Intr + 114826 114956  131  1  2  113   77    89 0.943  10.64
 6.05 Intr + 116101 116135   35  2  2  101   98    16 0.856   1.84
 6.06 Term + 118715 118765   51  1  0  102   54    37 0.746  -0.87
 6.07 PlyA + 118803 118808    6                               1.05

 7.11 PlyA - 119086 119081    6                               1.05
 7.10 Term - 124266 124126  141  0  0   76   54    34 0.303  -3.27
 7.09 Intr - 124711 124463  249  1  0   33  119    91 0.042   4.33
 7.08 Intr - 129953 129916   38  0  2   79   79    17 0.031  -2.12
 7.07 Intr - 137274 137124  151  0  1   60  107    79 0.782   6.74
 7.06 Intr - 137510 137423   88  1  1   86   96    26 0.763   3.17
 7.05 Intr - 139169 139058  112  0  1   96   78    77 0.918   6.94
 7.04 Intr - 155183 155100   84  0  0   87  111    59 0.842   7.89
 7.03 Intr - 162994 162957   38  0  2  101   69    23 0.823  -0.39
 7.02 Intr - 163416 163246  171  2  0   99   87    68 0.590   6.86
 7.01 Init - 168342 168284   59  0  2   95   71    42 0.630   3.98
 7.00 Prom - 185656 185617   40                              -5.86

 8.00 Prom + 190530 190569   40                              -5.06
 8.01 Init + 197915 198035  121  1  1   53   55    50 0.786  -1.45
 8.02 Intr + 198365 198494  130  0  1   65   84    67 0.903   3.75
 8.03 Intr + 198910 199072  163  1  1   82  100    83 0.844   8.88
 8.04 Intr + 215743 215847  105  0  0   30   69    93 0.042   2.01
 8.05 Intr + 216088 216139   52  0  1  102   86    14 0.946   1.08
 8.06 Intr + 216230 216313   84  0  0  108   86    60 0.965   7.59
 8.07 Intr + 217758 217859  102  1  0   74   80   115 0.966   9.45
 8.08 Intr + 217950 217990   41  1  2  121   91    15 0.999   3.04
 8.09 Term + 218116 218286  171  0  0   94   55   125 0.991   7.63
 8.10 PlyA + 218326 218331    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 215782 215847   66  0  0   81   69    76 0.885   6.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:23567571_23786332|GENSCAN_predicted_peptide_1|67_aa
MRPYVLEVLARAIRQEKEIKDIQISKEEVKLSLFADDMIIYLKNPKDSSRKLLELMKRIQ
QSLWMQD

>gi568815575f:23567571_23786332|GENSCAN_predicted_CDS_1|204_bp
atgaggccttacgtactggaagtcctagccagagcaatcagacaagagaaagaaataaag
gacatccaaatcagtaaagaagaagtcaaactgtcactgtttgctgacgatatgatcatt
tacctcaaaaaccctaaagactcctccagaaagctcctagaactgatgaaaagaattcag
caaagtctctggatgcaagattaa

>gi568815575f:23567571_23786332|GENSCAN_predicted_peptide_2|175_aa
MWKQLWNWVTDRGWNSLEGSEEDRKMWESLELPRDLLNGFDQNADNDMDNEIQAEVVSGG
HEKLVENWSKSDSCYVLAKRLVAFCPCPRDLWNFELERDDLGYLVEEISKQQSIQEVTWV
LLKAFSFKRETEQKQEQEGDSKMPYSFKPPDLMRDLRQGQHQGDGAKPFMRNPPP

>gi568815575f:23567571_23786332|GENSCAN_predicted_CDS_2|528_bp
atgtggaagcaactttggaactgggtaacagacagaggttggaacagtttggagggctca
gaagaagacaggaaaatgtgggaaagtttggaacttcctagagacttgttgaatggcttt
gaccaaaatgctgataatgatatggacaatgaaatccaggctgaagtggtctcaggtgga
catgagaaacttgttgagaactggagcaaaagcgactcttgttatgttttagcaaagaga
ctggtggcattttgcccctgccctagggatttgtggaactttgaacttgagagagacgat
ttagggtatctggtggaagaaatttctaagcagcaaagcattcaagaggtgacttgggtg
ctgttaaaagcattcagttttaaaagggaaacagagcaaaagcaggagcaagagggagac
agcaagatgccatactcttttaaaccaccagatctcatgagagatctgcggcaaggacag
caccaaggggatggtgctaagccattcatgagaaatccacccccatga

>gi568815575f:23567571_23786332|GENSCAN_predicted_peptide_3|167_aa
MTEDEGEHDWYKCSSFNNLHSKPWSQRDKDINVSLTYEENLGLELLLPEALFWWWMGSTK
PPSQKTTEKMKNCLIKTFQGGYVELQVLPEEMSRPPATDFSPLPLPFLLGATHGGMGSAV
YPGGCPKDKGSQVTCSSEPETRLPGATAIDSNPAPITSRTAGQLNAP

>gi568815575f:23567571_23786332|GENSCAN_predicted_CDS_3|504_bp
atgacagaagatgaaggagagcatgactggtacaagtgcagcagcttcaataaccttcac
tcaaaaccttggagtcagagggacaaagacattaatgtctcactcacatatgaagagaat
ctggggctggaactgctcctgccagaagcactattctggtggtggatgggcagcacaaag
cccccatcccaaaagaccacggagaagatgaaaaactgcctcataaaaacttttcaagga
ggatacgtagaattgcaagttttacctgaggaaatgtccaggccccctgctacagacttt
agccccttgcccttgcctttccttctaggggctacacatggtggaatgggcagtgcagtg
tacccaggaggctgccccaaagacaaagggagccaagtcacatgctcctcagagcctgaa
actcgcctgcctggagccacagccattgacagcaaccctgctcccatcaccagcaggact
gcaggacaattaaatgcaccctga

>gi568815575f:23567571_23786332|GENSCAN_predicted_peptide_4|333_aa
MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRACISKTILSQKNKAGGITLPD
FKLYYKATVTKTAWYWYQNRDIDQLNRTEPSEITPHVYNHLIFDKPDKNKQWGKDSLFNK
WSWENWVAIWRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDF
MSKTPKAMATKAKIDKWDLIKLKSFCTGKETTIRVNRQPTEWEKMFAIYSSDKGLISRIY
IELKQIYKKKSNNPINKWAKDMNKHFSKEDIYAANRHMKKCSSSLAIREMQIKTTKRYHL
TPVRMAIIQKVRKQQVLERTWRNRNTFTLLVGL

>gi568815575f:23567571_23786332|GENSCAN_predicted_CDS_4|1002_bp
atggccatactgcccaaggtaatttatagattcaatgctatccccatcaagctgccaatg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
tgtatttccaagacaatcctaagccaaaagaacaaagctggaggcatcacgctacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagaccaattgaacagaacagagccctcagaaataacaccacacgtgtacaaccat
ctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaa
tggtcctgggaaaactgggtagccatatggagaaagctgaaactggaccccttccttaca
ccttatacaaaaattaattcaagatggattaaagacttaaatgttagaccaaaaaccata
aaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc
atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt
aaactaaagagcttctgcacaggaaaagaaactaccatcagagtgaacaggcaacctaca
gaatgggagaaaatgtttgcaatctactcatctgacaaagggctaatatccagaatctac
atagaactcaaacaaatttacaagaaaaaatcaaacaaccccatcaacaagtgggcaaag
gatatgaacaaacacttctcaaaagaagacatttatgcggccaacagacacatgaaaaaa
tgctcgtcatcactggccatcagagaaatgcaaatcaaaaccacaaagagataccatctc
acaccagttagaatggcgatcattcaaaaagtcaggaaacaacaggtgctggagaggacg
tggagaaataggaacacttttaccctgttggtgggactgtaa

>gi568815575f:23567571_23786332|GENSCAN_predicted_peptide_5|182_aa
MQIVQTWRCTLSQGYHCSWAAVDPFSVYGLVNPVHTADVEFFEITVSWTGLIFGFTNHMQ
LHIGATFTANSTPCNELFNSHTIPRQQTENDFDELREEGFRRSNFSELKEKVRTQRKEVK
NLEKRLDEWVTRITSAEKSLNDLIELKTMARELRDECTSFSSQFDQLEERVSVLEDQMNE
MK

>gi568815575f:23567571_23786332|GENSCAN_predicted_CDS_5|549_bp
atgcagatagttcagacctggcgctgtaccctttctcaaggttaccactgttcatgggct
gcagtagatcccttttctgtgtatggccttgtcaatcctgtgcacacagcagatgtggaa
ttctttgaaataactgttagttggactggtttaatatttggattcaccaaccacatgcag
ttacacattggtgcaacctttacagcaaacagtactccctgcaatgaactatttaattca
catactatccccagacagcagacagagaatgactttgacgagttgagagaagaaggcttc
agacgatcaaacttctccgagctaaaggagaaagttcgaacccaacgcaaagaagttaaa
aaccttgaaaaaagattagacgaatgggtaactagaataaccagtgcagagaagtcctta
aatgacctgattgagctgaaaaccatggcacgagaactacgtgatgaatgcacaagcttc
agtagccaatttgatcaactggaagaaagggtatcagtgctggaagatcaaatgaatgaa
atgaagtga

>gi568815575f:23567571_23786332|GENSCAN_predicted_peptide_6|278_aa
MDHRSRLRGTGLNRIPGTQSRAPRVPLPFHVQQEAREGEDWEREPPRQRPPIYEPPESEE
LPDNVMAARRRQKRRSRQGTCFCARVVMEALPLLAATTPDHGRHRRLLLLPLLLFLLPAG
AVQGWETEERPRTREEECHFYAGGQVYPGEASRVSVADHSLHLSKAKISKPAPYWEGTAV
IDGEFKELKLTDYRGKYLVFFFYPLDLGLFIIDDKGILRQITLNDLPVGRSVDETLRLVQ
AFQYTDKHGEVCPAGWKPGSETIIPDPAGKLKYFDKLN

>gi568815575f:23567571_23786332|GENSCAN_predicted_CDS_6|837_bp
atggatcaccgaagccgactacggggcacaggcctgaaccgaatccctgggactcagtcc
cgagccccccgagtcccactccccttccacgtgcaacaggaggccagggaaggagaagac
tgggagcgagagccacctcgtcagaggcctcctatctatgagccaccagaaagtgaagag
ctgccagataatgttatggctgcccggcggcggcagaagcggcgctcgcgccaagggacg
tgtttctgcgctcgcgtggtcatggaggcgctgccgctgctagccgcgacaactccggac
cacggccgccaccgaaggctgcttctgctgccgctactgctgttcctgctgccggctgga
gctgtgcagggctgggagacagaggagaggccccggactcgcgaagaggagtgccacttc
tacgcgggtggacaagtgtacccgggagaggcatcccgggtatcggtcgccgaccactcc
ctgcacctaagcaaagcgaagatttccaagccagcgccctactgggaaggaacagctgtg
atcgatggagaatttaaggagctgaagttaactgattatcgtgggaaatacttggttttc
ttcttctacccacttgatttaggtctcttcattattgatgacaaaggaatcctaagacaa
attactctgaatgatcttcctgtgggtagatcagtggatgagacactacgtttggttcaa
gcattccagtacactgacaaacacggagaagtctgccctgctggctggaaacctggtagt
gaaacaataatcccagatccagctggaaagctgaagtatttcgataaactgaattga

>gi568815575f:23567571_23786332|GENSCAN_predicted_peptide_7|376_aa
MVSLFDAYVVKNLVLLLFERDHVKAMEERKLLHSFLAKSQDGLPPRRMKDSYIEVLLPLG
SEPELREKYLTVQNTVRFGRILEDLDSLGVLICYMHNKIHSAKMSPLSIVTALVDKIVNK
GRRIAFSSTSLLKMAPSAEERTTIHEMFLSTLDPNGSRPFVVAVDDIMFQKPVEVGSLLF
LSSQVCFTQNNYIQVRVHSEVASLQEKQHTTTNVFHFTFMSEKEVPLVFPKTYGAIFLEG
SVVSLKVSCLSNEVSIKGPRGPGTQNFWTAELVKACRKVNKNSPMCWDDGAPQLHGNRSS
RAQDPSRPRPMCLFIWVFIRILQNILHKKLVDSIRIELDQKIPGWCPLQNCLPVGGENPY
TLHIWSQKSSVLIVVM

>gi568815575f:23567571_23786332|GENSCAN_predicted_CDS_7|1131_bp
atggtgtctctgtttgatgcctatgtggtgaaaaaccttgttttacttttatttgaaaga
gaccatgtgaaggcaatggaagaaaggaaattacttcatagtttcttggctaaatcacag
gatggactgcctcctaggagaatgaaggacagttatattgaagttctcttgcctttgggc
agtgagcctgaattacgagagaaatatttgactgttcaaaacaccgtaagatttggcagg
attcttgaggatcttgacagcttgggagttcttatttgttacatgcacaacaaaatccac
tccgccaagatgtctcctttatcgatagttacagccctggtggataagattgtgaacaag
gggagaagaattgccttcagctccacgtcgttactgaaaatggcccccagcgctgaggag
aggaccaccatacatgagatgtttctcagcacactggatccaaatggttctcgaccgttt
gtggtagcagtagatgacatcatgtttcagaaacctgttgaggttggctcattgctcttt
ctttcttcacaggtatgctttactcagaataattatattcaagtcagagtacacagtgaa
gtggcctccctgcaggagaagcagcatacaaccaccaatgtctttcatttcacgttcatg
tcggaaaaagaagtgccattggttttcccaaaaacatatggagccattttcttggaggga
tcagtggtgtcactgaaggtgtcatgcctatccaatgaagtctccataaaaggcccaaga
ggaccaggtacgcagaacttctggacagctgagctggtgaaggcttgcaggaaggtgaac
aagaattcacccatgtgttgggatgatggtgcaccccaactccacgggaatagaagctcc
cgtgctcaggacccttccagacctcgccctatgtgcctcttcatctgggtgtttattcgt
atccttcaaaatatccttcataagaaactggtggacagcatcagaattgaattggatcag
aagattcctggctggtgtccactgcagaattgcttgcctgttggtggggaaaatccctac
accctacacatttggtcacagaagtcttctgtattgattgttgtcatgtga

>gi568815575f:23567571_23786332|GENSCAN_predicted_peptide_8|322_aa
MATPQVEQPVDHFQSIGTALTSERNDGIAVPQLYQLDSPTTRKLCDQWCLCLSFDQARQT
CSTDVAWQAVLSSCYQPGFHACRGPAIQWVLSSRPTSRKNEVRGQLEGEQGGEEPYRVTE
QLSGNLKWVAPFRRQVVPGPGPQREEKQKTKMAKFVIRPATAADCSDILRLIKELAKYEY
MEEQVILTEKDLLEDGFGEHPFYHCLVAEVPKEHWTPEGHSIVGFAMYYFTYDPWIGKLL
YLEDFFVMSDYRGFGIGSEILKNLSQVAMRCRCSSMHFLVAEWNEPSINFYKRRGASDLS
SEEGWRLFKIDKEYLLKMATEE

>gi568815575f:23567571_23786332|GENSCAN_predicted_CDS_8|969_bp
atggctactccacaggtagagcagccagtagaccattttcaaagcattgggacagctctc
acctcagaaagaaatgatggtattgcggtgcctcagctctaccaactagattcaccaacc
actagaaagctctgtgaccagtggtgcctttgcctgagttttgatcaggcccgccagact
tgttccactgacgtggcctggcaggctgtgctcagctcatgctaccagccaggattccac
gcctgccgaggtcccgccattcagtgggtcctgagttctcgtcccacatccaggaagaat
gaggtacgcggacaactagagggtgagcaaggcggagaggagccttatcgagtgacagaa
cagctctcgggaaacttgaagtgggtagctcctttccgcaggcaggttgtcccagggcct
ggtccgcaaagggaagaaaagcaaaagacgaaaatggctaaattcgtgatccgcccagcc
actgccgccgactgcagtgacatactgcggctgatcaaggagctggctaaatatgaatac
atggaagaacaagtaatcttaactgaaaaagatctgctagaagatggttttggagagcac
cccttttaccactgcctggttgcagaagtgccgaaagagcactggactccggaaggacac
agcattgttggttttgccatgtactattttacctatgacccgtggattggcaagttattg
tatcttgaggacttcttcgtgatgagtgattatagaggctttggcataggatcagaaatt
ctgaagaatctaagccaggttgcaatgaggtgtcgctgcagcagcatgcacttcttggta
gcagaatggaatgaaccatccatcaacttctataaaagaagaggtgcttctgatctgtcc
agtgaagagggttggagactgttcaagatcgacaaggagtacttgctaaaaatggcaaca
gaggagtga