GENSCAN 1.0	Date run: 4-Nov-116	Time: 10:14:25

Sequence gi568815594r:89629193_89935667 : 306475 bp : 35.71% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  12203  12354  152  2  2  108   45    94 0.316   5.99
 1.02 Intr +  18474  19367  894  1  0  -16   40   396 0.116  14.75
 1.03 Intr +  19651  19967  317  2  2   17   86   145 0.211   2.06
 1.04 Term +  26611  26709   99  0  0  127   37    68 0.432   2.75
 1.05 PlyA +  27552  27557    6                                 1.05

 2.00 Prom +  33904  33943   40                                -2.85
 2.01 Init +  43186  43405  220  0  1   88   30   203 0.733  13.34
 2.02 Term +  43709  44220  512  0  2   16   39   251 0.717   6.55
 2.03 PlyA +  44383  44388    6                                 1.05

 3.00 Prom +  45046  45085   40                                -7.85
 3.01 Init +  45544  45702  159  0  0   70   57    79 0.503   2.88
 3.02 Term +  50296  50982  687  0  0   75   37   393 0.748  25.52
 3.03 PlyA +  54482  54487    6                                 1.05

 4.03 PlyA -  54523  54518    6                                 1.05
 4.02 Term -  59997  59865  133  2  1   36   41   154 0.432   2.08
 4.01 Init -  69094  69054   41  1  2   90   70    48 0.315   2.91
 4.00 Prom -  69167  69128   40                                -3.35

 5.04 PlyA -  69377  69372    6                                 1.05
 5.03 Term -  87762  87668   95  1  2   84   38   136 0.792   5.21
 5.02 Intr -  94177  94027  151  1  1   57   61   142 0.908   7.21
 5.01 Init -  95629  95627    3  1  0   55  115     0 0.537  -0.55
 5.00 Prom -  96911  96872   40                                -5.45

 6.00 Prom + 102342 102381   40                                -3.45
 6.01 Init + 102683 102815  133  1  1   50   85   162 0.826  12.45
 6.02 Intr + 118579 118661   83  2  2  100   94    67 0.407   6.94
 6.03 Intr + 124039 124227  189  0  0   50   47    97 0.124   0.66
 6.04 Term + 139506 139661  156  2  0   17   48   115 0.036  -2.65
 6.05 PlyA + 139836 139841    6                                 1.05

 7.05 PlyA - 140046 140041    6                                 1.05
 7.04 Term - 142393 142269  125  2  2   38   53   139 0.716   2.97
 7.03 Intr - 146588 146441  148  2  1   37   33   107 0.094  -1.01
 7.02 Intr - 160920 160781  140  1  2   75   81    97 0.610   6.96
 7.01 Init - 168378 168249  130  0  1   43   70   118 0.602   5.86
 7.00 Prom - 182973 182934   40                                -3.35

 8.05 PlyA - 183337 183332    6                                 1.05
 8.04 Term - 183738 183637  102  0  0   99   35    82 0.778   1.30
 8.03 Intr - 193196 193054  143  0  2   59   86   176 0.933  13.65
 8.02 Intr - 198992 198951   42  0  0   72  115    31 0.547   1.49
 8.01 Init - 206475 206355  121  0  1   78   82   135 0.569  12.30
 8.00 Prom - 206796 206757   40                                -6.65

 9.00 Prom + 214394 214433   40                                -6.05
 9.01 Init + 222132 222308  177  2  0   83   32   144 0.116   7.61
 9.02 Intr + 227072 227352  281  1  2   38   65   151 0.097   3.15
 9.03 Intr + 227803 227940  138  1  0   47   36   123 0.456   1.66
 9.04 Intr + 246658 246825  168  1  0    7   45   159 0.010   1.64
 9.05 Intr + 265815 266402  588  0  0   57   69   412 0.001  27.01
 9.06 Intr + 280084 280157   74  1  2   60   75    40 0.052  -1.97
 9.07 Intr + 293976 294080  105  1  0  125   40   108 0.616   8.97
 9.08 Intr + 298603 298776  174  2  0   70   93   134 0.635  11.09
 9.09 Intr + 305618 306394  777  0  0  101   35   548 0.087  41.32

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 260844 260945  102  2  0   95   55   120 0.872   6.70
S.002 Init + 305692 306394  703  0  1   44   35   573 0.907  42.77

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:89629193_89935667|GENSCAN_predicted_peptide_1|487_aa
XKQRNAAGPSEWSLPLPAPKQLASSCAHVLQFLPAKGSGKYPALSTQYYRAQLEKQEQTH
SKASRRQEITKIRAELKEIETQKTLQKINESRSWFFEKINKIRRPLARLIKKKREKNQID
AIKNDKGDITPNPTEIQTTIREYYKHLYTNELENLEEMDKFLNTYNLPRLNQEEVESLNR
PITGCEIEALINSLPTKKSPGTDGLTAKFYHRYKEELVPFLLKRFQSVEKERILPNSFYE
ASIILIPKPGRDATKEENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQ
GWFNICKSINVIQHINRTNDKNHIIISIDAEEAFNKIQQPFMLKTLNKLAPNLLKLISNF
SKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLIRDVKDLFKENY
KPLLNEIKEDTNKWKNIPCSWIGRINIMKMATLPKDLSFVHMKRENAGVSSTYKDVSPIA
CHPYDLI

>gi568815594r:89629193_89935667|GENSCAN_predicted_CDS_1|1464_bp
nncaagcagcggaatgcagcaggtccgagtgaatggagtttgcccctaccagcaccgaag
cagctggctagttcttgcgctcatgtactccagttcctgcccgcaaaggggtcagggaaa
tatcctgctttatcaacacaatactatagagcacaactagagaagcaagagcaaacacat
tcaaaagctagcagaaggcaagaaataactaagatcagagcagaactgaaggagatagag
acacaaaaaacccttcaaaaaatcaatgaatccaggagctggttttttgaaaagatcaac
aaaattcgtagaccactagcaaggctaataaagaagaaaagagagaagaatcagatagat
gcaataaaaaatgataaaggggatatcacccccaatcccacagaaatacaaactaccatc
agagaatactataaacacctctacacaaatgaactagaaaatctagaagaaatggataaa
ttcctcaacacatacaacctcccaagactaaaccaggaagaagttgaatctcttaataga
ccaataacaggctgtgaaattgaggcattaattaatagcttaccaaccaaaaaaagtcca
ggaacagacggattgacagccaaattctaccataggtacaaggaggagctggtaccattc
cttctgaaacgattccaatcagtagaaaaagagagaatcctccctaactcattttatgag
gccagcatcatcctgataccaaagcctggcagagacgcaacaaaagaagaaaattttcga
ccaatatccttgatgaacatcgatgcaaaaatcctcaataaaatactggcaaaccgaatc
cagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaa
ggctggttcaacatatgcaaatcaataaatgtaatccagcatataaacagaaccaatgac
aaaaaccacattattatctcaatagatgcagaagaggccttcaacaaaattcaacagccc
ttcatgctaaaaactctcaataaattagccccaaatctccttaagctgataagcaacttc
agcaaagtctcaggatacaaaatcaacgtgcaaaaatcacaagcattcttatacaccaat
aacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaagaga
ataaaatacctaggaatccaacttataagggatgtgaaggaccttttcaaggaaaactac
aaacctctgctcaacgaaataaaagaggacacaaacaaatggaagaacattccatgctca
tggataggaagaatcaatatcatgaaaatggccacactgcccaaggacctttcctttgtg
cacatgaagagagagaacgctggtgtctcctctacttataaggacgtcagtcctattgca
tgccacccttatgacctcatttaa

>gi568815594r:89629193_89935667|GENSCAN_predicted_peptide_2|243_aa
MGKKQHKKAENSKNQNASSPKDQNSLPAKEQNWMENEFDKLTEVGFRRWVITNNSELKEY
VLTQCKEAKNLEKTRQANIQIQEIQRTPQRYSSRRTTPRHIIVRFTKVEMKEKMLRAARE
KGRVTHKGKPIRLTADFSAETLQARREWGPIFNILKENNFQPRISYPAKLSFISEGEIKS
FTDKQILRDFVTTRPALQELLKEAVNMETNNQYQPLQKIPNCKDHQCCEETASINGQNNQ
LTS

>gi568815594r:89629193_89935667|GENSCAN_predicted_CDS_2|732_bp
atggggaaaaagcagcacaaaaaggctgaaaattccaaaaaccagaatgcctcttctcca
aaggatcaaaactccttgccagcaaaggaacaaaactggatggagaatgagtttgacaaa
ttgacagaagtaggcttcagaaggtgggtaataacaaacaactccgagctaaaggagtat
gttctaacccaatgcaaggaagctaagaaccttgaaaaaacaagacaggccaacatccaa
atccaggaaatacagagaacaccacaaagatactcctcgagaagaacaaccccaagacac
ataattgtcagattcaccaaggttgaaatgaaggaaaaaatgttaagggcagccagagag
aaaggtcgggttacccacaaagggaagcccatcagactaacagcagatttctctgcagaa
accctacaagccagaagagagtgggggccaatattcaacattcttaaagaaaataatttt
caacccagaatttcctatccagccaaactaagcttcataagtgaaggagaaataaaatcc
tttacagacaagcaaatactgagggattttgtcaccaccagacctgccttacaagagctc
ctgaaagaagcagtaaacatggaaacgaacaaccagtaccagccactgcaaaaaatacca
aattgtaaagaccatcaatgctgtgaagaaactgcatcaattaacgggcaaaataaccag
ctaacatcataa

>gi568815594r:89629193_89935667|GENSCAN_predicted_peptide_3|281_aa
MDKFLDTYTLPSLSQEEVESPNRPTTSSEIEAVINSLPTKKSPGPDGFTAKFEWVLGLAD
FKNEAADPRGECYSSQRWCVRSLFLQMFTCPEFLPSGGLIVSLASGVKLQTFAVSVTAHK
GGASGVVRSFRWVRGLSGFRSEAADLCSVTSLKGGTSRVVHSSRPELFCSSFLVPPSGFV
VSLASGVKLQTFAVSVTALKGGTDPKSEQQQDLLQRAKEQNFEKVEGDLSGLLRWPAFIP
LFGPNHILLIGLFYRELIGPFYRVLIGPFLHRALIGAFTKL

>gi568815594r:89629193_89935667|GENSCAN_predicted_CDS_3|846_bp
atggataaattcctggacacatacaccctcccaagtctaagccaagaagaagttgaatcc
ccgaatagaccaacaacaagttctgaaattgaggcggtaattaatagcctaccaaccaaa
aaaagtccaggaccagatggattcacagccaaatttgagtgggttcttggtcttgctgac
ttcaagaatgaagccgcagaccctcgtggtgagtgttacagttctcaaagatggtgtgtc
cggagtttgttccttcagatgttcacgtgtccagagtttcttccttctggtgggttaata
gtctcgctggcttcaggagtgaagctgcagaccttcgcagtgagtgttacagctcataaa
ggtggcgcttctggagttgttcgttccttccggtgggttcgtggtctctctggcttcagg
agtgaagctgcagacctttgcagtgttacatctcttaaaggtggcacgtccagagttgta
cattcctcccgtccagagttgttttgttcctccttcctcgttcctcccagtgggtttgtg
gtctcgctggcttcaggagtgaagctgcagacctttgcagtgagtgttacagctcttaaa
ggtggcacagacccaaagagtgagcagcagcaagatttattgcaaagagcaaaagaacaa
aacttcgaaaaagtggaaggggacctgagtgggttgctcaggtggcctgcttttattccc
ttatttggccccaatcacatcctgctgattggtctattttacagagagctgattggtcca
ttttacagagtgctgattggtccgtttttacacagagcactgattggtgcatttacaaag
ctttag

>gi568815594r:89629193_89935667|GENSCAN_predicted_peptide_4|57_aa
MGKGEEKARTFLTSFEDAGREPQATECKWALEVRSVKETDSPPEPPKRNATMLAPWF

>gi568815594r:89629193_89935667|GENSCAN_predicted_CDS_4|174_bp
atggggaaaggtgaagagaaagcaaggaccttcctcacaagctttgaagatgcaggtagg
gagccacaagccacggaatgcaaatgggctctagaagttagatcagtcaaagaaacagat
tctccaccagagcctccaaaaaggaatgcaaccatgctggcaccttggttttag

>gi568815594r:89629193_89935667|GENSCAN_predicted_peptide_5|82_aa
MVDSVRFELRDTQLVATKNGRIAWCRKHTPHTHVVSEMNRKYCVPEILSVVTLSSDKEAL
PQCDPEAADCLKIKEPHSFIAV

>gi568815594r:89629193_89935667|GENSCAN_predicted_CDS_5|249_bp
atggtagatagtgtccgatttgaattacgggacacccagttggtagccacaaagaatggg
agaattgcttggtgtagaaaacacaccccacacacacatgtggtgtcagaaatgaaccgg
aaatattgtgttccggaaatattgagtgttgtgaccctttccagtgataaagaagccctg
ccgcaatgtgaccccgaagctgcagattgtctgaagatcaaagagccacatagctttata
gctgtctaa

>gi568815594r:89629193_89935667|GENSCAN_predicted_peptide_6|186_aa
MTAQSYSRVQIDNRVYDLDVGSGHPNGVTAIKGSFVRQLKSYMIGQSACPLEEKGPGFFI
SPSLDLSAHMHLREHCSNCKRFCFDCGNWNNQGTNCKVTGKGSSRHDSSLSNFKAADFKR
ENQADESEQICSSAHFTIGFVLLWQSNATTELTGGGAQLLLCSLVSNSLTGYGPVLVMAL
GLGTPD

>gi568815594r:89629193_89935667|GENSCAN_predicted_CDS_6|561_bp
atgacagcacaatcctattctagagtccaaatcgacaatagggtttacgacctcgatgtt
ggatcaggtcatcctaatggtgtaactgctattaaaggttcgtttgttcgacaattgaag
tcctacatgatcggccagtctgcatgtccactagaagaaaagggacctgggttcttcatc
agcccatccttagatcttagtgcacatatgcacctaagagagcactgcagtaactgcaaa
cgcttctgttttgattgtggtaattggaacaatcagggcacaaactgcaaggtcactggc
aagggtagcagcaggcatgattcctcccttagtaattttaaagcagcagatttcaaaagg
gaaaaccaagcagatgaaagtgagcagatttgttcttcagcacatttcacaatagggttc
gtgctcctatggcaatctaatgccaccactgagctgacaggaggaggagctcagctcttg
ctgtgcagcctggtatcgaacagcctaacaggctatggaccagtactggtcatggccctg
gggttggggacccctgattga

>gi568815594r:89629193_89935667|GENSCAN_predicted_peptide_7|180_aa
MVTTRSMLAGGRGISSVIQEADSKMVLELQPSTRDLLQAIAMKVITHTSFHSSAITFVTK
VMVLSPPIAAAGLGHQDLGSTHHVPRTGPQVAKHMNEVQWKAFLRNPVKTKHTHTHTDIR
EAGRDVHYSTDKHPGRPSRKISSYFGTGDAHTQKAKRHSIFKEAHKKQCIQKWRELEEWT

>gi568815594r:89629193_89935667|GENSCAN_predicted_CDS_7|543_bp
atggtaacaacaagatccatgcttgctggaggtagaggcatcagttcagtcattcaggaa
gctgattccaagatggtgttagaattacaaccatccacaagagatttattgcaggcaata
gctatgaaagtcatcactcatacctctttccactcttctgccatcacttttgtcaccaaa
gtcatggtcctttccccgccgattgctgctgcaggtctagggcaccaagacttaggcagc
actcaccatgtgccaagaactggaccacaggtagcaaagcacatgaatgaagttcagtgg
aaggccttcctcaggaatccagtaaaaaccaaacatacacacacacacacggacatccgt
gaggcaggaagggatgtccactatagtacagacaagcatcctggaaggccatcaaggaaa
atctcctcttactttggcactggagatgcccatacgcagaaagcaaaaaggcacagcata
tttaaggaagctcataagaaacagtgcatccagaagtggcgagaattggaggaatggaca
tga

>gi568815594r:89629193_89935667|GENSCAN_predicted_peptide_8|135_aa
MDVFMKGLSKAKEGVVAAAEKTKQGVAEAAGKTKEGVLYVGSKTKEGVVHGVATVAEKTK
EQVTNVGGAVVTGVTAVAQKTVEGAGSIAAATGFVKKDQLGKAASYQGVRPKSFLLSDNT
KPHESPLRGRTRDKL

>gi568815594r:89629193_89935667|GENSCAN_predicted_CDS_8|408_bp
atggatgtattcatgaaaggactttcaaaggccaaggagggagttgtggctgctgctgag
aaaaccaaacagggtgtggcagaagcagcaggaaagacaaaagagggtgttctctatgta
ggctccaaaaccaaggagggagtggtgcatggtgtggcaacagtggctgagaagaccaaa
gagcaagtgacaaatgttggaggagcagtggtgacgggtgtgacagcagtagcccagaag
acagtggagggagcagggagcattgcagcagccactggctttgtcaaaaaggaccagttg
ggcaaggctgcatcctaccagggagtaagacccaagtccttcctgctttcagacaacacc
aagcctcatgagtccccactcagaggaaggaccagagacaaactctaa

>gi568815594r:89629193_89935667|GENSCAN_predicted_peptide_9|828_aa
MAEGKEEQVTSYVDGSRQGERAFAGKLPFLKQSDIMRLTLYHESSTRKTRPHDSTISHQW
VQDSGCSTSSMTRRTARHCLTQEVQGVREFPFIAKQSCDRRHLKNRVTPTQILRFSNGLS
KRHTRKLYPAPGSEGPMPTEPCSLLAQQSEIELQTDSGVDLQQTPTDLQLRILTVKRITN
KQKGHPHQNYKATVTKMARYYWQMRQMKIVKLDELDYRYFASTATDKETTPPSTKKTQCP
KPKFHNVQFQVQMYYLWSGGIGLNNSKHSWTIPEDGNSQKTMPSASVPPNKIQSLQILPT
TRVMSAEIATTPEARTSEDSLLKSTLPPSETSAPAEGVRNQTLTSTEKAEGVVKLQNLTL
PTNASIKFNPGAESVVLSNSTLKFLQSFARKSNEQATSLNTVGGTGGIGGVGGTGGVGNR
APRETYLSRGDSSSSQRTDYQKSNFETTRGKNWCAYVHTRLSPTVILDNQVTYVPAQEQQ
SLIHTNQAESHTAVGRGVAEQQQQQGCGDPEVMQKMTDQVNYQAMKLTLLQKKIDNISLT
VNDVRNTYSSLEGKVSEDKSREFQSLLKGLKSKSINVLIRDIVREQFKIFQNDMQETVAQ
LFKTVSSLSEDLESTRQIIQKVNESVVSIAAQQKFVLVQENRPTLTDIVELRNHIVNVRQ
EMTLTCEKPIKELEVKQTHLEGALEQEHSRSILYYESLNKTLSKLKEVHEQLLSTEQVSD
QKNAPAAESVSNNVTEYMSTLHENIKKQSLMMLQMFEDLHIQESKINNLTVSLEMEKESL
RGECEDMLSKCRNDFKFQLKDTEENLHVLNQTLAEVLFPMDNKMDKMX

>gi568815594r:89629193_89935667|GENSCAN_predicted_CDS_9|2484_bp
atggcagaaggcaaggaggagcaagtcacgtcttatgtggatggcagcaggcaaggggag
agagcttttgcagggaaactcccatttttaaaacaatcagatatcatgagacttactctc
tatcatgagagcagcacaagaaagacccgaccccatgattcaactatctcccaccagtgg
gtgcaggacagtgggtgcagcacatcgagcatgacccgaagaacagcgaggcattgcctc
acccaggaagtgcaaggggtgagggaattccctttcatagccaagcaaagctgtgacaga
cggcacctgaaaaatcgggtcactcccacccaaatactgcgcttttccaatggtcttagc
aagcggcacaccaggaaattatatcctgcgcctggctcagagggtcccatgcccacggag
ccttgctcattgctagcacagcagtctgagattgaactgcaaacagattctggagtggac
ctccagcaaactccaacagacctgcagctgaggatcctgactgtcaaaaggataactaac
aaacagaaaggacatccacaccaaaactacaaggctacagtaaccaaaatggcacggtat
tactggcagatgaggcaaatgaagattgtaaagttagatgaacttgattatcgatacttt
gccagtactgctacagacaaagagaccacacctccatctaccaaaaaaactcagtgtcct
aaaccaaagtttcataatgttcaattccaagttcaaatgtattatttatggagtgggggc
attgggcttaacaacagtaagcattcttggactatacctgaggatgggaactctcagaag
actatgccttctgcttcagttcctccaaataaaatacaaagtttgcaaatactgccaacc
actcgggtcatgtcggcggagatagctacaactccagaggcaagaacttctgaagacagt
cttcttaaatcaacactgcctccctcagaaacaagtgcacctgctgagggtgtgagaaat
caaactctcacatccacagagaaagcagaaggagtggtcaagttacagaatcttaccctc
ccaaccaacgctagcatcaagttcaatcctggagcagaatcagtggtcctttccaattct
acactgaaatttcttcagagctttgccagaaagtcaaatgaacaagcaacttctctaaac
acagttggaggcactggaggcattggaggcgttggaggcactggaggcgtgggaaatcga
gccccacgggaaacatacctcagccggggtgacagcagttccagccaaagaactgactac
caaaaatcaaatttcgaaacaactagaggaaagaattggtgtgcttatgtacataccagg
ttatctcccacagtgatattggacaaccaggtcacttatgtcccagcccaggaacagcaa
agtttgatacacaccaaccaggctgaaagtcatacagctgttggcagaggagtagctgag
cagcagcagcagcaaggctgtggtgacccagaagtgatgcaaaaaatgactgatcaggtg
aactaccaggcaatgaaactgactcttctgcagaagaagattgacaatatttctttgact
gtgaatgatgtaaggaacacttactcctccctagaaggaaaagtcagcgaagataaaagc
agagaatttcaatctcttctaaaaggtctaaaatccaaaagcattaatgtactgataaga
gacatagtaagagaacaatttaaaatttttcaaaatgacatgcaagagactgtagcacag
ctcttcaagactgtatcaagtctatcagaggacctcgaaagcaccaggcaaataattcaa
aaagttaatgaatctgtggtttcaatagcagcccagcaaaagtttgttttggtgcaagag
aatcggcccactttgactgatatagtggaactaaggaatcacattgtgaatgtaaggcaa
gaaatgactcttacatgtgagaagcctattaaagaactagaagtaaagcagactcattta
gaaggtgctctagaacaggaacactcaagaagcattctgtattatgaatccctcaataaa
actctttctaaattgaaggaagtacatgagcagcttttatcaactgaacaggtatcagac
cagaagaatgctccagctgctgagtcagttagcaataatgtcactgagtacatgtctact
ttacatgaaaatataaagaagcagagtttgatgatgctgcaaatgtttgaagatttgcac
attcaagaaagcaagattaacaatctcaccgtctctttggagatggagaaagagtctctc
agaggtgaatgtgaagacatgttatccaaatgcagaaatgattttaaatttcaacttaag
gacacagaagagaatttacatgtgttaaatcaaacattggctgaagttctctttccaatg
gacaataagatggacaaaatgann