GENSCAN 1.0	Date run: 7-Nov-116	Time: 16:20:03

Sequence gi568815596f:73826911_74058247 : 231337 bp : 44.90% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2495   2600  106  2  1  114   69     0 0.402   0.92
 1.02 Intr +   3935   4149  215  1  2   95   76   153 0.834  12.31
 1.03 Intr +   9312   9518  207  0  0   31   81   103 0.653   2.09
 1.04 Intr +  17903  17978   76  2  1   56   98    51 0.911   2.42
 1.05 Intr +  18257  18352   96  1  0   92   86    99 0.984  10.31
 1.06 Intr +  20477  20843  367  1  1   64   86   337 0.940  25.82
 1.07 Intr +  22453  22577  125  2  2   95   58    74 0.993   5.40
 1.08 Intr +  23466  23603  138  2  0   92   77    97 0.573   9.66
 1.09 Intr +  30211  30399  189  0  0   77   80    40 0.614   1.88
 1.10 Term +  35293  35349   57  0  0  116   53   100 0.896   6.99
 1.11 PlyA +  35924  35929    6                               1.05

 2.00 Prom +  57284  57323   40                              -3.56
 2.01 Init +  74402  74527  126  1  0   66   96   136 0.989  12.36
 2.02 Intr +  75450  75578  129  2  0  120   65   143 0.994  16.09
 2.03 Intr +  81763  81873  111  0  0   89  110    77 0.946  10.58
 2.04 Intr +  82145  82229   85  1  1   84  109   104 0.971  11.49
 2.05 Intr +  86575  86736  162  2  0  108   68   276 0.997  27.55
 2.06 Intr +  87770  87961  192  0  0  116   73   273 0.999  28.06
 2.07 Intr +  89674  89855  182  2  2   66   98   263 0.999  24.69
 2.08 Term +  92522  92665  144  1  0  121   39   151 0.997  11.41
 2.09 PlyA +  95715  95720    6                               1.05

 3.00 Prom +  98868  98907   40                              -5.96
 3.01 Init + 100001 100142  142  1  1   72   87   149 0.964  13.76
 3.02 Intr + 107442 107507   66  1  0   61   89    38 0.284   0.08
 3.03 Intr + 112000 112112  113  2  2   50  106    44 0.507   2.50
 3.04 Intr + 119809 119996  188  0  2  101   98   115 0.934  12.29
 3.05 Intr + 123675 123822  148  0  1   90   92   100 0.996  10.84
 3.06 Intr + 130215 130330  116  2  2   36  100    70 0.845   2.25
 3.07 Intr + 131236 131335  100  1  1   63   59   126 0.922   7.31
 3.08 Term + 131800 131826   27  0  0  138   41    10 0.934  -0.63
 3.09 PlyA + 132017 132022    6                               1.05

 4.00 Prom + 137732 137771   40                              -1.56
 4.01 Init + 159216 159274   59  2  2   90   75   120 0.931   9.68
 4.02 Term + 159388 159847  460  1  1   62   43   356 0.185  23.06
 4.03 PlyA + 160349 160354    6                              -5.12

 5.06 PlyA - 160482 160477    6                               1.05
 5.05 Term - 161014 160674  341  2  2   86   43   179 0.147   7.90
 5.04 Intr - 166437 166360   78  1  0   58   99    60 0.268   3.62
 5.03 Intr - 172184 172074  111  0  0   63   44    78 0.312   1.25
 5.02 Intr - 173885 173835   51  0  0   66  105    48 0.205   3.18
 5.01 Init - 182022 181953   70  0  1   77   76    50 0.065   3.91
 5.00 Prom - 191807 191768   40                              -2.56

 6.00 Prom + 196983 197022   40                              -3.86
 6.01 Init + 204269 204385  117  1  0   96  109     9 0.410   4.00
 6.02 Intr + 219368 221501 2134  1  1  115  123  1311 0.848 124.64

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 126344 126445  102  1  0  -11   44   225 0.950   6.48
S.002 Term + 183518 183636  119  0  2   99   48   103 0.883   6.10
S.003 Init + 217704 217845  142  2  1   69   97    90 0.871   6.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:73826911_74058247|GENSCAN_predicted_peptide_1|525_aa
XCCIPFLELLNQVVTGAASWPWEVVPFLTHKNLSQENLVLMSDHGDVSLPPEDRVRALSQ
LGSAVEVNEDIPPRRYFRSGVEIIRMASIYSEEGNIEHAFILYNKYITRETESVVSSQGL
ARRRNLRMGAGVEEVRRLPRQEQQLLQGSPAGQQAASGGSRQPPPPLHPVVPALPSRLFI
EKLPKHRDYKSAVIPEKKDTVKKLKEIAFPKAEELKAELLKRYTKEYTEYNEEKKKEAEE
LARNMAIQQELEKEKQRVAQQKQQQLEQEQFHAFEEMIRNQELEKERLKIVQEFGKVDPG
LGGPLVPDLEKPSLDVFPTLTVSSIQPSDCHTTVRPAKPPVVDRSLKPGALSNSESIPTI
DGLRHVVVPGRLCPQFLQLASANTARGVETCGILCGKLMRNEFTITHVLIPKQSAGSDYC
NTENEEELFLIQDQQGLITLGWIHWSLSFNTVEQYLSPMMPQMEGLSSVPGSSPYPTCLM
ARCLDAFSHQGSAPTVLHPTRNSRTHQSCSHVTVVDRAVTITDLR

>gi568815596f:73826911_74058247|GENSCAN_predicted_CDS_1|1578_bp
nnatgctgcatcccctttttggaattgctcaaccaggtggtaaccggcgccgcttcctgg
ccttgggaggtggttcctttcttaacccacaagaacctctcccaagagaacttggtcctg
atgtctgaccatggagatgtgagcctcccgcccgaagaccgggtgagggctctctcccag
ctgggtagtgcggtagaggtgaatgaagacattccaccccgtcggtacttccgctctgga
gttgagattatccgaatggcatccatttactctgaggaaggcaacattgaacatgccttc
atcctctataacaagtatatcacgagggagactgaatcagttgtgagcagccaagggctg
gcccgtcgtcggaacctccgaatgggggctggggtggaggaggtgaggcgattgccgagg
caggagcagcagctgttgcagggctctcctgcaggccagcaggcagcatctggaggctcc
cggcagcctccccctcccctccaccccgtggtgcccgcgctgccctcaaggctctttatt
gagaaactaccaaaacatcgagattacaaatctgctgtcattcctgaaaagaaagacaca
gtaaagaaattaaaggagattgcatttcccaaagcagaagagctgaaggcagagctgtta
aaacgatataccaaagaatatacagaatataatgaagaaaagaagaaggaagcagaggaa
ttggcccggaacatggccatccagcaagagctggaaaaggaaaaacagagggtagcacaa
cagaagcagcagcaattggaacaggaacagttccatgccttcgaggagatgatccggaac
caggagctagaaaaagagcgactgaaaattgtacaggagtttgggaaggtagaccctggc
ctaggtggcccgctagtgcctgacttggagaagccctccttagatgtgttccccacctta
acagtctcatccatacagccttcagactgtcacacaactgtaaggccagctaagccacct
gtggtggacaggtccttgaaacctggagcactgagcaactcagaaagtattcccacaatc
gatggattgcgccatgtggtggtgcctgggcggctgtgcccacagtttctccagttagcc
agtgccaacactgcccggggagtggagacatgtggaattctctgtggaaaactgatgagg
aatgaatttaccattacccatgttctcatccccaagcaaagtgctgggtctgattactgc
aacacagagaacgaagaagaacttttcctcatacaggatcagcagggcctcatcacactg
ggctggattcattggtctttgagttttaacactgtggaacagtatctttccccgatgatg
ccccagatggaaggtttgagttctgtccctgggtcctctccataccctacctgtttaatg
gcccgatgtcttgatgccttctcgcatcaaggctctgcgcctacagtccttcatccaacc
aggaattccagaacccatcagagctgcagccacgtgactgttgtggacagagcagtgacc
atcacagaccttcgatga

>gi568815596f:73826911_74058247|GENSCAN_predicted_peptide_2|376_aa
MCEEETTALVCDNGSGLCKAGFAGDDAPRAVFPSIVGRPRHQGVMVGMGQKDSYVGDEAQ
SKRGILTLKYPIEHGIITNWDDMEKIWHHSFYNELRVAPEEHPTLLTEAPLNPKANREKM
TQIMFETFNVPAMYVAIQAVLSLYASGRTTGIVLDSGDGVTHNVPIYEGYALPHAIMRLD
LAGRDLTDYLMKILTERGYSFVTTAEREIVRDIKEKLCYVALDFENEMATAASSSSLEKS
YELPDGQVITIGNERFRCPETLFQPSFIGMESAGIHETTYNSIMKCDIDIRKDLYANNVL
SGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWISK
PEYDEAGPSIVHRKCF

>gi568815596f:73826911_74058247|GENSCAN_predicted_CDS_2|1131_bp
atgtgtgaagaggagaccaccgcgctcgtgtgtgacaatggctctggcctgtgcaaggca
ggcttcgcaggagatgatgccccccgggctgtcttcccctccattgtgggccgccctcgc
caccagggtgtgatggtgggaatgggccagaaagacagctatgtgggggatgaggctcag
agcaagcgagggatcctaactctcaaataccccattgaacacggcatcatcaccaactgg
gatgacatggagaagatctggcaccactccttctacaatgagctgcgtgtagcacctgaa
gagcaccccaccctgctcacagaggctcccctaaatcccaaggccaacagggaaaagatg
acccagatcatgtttgaaaccttcaatgtccctgccatgtacgtcgccattcaagctgtg
ctctccctctatgcctctggccgcacgacaggcatcgtcctggattcaggtgatggcgtc
acccacaatgtccccatctatgaaggctatgccctgccccatgccatcatgcgcctggac
ttggctggccgtgacctcacggactacctcatgaagatcctcacagagagaggctattcc
tttgtgaccacagctgagagagaaattgtgcgagacatcaaggagaagctgtgctatgtg
gccctggattttgagaatgagatggccacagcagcttcctcttcctccctggagaagagc
tatgagctgccagatgggcaggttatcaccattggcaatgagcgcttccgctgccctgag
accctcttccagccttcctttattggcatggagtccgctggaattcatgagacaacctac
aattccatcatgaagtgtgacattgacatccgtaaggacttatatgccaacaatgtcctc
tctgggggcaccaccatgtaccctggcattgctgacaggatgcagaaggagatcacagcc
ctggcccccagcaccatgaagatcaagattattgctcccccagagcggaagtactcagtc
tggatcgggggctctatcctggcctctctctccaccttccagcagatgtggatcagcaag
cctgagtatgatgaggcagggccctccattgtccacaggaagtgcttctaa

>gi568815596f:73826911_74058247|GENSCAN_predicted_peptide_3|299_aa
MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIGCGNITTENTGGQ
GPKGRFDAASVGKSTFVKLLTKTYPEWHVATEPVATWQNIQAAGTQKACTAQSLGNLLDM
MYREPARWSYTFQTFSFLSRLKVQLEPFPEKLLQARKPVQIFERSVYSDRYIFAKNLFEN
GSLSDIEWHIYQDWHSFLLWEFASRITLHGFIYLQASPQVCLKRLYQRAREEEKGIELAY
LEQLHGQHEAWLIHKTTKLHFEALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL

>gi568815596f:73826911_74058247|GENSCAN_predicted_CDS_3|900_bp
atggccgcgggccgcctctttctaagtcggcttcgagcacccttcagttccatggccaag
agcccactcgagggcgtttcctcctccagaggcctgcacgcggggcgcgggccccgaagg
ctctccatcgaaggcaacattggttgtggcaatataacaacagagaacactggtggccag
ggtcctaagggaaggtttgatgctgcttctgtgggaaagtccacgtttgtgaagttactc
acgaaaacttacccagaatggcacgtagctacagaacctgtagcaacatggcagaatatc
caggctgctggcacccaaaaagcctgcactgcccaaagtcttggaaacttgctggatatg
atgtaccgggagccagcacgatggtcctacacattccagacattttcctttttgagccgc
ctgaaagtacagctggagcccttccctgagaaactcttacaggccaggaagccagtacag
atctttgagaggtctgtgtacagtgacaggtatatctttgcaaagaatctttttgaaaat
ggttccctcagtgacatcgagtggcatatctatcaggactggcattcttttctcctgtgg
gagtttgccagccggatcacattacatggcttcatctacctccaggcttctccccaggtt
tgtttgaagagactgtaccagagggccagggaggaggagaaaggaattgagctggcctat
ctagagcagctgcatggccaacacgaagcctggcttattcacaagacaacgaagctccac
tttgaggctctgatgaacattccagtgctggtgttggatgtcaatgatgatttttctgag
gaagtaaccaaacaagaagacctcatgagagaggtaaacacctttgtaaagaatctgtaa

>gi568815596f:73826911_74058247|GENSCAN_predicted_peptide_4|172_aa
MIPFLRAAAGVLPQILGPSRLLTVLGEKDLLVAFGGGRKRLWFCPSTYDPTSGSIMSQFQ
VPLAVQPDLPGLYDFPQRQVMVGSFPGSGLSMAGSESQLRGGGDGRKKRKRCGTCEPCRR
LENCGACTSCTNRRTHQICKLRKCEVLKKKVGLLKEVSRPLLGYPVPSSRAR

>gi568815596f:73826911_74058247|GENSCAN_predicted_CDS_4|519_bp
atgatcccttttctgagggctgctgctggtgtcctcccccagatcctgggccccagcaga
ctcttgactgttctaggcgagaaggacctgttggtggcctttggaggtggcaggaaacga
ctgtggttctgccccagcacctatgaccccacctctggcagcatcatgagccagtttcag
gtgcccctggccgtccagccggacctgccaggcctttatgacttccctcagcgccaggtg
atggtagggagcttcccggggtctgggctctccatggctgggagtgagtcccaactccga
gggggtggagatggtcgaaagaaacggaaacggtgtggtacttgtgagccctgccggcgg
ctggaaaactgtggcgcttgcactagctgtaccaaccgccgcacgcaccagatctgcaaa
ctgcgaaaatgtgaggtgctgaagaaaaaagtagggcttctcaaggaggtaagccggccc
ttgctgggctaccctgttccttcctcgagggcacggtga

>gi568815596f:73826911_74058247|GENSCAN_predicted_peptide_5|216_aa
MPGGGVEKSNYLSTKGVKVPGLGGQRKDFAMLTYLPVNTAGLYFKALFKPCFLQEAFSDY
TSLLPTNGASNISHLTPGDANAAGPLSSKVEVRLWNIQVSQDSATGPAPPARCEALHSAV
RETQSTGCQPNASHRDPSPQKQAAPSPAGRGFESHQVQLFHFPQGKDRPREGARVVWALG
QMNGRARGSSKTAQASVSVQRQFYCLERPQRLHRSS

>gi568815596f:73826911_74058247|GENSCAN_predicted_CDS_5|651_bp
atgcctggaggcggggttgagaaatccaattacttaagcaccaagggcgtaaaagttcca
ggcctcggagggcaacggaaggacttcgcgatgttaacttacttgcctgtcaacacagct
gggctttacttcaaagctctgtttaagccctgcttcctccaggaagccttctctgattat
accagtcttctgccaaccaacggcgcttctaacatttctcatctgacccctggtgatgcc
aatgctgctggtccactgagcagcaaggtggaagtgaggctgtggaacatacaggtgtct
caggactcagccacgggaccagccccaccagctcgctgtgaggctctgcactcagcggtc
agggaaacccagagcacgggctgccagcccaacgccagccacagggacccgagtcctcag
aagcaagcagctcccagcccagctggaagaggctttgaaagccatcaagtacaactcttc
cacttcccccaaggaaaagacaggcccagagagggggcccgagttgtctgggcccttgga
caaatgaatggcagagccagggggagctcgaagacagcgcaggcctccgtctctgtccag
cgccagttctactgcctggagcgacctcagcggctacaccgctcctcctaa

>gi568815596f:73826911_74058247|GENSCAN_predicted_peptide_6|751_aa
MESSEGRRVENSSVEGTEEEKENGGGPVTGRVTETQGGQTGSELSPVDGPVPGQMDSGPV
YHGDSRQLSASGVPVNGAREPAGPSLLGTGGPWRVDQKPDWEAAPGPAHTARLEDAHDLV
AFSAVAEAVSSYGALSTRLYETFNREMSREAGNNSRGPRPGPEGCSAGSEDLDTLQTALA
LARHGMKPPNCNCDGPECPDYLEWLEGKIKSVVMEGGEERPRLPGPLPPGEAGLPAPSTR
PLLSSEVPQISPQEGLPLSQSALSIAKEKNISLQTAIAIEALTQLSSALPQPSHSTPQAS
CPLPEALSPPAPFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDT
PPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQSVFKRPEALPTKPKVKVEAPSSSPAP
APSPVLQREAPTPSSEPDTHQKAQTALQQHLHHKRSLFLEQVHDTSFPAPSEPSAPGWWP
PPSSPVPRLPDRPPKEKKKKLPTPAGGPVGTEKAAPGIKPSVRKPIQIKKSRPREAQPLF
PPVRQIVLEGLRSPASQEVQAHPPAPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQ
EMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPIQDPENQQTC
LPAPESPFATRSPKQIKIESSGAVTVLSTTCFHSEEGGQEATPTKAENPLTPTLSGFLES
PLKYLDTPTKSLLDTPAKRAQAEFPTCDCVX

>gi568815596f:73826911_74058247|GENSCAN_predicted_CDS_6|2253_bp
atggagagcagtgagggcaggagagtggagaacagctctgttgaggggacggaggaggaa
aaggagaatggaggaggaccagtaaccggcagggtcacggaaacccagggtgggcagaca
ggctcagagctcagcccagttgatggacctgttccaggtcagatggactcagggccagtg
taccatggggactcacggcagctaagcgcctcaggggtgccggtcaatggtgctagagag
cccgctggacccagtctgctggggactgggggtccttggcgggtagaccaaaagcccgac
tgggaggctgccccaggcccagctcatactgctcgcctggaagatgcccacgatctggtg
gccttttcggctgtggccgaagctgtgtcctcttatggggcccttagcacccggctctat
gaaaccttcaaccgtgagatgagtcgtgaggctgggaacaacagcaggggaccccggcca
gggcctgagggctgctctgctggcagcgaagaccttgacacactgcagacggccctggcc
ctcgcgcggcatggtatgaaaccacccaactgcaactgcgatggcccagaatgccctgac
tacctcgagtggctggaggggaagatcaagtctgtggtcatggaaggaggggaggagcgg
cccaggctcccagggcctctgcctcctggtgaggccggcctcccagcaccaagcaccagg
ccactcctcagctcagaggtgccccagatctctccccaagagggcctgcccctgtcccag
agtgccctgagcattgccaaggaaaaaaacatcagcttgcagaccgccattgccattgag
gccctcacacagctctcctctgccctcccgcagccttctcattccaccccccaggcttct
tgcccccttcctgaggccttgtcacctcctgcccctttcagatctccccagtcttacctc
cgggctccctcatggcctgtggttcctcctgaagagcactcatcttttgctcctgatagc
tctgccttccctccagcaactcctagaactgagttccctgaagcctggggcactgacacc
cctccagcaacgccccggagctcctggcccatgcctcgcccaagccccgatcccatggct
gaactggagcagttgttgggcagcgccagtgattacatccagtcagtattcaagcggcct
gaggccctgcctaccaagcccaaggtcaaggtggaggcaccctcttcctccccggccccg
gccccatcccctgtacttcagagggaggctcccacgccatcctcggagcccgacacccac
cagaaggcccagaccgccctgcagcagcacctccaccacaagcgcagcctcttcctagaa
caggtgcacgacacctccttccctgctccttcagagccttctgctcctggctggtggccc
ccaccaagttcacctgtcccacggcttccagacagaccacccaaggagaagaagaagaag
ctcccaacaccagctggaggtcccgtgggaacggagaaagctgcccctgggatcaagccc
agtgtccgaaagcccattcagatcaagaagtccaggccccgggaagcacagcccctcttc
ccacctgtccgacagattgtcctggaagggcttaggtccccagcctcccaggaagtgcag
gctcatccaccggcccctctgcctgcctcacagggctctgctgtgcccctgcccccagaa
ccttctcttgcgctatttgcacctagtccctccagggacagcctgctgccccctactcag
gaaatgaggtcccccagccccatgacagccttgcagccaggctccactggccctcttccc
cctgccgatgacaagctggaagagctcatccggcagtttgaggctgaatttggagatagc
tttgggcttcccggccccccttctgtgcccattcaggaccccgagaaccagcaaacatgt
ctcccagcccctgagagcccctttgctacccgttcccccaagcaaatcaagattgagtct
tcgggggctgtgactgtgctctcaaccacctgcttccattcagaggagggaggacaggag
gccacacccaccaaggctgagaacccactcacacccaccctcagtggcttcttggagtca
cctcttaagtacctggacacacccaccaagagtctgctggacacacctgccaagagagcc
caggccgagttccccacctgcgattgcgtcgnn