GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:24:00 Sequence gi568815581f:51054374_51261842 : 207469 bp : 42.62% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 2559 2554 6 1.05 1.06 Term - 13639 13623 17 2 2 120 45 26 0.352 -1.08 1.05 Intr - 25331 25211 121 2 1 90 89 184 0.790 17.85 1.04 Intr - 26778 26675 104 1 2 31 64 130 0.480 3.87 1.03 Intr - 27710 27645 66 0 0 44 100 68 0.532 1.56 1.02 Intr - 47078 47003 76 2 1 74 86 27 0.251 -0.73 1.01 Init - 66283 65981 303 1 0 123 78 671 0.907 66.92 1.00 Prom - 73213 73174 40 -8.15 2.00 Prom + 76616 76655 40 -9.95 2.01 Init + 77960 78011 52 1 1 76 100 51 0.983 6.47 2.02 Intr + 79033 79105 73 2 1 90 73 73 0.064 3.55 2.03 Intr + 99131 99289 159 2 0 18 90 162 0.122 7.58 2.04 Intr + 101278 101407 130 1 1 95 108 150 0.902 17.48 2.05 Intr + 105607 105708 102 0 0 66 101 139 0.988 12.45 2.06 Intr + 106787 106899 113 1 2 55 94 76 0.997 3.16 2.07 Term + 107355 107472 118 0 1 101 48 123 0.999 6.63 2.08 PlyA + 108338 108343 6 1.05 3.06 PlyA - 109129 109124 6 -0.45 3.05 Term - 111182 111065 118 1 1 74 50 141 0.973 5.93 3.04 Intr - 111441 111268 174 2 0 123 69 97 0.948 9.43 3.03 Intr - 112309 112072 238 2 1 75 58 197 0.993 11.15 3.02 Intr - 112920 112478 443 2 2 38 91 239 0.151 11.37 3.01 Init - 115325 115186 140 2 2 40 86 131 0.155 7.76 3.00 Prom - 117340 117301 40 -6.85 4.14 PlyA - 117771 117766 6 1.05 4.13 Term - 126321 126203 119 1 2 96 51 146 0.999 9.42 4.12 Intr - 131786 131652 135 0 0 53 91 68 0.649 3.22 4.11 Intr - 138643 138484 160 1 1 92 74 182 0.806 15.84 4.10 Intr - 147323 147219 105 2 0 50 116 58 0.934 4.29 4.09 Intr - 147704 147649 56 0 2 73 116 27 0.911 1.78 4.08 Intr - 148562 148328 235 2 1 53 107 71 0.875 1.84 4.07 Intr - 148855 148767 89 2 2 93 93 57 0.982 5.47 4.06 Intr - 149552 149418 135 0 0 66 100 96 0.663 8.22 4.05 Intr - 155150 155004 147 0 0 83 -6 132 0.248 2.59 4.04 Intr - 164671 164557 115 1 1 83 90 115 0.644 10.30 4.03 Intr - 166090 165957 134 2 2 58 97 10 0.932 -1.66 4.02 Intr - 170836 170635 202 1 1 70 92 190 0.905 15.54 4.01 Init - 188382 188335 48 0 0 67 65 84 0.494 5.00 4.00 Prom - 188439 188400 40 -7.15 5.00 Prom + 198761 198800 40 -5.65 5.01 Init + 202460 202669 210 1 0 62 98 49 0.644 2.13 5.02 Intr + 205783 206553 771 0 0 76 72 769 0.985 64.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 117114 117231 118 0 1 86 48 136 0.934 6.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:51054374_51261842|GENSCAN_predicted_peptide_1|228_aa MELEDGVVYQEEPGGSGAVMSERVSGLAGSIYREFERLIGRYDEEVVKELMPLVVAVLEN LDSVFAQDQEHQVELELLRDDNEQLITQYEREKALRKHAEEASDDGKTNTFLKVKQFFQS VASNQKVLNRLAKEAFTKKVRFEQRCEGIIQALDDCGTHQDGGGNDEKRSDLKVDLLMDW ICEKFIEFEDSQEQEKKDLQTRVESLESQTRQLELKAKNYADQSLSAV >gi568815581f:51054374_51261842|GENSCAN_predicted_CDS_1|687_bp atggagctggaggacggtgtggtgtatcaggaggagcccggcggctccggggccgtgatg tcggagcgggtgtccggcctggccggctccatctaccgcgagttcgagcggcttatcggg cgctatgacgaggaggtggtcaaagagctgatgccgctggtggtggctgtgctggagaac ctggactcggtgttcgcgcaggaccaggagcaccaggtggagctggagctgctgcgggac gacaacgagcagctcatcacccagtacgagcgggagaaggcgctgcgcaagcacgctgag gaggcatcagatgatggcaaaacaaatacctttttaaaggttaagcagttttttcaaagt gtagcatcaaaccagaaagttttaaataggttggccaaagaagccttcaccaagaaggtg agatttgaacaaagatgtgaaggaataatccaggcattagatgattgtggcacacaccag gatggtggtggaaatgatgaaaagcggtcagatctgaaggtagatttgctgatggattgg atatgtgagaaattcattgaatttgaagactctcaagaacaggaaaaaaaggacttacag acccgagtggaatctttagaatctcaaacaagacaacttgagctgaaagcgaaaaactat gctgaccagagcctatcagcagtctga >gi568815581f:51054374_51261842|GENSCAN_predicted_peptide_2|248_aa MRALALGRGEWKKEVQEVLEPPNGRMTPLDLKFQYEETRSQQGSVGERCSAKWALRQRIC AEAFRACKCCEPRGSRARFGCWRLQPEFKPKQLEGTMANCERTFIAIKPDGVQRGLVGEI IKRFEQKGFRLVGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVV KTGRVMLGETNPADSKPGTIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTS CAQNWIYE >gi568815581f:51054374_51261842|GENSCAN_predicted_CDS_2|747_bp atgagggcactggccctggggagaggagagtggaaaaaggaggttcaagaagtattggaa cctcccaatgggagaatgactcctttggacttgaaatttcagtatgaggagacccgtagc cagcaaggaagcgtgggcgagcggtgttctgcaaaatgggctctccggcagcggatctgc gcagaagcgttccgtgcgtgcaagtgctgcgaaccacgtgggtcccgggcgcgtttcggg tgctggcggctgcagccggagttcaaacctaagcagctggaaggaaccatggccaactgt gagcgtaccttcattgcgatcaaaccagatggggtccagcggggtcttgtgggagagatt atcaagcgttttgagcagaaaggattccgccttgttggtctgaaattcatgcaagcttcc gaagatcttctcaaggaacactacgttgacctgaaggaccgtccattctttgccggcctg gtgaaatacatgcactcagggccggtagttgccatggtctgggaggggctgaatgtggtg aagacgggccgagtcatgctcggggagaccaaccctgcagactccaagcctgggaccatc cgtggagacttctgcatacaagttggcaggaacattatacatggcagtgattctgtggag agtgcagagaaggagatcggcttgtggtttcaccctgaggaactggtagattacacgagc tgtgctcagaactggatctatgaatga >gi568815581f:51054374_51261842|GENSCAN_predicted_peptide_3|370_aa MLQNPYAAINTEELVKKGESGQSGIGSVGRASQKTARRGKRKTNMLGWTRVGRRRGSGAR GFSAPCCPPAAGGQVAARGKEPKPGGRVTLPALHLPKGVCPHHPGKGSAGPSRRGRGQRA KGFRGSEMNSESRRGQTAAGREKHPRAGCSGEGETGASYPEELHGHEAESLLLEALDDLA HQAALHAVRLDGDEGPASLCGARRPAGRKLQGNLPLRRRAGPAGNRGAPTDRAHLRQRKR KREREESPGDAPPPPYREAGGAGRRRCAGRAESWMAFRLLSGMLFSQDALLPGCRLPTYI SSPNQPPHSHPDMGIPAHLALKCGRVFQFTLSVPGKQAPFGDAQLPWGGHKPLLSAPAAL TDSRTAVVLP >gi568815581f:51054374_51261842|GENSCAN_predicted_CDS_3|1113_bp atgttgcagaatccttatgcagccatcaacacagaagaactggtgaagaaaggagaaagt ggccagtctggaataggatcagtgggaagggcatcccagaagactgctagaagagggaaa agaaagactaacatgctagggtggacccgagtgggcaggcgcagggggtctggggcccgg ggtttttccgctccctgctgcccccctgcggcgggtgggcaggtagcggcccgcgggaag gaacccaaacccggagggcgagtcacgctaccagcgcttcaccttcccaagggcgtctgt ccccaccaccccgggaaagggtcggcgggcccaagccgccgtgggcgggggcagagggca aagggatttcgcggcagcgaaatgaactcggaaagcagacgcggacaaaccgccgccggg agggaaaaacacccgcgagccggctgcagcggggaaggggagacgggggcgagttacccg gaggaacttcatggccacgaggcggaatcccttctgctcgaagcgcttgatgatctcgcc caccaggccgcgctgcacgccgtccggcttgatggcgatgaagggccggcgtccctctgc ggagcccgaaggcccgcggggcgcaagctgcaagggaatctcccgctccgacgccgcgcg ggacctgcgggaaatcggggggcgcccacagaccgggcccacctgcggcagcgcaagcgg aagagggaacgggaggagtcgccaggggatgcgcccccaccgccttaccgggaagctgga ggggccgggcggcggcgctgcgctgggagagcagagagctggatggcgttcagactcctt agcgggatgctcttctcgcaggatgccctgctccctggctgccgcctccccacctacatt tccagccccaatcagccgccccactcccaccctgacatggggatccctgcccacctggca ctaaagtgcggtcgagttttccagttcaccctgagtgtccctggcaaacaagctcccttt ggtgatgcccagcttccctggggcggtcacaagcccctcctctcggcgcctgcagcactg actgacagccgcacagcagtggtactgccctga >gi568815581f:51054374_51261842|GENSCAN_predicted_peptide_4|559_aa MNRKVIKLVTRLTGWVTEHSSRSERKRRDSFGMFDGYDSCSEDTSSSSSSEESEEEVAPL PSNLPIIKNNGQVYTYPDGKSGMATCEMCGMVGVRDAFYSKTKRFCSVSCSRSYSSNSKK ASILARLQGKPPTKKAKVLQKQPLVAKLAAYAQYQATLQNQAKTKAVMWRLSYICVGDPS AFGFGHVLQDLMTAYLAGKESRSQIALTSGPGAGSGYNALLRYEGFENDSGLDFWCNICG SDIHPVGWCAASGKPLVPPRTIQHKYTNWKAFLVKRLTGAKTLPPDFSQKVSESMQYPFK PCMRVEVVDKRHLCRTRVAVVESVIGGRLRLVYEESEDRTDDFWCHMHSPLIHHIGWSRS IGHRFKRSDITKKQDGHFDTPPHLFAKVKEVDQSGEWFKEGMKLEAIDPLNLSTICVATI RKDVPNHGFRVGMKLEAVDLMEPRLICVATVTRIIHRLLRIHFDGWEEEYDQWVDCGVRR SFSGDEELTPPPYRTLPAQTAPEAFPPPSTSREFSPSLKTVTTLQLKEELLDGEDYNFLQ GASDQESNGSANFYIKQEP >gi568815581f:51054374_51261842|GENSCAN_predicted_CDS_4|1680_bp atgaaccggaaagtgataaagttggttacccgcttaacagggtgggtgactgaacattct tcacgatctgaaaggaaaagaagagattctttcgggatgtttgacggttatgatagctgc agtgaggacacaagcagcagctccagctccgaagagagtgaggaagaagtcgctccttta ccttctaatctcccgattatcaaaaacaatgggcaagtctacacatacccagatggtaaa tctggcatggctacctgtgagatgtgtgggatggttggcgtccgagatgctttttactct aaaacaaagcgtttctgtagcgtttcatgttcaagaagttactcgtcaaactccaagaag gcaagcattttggccagacttcagggtaagcctccaacaaagaaagcaaaagttcttcag aaacaacctttagttgctaagctagccgcatatgctcagtatcaagctaccttgcaaaat caagcaaagacaaaagcagtcatgtggaggttgtcatacatttgtgttggggacccctca gcatttggatttggccatgtcctgcaggacctgatgacagcttaccttgcagggaaggag agtcgctcccagatagccctcacgagtggccctggagcaggaagtggttacaatgccctt ttaagatatgaaggatttgaaaatgactctggtctggacttctggtgcaatatatgtggt tctgatatccatccagttggttggtgtgcagccagcggaaaacctcttgttcctcctaga actattcagcataaatatacaaactggaaagcttttctagtgaaacgacttactggtgcc aaaacactgcctcctgatttctcccaaaaggtttcagagagtatgcagtatcctttcaaa ccttgcatgagagtagaagtggttgacaagaggcatttgtgtcgaacacgagtagcagtg gtggaaagtgtaattggaggaagattaagactagtgtatgaagaaagcgaagatagaaca gatgacttctggtgccatatgcacagcccattaatacatcatattggttggtctcgaagc ataggtcatcgattcaaaagatctgatattacaaagaaacaggatggacattttgataca ccaccacatttatttgctaaggtaaaagaagtagaccagagtggggaatggttcaaggaa ggaatgaaattggaagctatagacccattaaatctttctacaatatgtgtcgcaaccatt agaaaggatgttccaaatcacggatttcgtgtaggaatgaaattagaagcagtagatctc atggagccacgtttaatatgtgtagccacagtaactcgaattattcatcgtctcttgagg atacattttgatggatgggaagaagagtatgatcagtgggtagactgtggggtgcggagg agcttctctggtgacgaagagttgactcctcctccatatcgaacccttccagcacagaca gccccggaggccttcccacctcccagcactagccgagagttcagtcccagcctcaaaaca gtgacaacactgcagctgaaggaggagttgctggatggagaggattataatttccttcaa ggagcgtctgatcaggaaagcaatggctctgccaacttctacatcaaacaagagccatga >gi568815581f:51054374_51261842|GENSCAN_predicted_peptide_5|327_aa MRKWRHRILSNKGHGATQWQNCDINLTASTPLIVSLHPISYSGFVQMGIISFSPFIVGSS KARREILNLQAPYSKDRTKTKGGKRIGACVKSSGYSQEEKQKFHSAVPVYSRGFLRSFQY VAAAKLCWKNFPLQSRRFPLHPEDWPSRRNSHTGRGWVGERELRRGRECWSRGKRLRRLL PAPGASAAAAREGPGRMRSEVPRERLRFSSNLTMPPERRRRMKLDRRTGAKPKRKPGMRP DWKAGAGPGGPPQKPAPSSQRKPPARPSAAAAAIAVAAAEEERRLRQRNRLRLEEDKPAV ERCLEELVFGDVENDEDALLRRLRGPR >gi568815581f:51054374_51261842|GENSCAN_predicted_CDS_5|981_bp atgagaaaatggaggcacaggatcttgtccaacaagggccatggagctactcaatggcag aattgtgacatcaacctcactgccagcacccctcttattgtatcacttcatcctatctca tattcagggtttgttcaaatgggaattatttccttttcaccttttattgtgggttccagc aaagcaagaagagaaatcctaaatttgcaggctccgtactccaaggaccgtaccaagacg aagggtggaaagcgtattggggcttgtgtcaagtcaagtggttacagccaagaggaaaag cagaagttccattcagccgtcccagtgtatagtcggggtttcttgcgctcctttcaatat gttgcagctgcgaaattgtgttggaaaaattttccactccagtcacgccgcttcccccta cacccggaagattggccctcccggaggaactcccacacggggcggggctgggtaggggag cgggagttacgtagagggagggagtgctggtcacgtggcaagaggctgcgcaggctcctt ccggcccccggggcttcggcggcggcggcccgcgaggggcctgggcgcatgcgcagcgag gttccacgtgagcgcctgcgtttctcctcaaacctaacgatgccgccggagcggaggaga cgaatgaaactggaccggagaaccggagcgaagccgaagcggaagcccggaatgaggccg gactggaaagccggagcggggccaggcgggcctccccaaaagcctgccccttcatcccag cggaaaccgccggcccggccgagcgcggcggccgctgcgattgcagtcgcggcggcggag gaagagagacggctccggcagcggaaccgcctgaggctggaggaggacaaaccggccgtg gagcggtgcttggaggagctggtcttcggcgacgtcgagaacgacgaggacgcgttgctg cggcgtctgcgaggcccgagg