GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:01:24

Sequence gi568815592r:2733611_2940586 : 206976 bp : 43.92% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   4594   4633   40                              -2.46
 1.01 Init +  32013  32834  822  2  0  120  113  1236 0.725 124.02
 1.02 Intr +  35081  35272  192  1  0  100   76    92 0.997   8.89
 1.03 Intr +  36585  36751  167  2  2   85   89   116 0.542  10.16
 1.04 Intr +  45653  45882  230  2  2  100  111   126 0.887  13.61
 1.05 Intr +  49796  49951  156  0  0   60   95   217 0.980  19.58
 1.06 Intr +  50714  50793   80  0  2  116   88   -16 0.995   0.47
 1.07 Intr +  51397  51642  246  0  0   77   28   392 0.747  29.76
 1.08 Intr +  52400  52563  164  1  2  106   66    27 0.274   1.07
 1.09 Term +  55568  55778  211  2  1   42   39   112 0.272  -1.53
 1.10 PlyA +  56288  56293    6                               1.05

 2.06 PlyA -  56367  56362    6                               1.05
 2.05 Term -  57876  57768  109  2  1   94   48    57 0.128   0.28
 2.04 Intr -  64362  64227  136  1  1   99   72   250 0.969  24.13
 2.03 Intr -  66443  66325  119  1  2   -2   43    92 0.016  -4.19
 2.02 Intr -  68722  68621  102  0  0   89  105    38 0.134   4.89
 2.01 Init -  71399  71260  140  2  2   89    3   105 0.134   1.71
 2.00 Prom -  75014  74975   40                              -3.56

 3.00 Prom +  75953  75992   40                              -6.36
 3.01 Init +  77537  77603   67  1  1   67   98    89 0.466   9.03
 3.02 Term +  85739  85833   95  0  2  108   53    59 0.330   2.39
 3.03 PlyA +  86341  86346    6                               1.05

 4.09 PlyA -  86866  86861    6                               1.05
 4.08 Term -  89714  89638   77  0  2   49   37    81 0.127  -2.90
 4.07 Intr - 100402 100036  367  1  1   74   69   149 0.787   6.32
 4.06 Intr - 102413 102246  168  2  0   36   86   347 0.991  29.44
 4.05 Intr - 102640 102498  143  2  2   74  100   100 0.999   9.87
 4.04 Intr - 104389 104272  118  1  1   75  100    67 0.880   6.64
 4.03 Intr - 106984 106809  176  2  2   79   82   195 0.524  17.66
 4.02 Intr - 108373 108202  172  1  1   62   90    25 0.547  -0.38
 4.01 Init - 114865 114776   90  1  0   78   49    78 0.315   3.39
 4.00 Prom - 121232 121193   40                              -6.66

 5.16 PlyA - 122824 122819    6                              -0.45
 5.15 Term - 126999 126512  488  1  2   53   55   259 0.147  13.96
 5.14 Intr - 129245 129201   45  0  0   65   98    51 0.122   2.18
 5.13 Intr - 132701 132665   37  2  1  119   49    38 0.020   0.84
 5.12 Intr - 142815 142683  133  2  1   78   72   104 0.004   8.45
 5.11 Intr - 156960 156641  320  0  2   81   23   218 0.027   9.36
 5.10 Intr - 158378 158223  156  2  0   67  107   305 0.997  30.51
 5.09 Intr - 159943 159801  143  2  2   45   75   100 0.985   4.47
 5.08 Intr - 161898 161781  118  0  1   78   77    40 0.984   1.94
 5.07 Intr - 162580 162443  138  1  0   62   89   110 0.796   9.16
 5.06 Intr - 167011 166834  178  0  1  117  115    47 0.988  10.22
 5.05 Intr - 170045 169940  106  0  1   93   47    62 0.373   1.87
 5.04 Intr - 172615 172585   31  1  1  114   80     2 0.021  -0.30
 5.03 Intr - 185834 185690  145  1  1   85   96     3 0.106   1.08
 5.02 Intr - 187585 187475  111  0  0   45   80    66 0.006   0.99
 5.01 Intr - 206877 206692  186  2  0   74   14   181 0.114   8.10

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 147579 147899  321  2  0   40   45   173 0.845   4.19

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:2733611_2940586|GENSCAN_predicted_peptide_1|755_aa
MEVSGPEDDPFLSQLHQVQCPVCQQMMPAAHINSHLDRCLLLHPAGHAEPAAGSHRAGER
AKGPSPPGAKRRRLSESSALKQPATPTAAESSEGEGEEGDDGGETESRESYDAPPTPSGA
RLIPDFPVARSSSPGRKGSGKRPAAAAAAGSASPRSWDEAEAQEEEEAVGDGDGDGDADA
DGEDDPGHWDADAAEAATAFGASGGGRPHPRALAAEEIRQMLQGKPLADTMRPDTLQDYF
GQSKAVGQDTLLRSLLETNEIPSLILWGPPGCGKTTLAHIIASNSKKHSIRFVTLSATNA
KTNDVRDVIKQAQNEKSFFKRKTILFIDEIHRFNKSQQVNAALLSRCRVIVLEKLPVEAM
VTILMRAINSLGIHVLDSSRPTDPLSHSSNSSSEPAMFIEDKAVDTLAYLSDGDARAGLN
GLQLAVLARLSSRKMFCKKSGQSYSPSRVLITENDVKEGLQRSHILYDRAGEEHYNCISA
LHKSMRGSDQNASLYWLARMLEGGEDPLYVARRLVRFASEDIGLADPSALTQAVAAYQGC
HFIGMPECEVLLAQCVVYFARAPKSIEVYSAYNNVKACLRNHQGPLPPVPLHLRNAPTRL
MKDLGYGKGYKYNPMYSEPVDQEYLPEELRGLHISKQSFFFYFMCVLNTKYATPSAQLRV
ICPYCYFLSLSLVACRAYAHKDGPFRHAALNTVCNADTGPVLEHAVLGRACSTLSAVSSS
GALSSQLSVTVKSSLKVCKCLFFGFKESYFVKADF

>gi568815592r:2733611_2940586|GENSCAN_predicted_CDS_1|2268_bp
atggaggtgagcgggccggaagacgaccccttcctttcgcagctgcaccaggtgcagtgc
cccgtgtgccagcagatgatgcccgccgcgcacatcaactcgcacctggaccgctgtctg
ctgctccacccggcggggcacgcggagcccgcggccgggtcgcaccgcgccggggagcgg
gccaaggggccctcgccgcccggcgccaagaggcggcggctgtcggagagctcggcgctg
aagcagccagccaccccgacggcagccgagagcagcgagggcgagggtgaggagggcgac
gacggcggcgagaccgagagccgcgagagctacgacgcgccgcccacacccagcggcgcc
cgccttatccccgacttcccggtggcccgctccagcagccccgggaggaaggggtcgggg
aagaggccggcggccgccgccgcggcggggagcgcgtctccgcgcagctgggacgaggcg
gaggcgcaggaggaggaggaggccgtgggcgacggcgatggcgacggggacgcggacgcg
gacggcgaggacgacccggggcactgggacgcggacgctgccgaagccgccaccgccttc
ggggccagtggcgggggccgcccgcacccccgggcgctggctgccgaggagatccgacag
atgctacagggcaagccgctggccgacacgatgcgtcctgacacgctgcaggattacttc
gggcagagcaaggccgtgggccaggataccctgctgcgctcgctcctggagaccaacgaa
atcccctcgcttatcctgtgggggccgccgggctgcggcaagaccactctggctcacatc
atagccagcaacagcaagaaacatagcataaggtttgtgacattatctgcaacaaatgcc
aagacaaatgatgtgcgagatgtcataaaacaagctcaaaatgaaaagagctttttcaaa
aggaaaaccatcctttttattgatgagattcatcggttcaataaatctcagcaggtcaac
gctgctcttctgagccgctgtcgagtgattgttcttgagaagcttccagtagaggcaatg
gtgactattttaatgcgagcgatcaactccctgggaatccacgtcctagactctagccgt
cccactgaccctctgagccacagcagcaacagcagctcagagcccgccatgttcatagag
gataaagcagtagacaccctggcttacctcagtgacggtgacgcccgagctgggttgaac
ggactgcagctggcggtgctggctaggttaagctctaggaagatgttctgtaagaagagt
gggcaatcctattctcccagtagagttctgatcacagagaatgacgtgaaggagggccta
cagcgatcccacattttatatgaccgggcaggtgaggagcattacaactgcatctccgcc
ctgcacaagtccatgcggggctcagaccagaacgcctccctctactggctggctcgcatg
ctcgagggaggagaggacccactctacgtggcacggaggcttgtcaggtttgccagcgag
gacataggtctggcagacccgtctgcgttaacacaagcggttgctgcctaccaaggctgt
cattttataggcatgcctgaatgtgaggtgcttctggcccagtgtgtggtctactttgcc
agagccccaaagtccattgaggtgtacagcgcctacaacaacgtcaaagcctgcctgagg
aaccaccaggggccactgccccccgtgcccctgcacctgaggaacgcgcccactaggctg
atgaaggatttgggctatggcaaaggctacaagtacaaccccatgtacagcgagcctgtg
gatcaggagtacctgcctgaagagttgagggggctccatatttctaaacagtcgtttttc
ttttactttatgtgtgtcctgaacacaaaatacgccactccttctgctcagttaagagtt
atttgtccctactgctacttcctctccctctccttagttgcatgtcgtgcatatgcccac
aaggatggccccttcagacatgcggccctgaacacagtgtgtaacgcggacacggggcct
gtgctggagcacgcggtgctggggagagcctgcagcacactttctgctgtatccagcagc
ggggcactgagcagccaactttctgtaactgtaaaatcaagcttgaaagtttgcaaatgc
ttgttttttggtttcaaagaatcttattttgtgaaagcagatttttaa

>gi568815592r:2733611_2940586|GENSCAN_predicted_peptide_2|201_aa
MVEDLCIIGIDTWAAVAQNVTPPRRHALTATAAQIHSCLPPKLPKPQRCPHAIKKIRIGL
VAAICAGETELRNSVSMRLGWARTQTVTQLHLLMYVLTTQEKGPPEVNAPERVFDTPDFA
ACPGQHGGRGGVGAGEEEEKEEEEEEEEEEEEEEEEEEEEEEICLGLTTNTETQSWPEPG
DLELVEGLVRDSSDTGSLFPG

>gi568815592r:2733611_2940586|GENSCAN_predicted_CDS_2|606_bp
atggttgaggacctctgcattataggcatcgacacttgggctgctgtggcacagaacgtc
acaccacctagacggcatgccctcacagcaacagcggcgcagatccactcctgcctacca
cctaagctccccaagccccagaggtgtccccatgcaataaagaaaataagaattggatta
gtagcagccatctgtgcaggggagactgagctccgcaactctgtgtccatgaggcttggc
tgggctcgaacgcagaccgtcacccagctccatttacttatgtatgttctaacaacacag
gaaaaaggacctccagaagtgaatgccccagaacgagtgtttgacactcctgactttgca
gcctgcccagggcagcatggaggaaggggaggagtaggagcaggggaggaggaggagaag
gaggaggaggaggaagaggaggaggaggaggaggaagaggaggaggaggaggaggaggag
gaggagatctgccttgggttaacgacaaatactgaaactcagtcttggcctgaacctggt
gacctggagctggtggaaggcctggtgagagacagctcagacactggctcactctttcct
ggctga

>gi568815592r:2733611_2940586|GENSCAN_predicted_peptide_3|53_aa
MSEDASPPAKDPGAKSTEGILEGLLASKDSTILIPGPQPRAQFPVFRNINTWI

>gi568815592r:2733611_2940586|GENSCAN_predicted_CDS_3|162_bp
atgagcgaggatgcctctccaccagcaaaagatccaggtgcaaagagcactgaaggcatc
ttggagggactccttgccagcaaggactccactatacttattcctggtccccagccacga
gcacagtttccagtattcagaaacattaatacgtggatctga

>gi568815592r:2733611_2940586|GENSCAN_predicted_peptide_4|436_aa
MAPTPLLDVAAYASTSAKTFLAILVNIDFVATSGARTPTQSPGRAAAPPAAAGPGDASAC
YKSSGPRCLLPDLAPSSEPGACLGGLSVFTMEQLSSANTRFALDLFLALSENNPAGNIFI
SPFSISSAMAMVFLGTRGNTAAQLSKEFLVSTQKTYGADLASVDFQHASEDARKTINQWV
KGQTEGKIPELLASGMVDNMTKLVLVNAIYFKGNWKDKFMKEATTNAPFRLNKKDRKTVK
MMYQKKKFAYGYIEDLKCRVLELPYQGEELSMVILLPDDIEDESTGLKKIEEQLTLEKLH
EWTKPENLDFIEVNVSLPRFKLEESYTLNSDLARLGVQDLFNSSKADLSGMSGARDIFIS
KIVHKSFVEVNEEGTEAAAATAGIATFCMLMPEENFTADHPFLFFIRHNSSVLEATKSKI
KVQEDLMSGKGLLTGS

>gi568815592r:2733611_2940586|GENSCAN_predicted_CDS_4|1311_bp
atggcccctacccccttactggatgtggctgcttatgcttctaccagtgctaaaaccttc
ctcgccatattggtgaatattgactttgtggcgacctcgggagctcggactcctacgcag
tcaccgggaagggccgccgccccgcccgcggctgctggcccgggtgacgcttccgcctgc
tataagagcagcggccctcggtgcctccttcctgacctcgcacccagctcggagcccgga
gcgtgcctcggcggcctgtcggttttcaccatggagcagctgagctcagcaaacacccgc
ttcgccttggacctgttcctggcgttgagtgagaacaatccggctggaaacatcttcatc
tctcccttcagcatttcatctgctatggccatggtttttctggggaccagaggtaacacg
gcagcacagctgtccaaggagttcttggtttcgactcagaaaacatatggtgctgacctg
gccagtgtggattttcagcatgcctctgaagatgcaaggaagaccataaaccagtgggtc
aaaggacagacagaaggaaaaattccggaactgttggcttcgggcatggttgataacatg
accaaacttgtgctagtaaatgccatctatttcaagggaaactggaaggataaattcatg
aaagaagccacgacgaatgcaccattcagattgaataagaaagacagaaaaactgtgaaa
atgatgtatcagaagaaaaaatttgcatatggctacatcgaggaccttaagtgccgtgtg
ctggaactgccttaccaaggcgaggagctcagcatggtcatcctgctgccggatgacatt
gaggacgagtccacgggcctgaagaagattgaggaacagttgactttggaaaagttgcat
gagtggactaaacctgagaatctcgatttcattgaagttaatgtcagcttgcccaggttc
aaactggaagagagttacactctcaactccgacctcgcccgcctaggtgtgcaggatctc
tttaacagtagcaaggctgatctgtctggcatgtcaggagccagagatatttttatatca
aaaattgtccacaagtcatttgtggaagtgaatgaagagggaacagaggcggcagctgcc
acagcaggcatcgcaactttctgcatgttgatgcccgaagaaaatttcactgccgaccat
ccattccttttctttattcggcataattcctcagtgctggaggccacaaagtccaagatc
aaggtacaggaagatttgatgtctggcaagggcctgcttactggttcctag

>gi568815592r:2733611_2940586|GENSCAN_predicted_peptide_5|778_aa
XNKVTIFAVSATAHKGNADAKCKRTNLITAQKTPTGYHYKLRQPAFILLSGPTHILLIGR
TKCKQQDLDSTPGVSVAVVMVMLVAGTSQVLESWKCGHHGTSWHRDADGIQGLSLEVPLM
PTPYCSWKGVLRQTLREGSWVLHKKELRDFKEVCQMPTDALFLRAEALDAAGSQPTSLGC
QVLNPVLKRPWRRGPCIMETLSNASGTFAIRLLKILCQDNPSHNVFCSPVSISSALAMVL
LGAKGNTATQMAQALSLNTEEDIHRAFQSLLTEVNKAGTQYLLRTANRLFGEKTCQFLST
FKESCLQFYHAELKELSFIRAAEESRKHINTWVSKKTEGKIEELLPGSSIDAETRLVLVN
AIYFKGKWNEPFDETYTREMPFKINQEEQRPVQMMYQEATFKLAHVGEVRAQLLELPYAR
KELSLLVLLPDDGVELSTVEKSLTFEKLTAWTKPDCMKSTEVEVLLPKFKLQEDYDMESV
LRHLGIVDAFQQGKADLSAMSAERDLCLSKFVHKSFVEVNEEGTEAAAASSCFVVAECCM
ESGPRLEKNGPDPLDSSDERIQREAFESVLAETFFTKEPLRCWCPWRAWGITKSRLISIS
SGDTFHIGYDWSIILEDSGAQLASPSGPHTGGAGGAACQSRTVRSHSSALGWSMGLGAVE
QGVVLVGEARAVQEPMEWVGGSGMAGCRSRALPRGKAAKARREIERSAGGLALLRDPVHP
PQPLARVLSPPLPGASRAGCSECAARQAHVHPELQLASKRCTQPRFPLVPLPPHLPAS

>gi568815592r:2733611_2940586|GENSCAN_predicted_CDS_5|2337_bp
nnaaataaagtgacaattttcgcagtgagcgctacagctcataaaggcaacgcagacgcc
aaatgcaaacgaacaaaccttatcacagcacagaaaaccccaacgggttaccactacaag
ctccggcagcctgcttttattcttttatctggccccacccacatcctgctgattggtaga
accaagtgcaaacagcaggacctagactcaacccctggtgtttcagtagcagtagtgatg
gtgatgctggtggcaggaacctcccaagtgctggagtcttggaagtgtgggcaccatgga
acttcctggcacagggatgcagatggcatccaaggtctgtctctagaagtgccactcatg
ccaacgccctactgttcttggaaaggggtcctgagacaaactctaagagaaggttcttgg
gtcttgcacaagaaagaactcagggactttaaggaagtgtgtcaaatgcccacagacgcc
ttgtttcttcgcgcggaggccctggacgccgcaggctcccaacctacttctctgggctgt
caggttctgaacccggtcctgaagaggccctggcgccgggggccctgcatcatggaaact
ctttctaatgcaagtggtacttttgccatacgccttttaaagatactgtgtcaagataac
ccttcgcacaacgtgttctgttctcctgtgagcatctcctctgccctggccatggttctc
ctaggggcaaagggaaacaccgcaacccagatggcccaggcactgtctttaaacacagag
gaagacattcatcgggctttccagtcgcttctcactgaagtgaacaaggctggcacacag
tacctgctgagaacggccaacaggctctttggagagaaaacttgtcagttcctctcaacg
tttaaggaatcctgtcttcaattctaccatgctgagctgaaggagctttcctttatcaga
gctgcagaagagtccaggaaacacatcaacacctgggtctcaaaaaagaccgaaggtaaa
attgaagagttgttgccgggtagctcaattgatgcagaaaccaggctggttcttgtcaat
gccatctacttcaaaggaaagtggaatgaaccgtttgacgaaacatacacaagggaaatg
ccctttaaaataaaccaggaggagcaaaggccagtgcagatgatgtatcaggaggccacg
tttaagctcgcccacgtgggcgaggtgcgcgcgcagctgctggagctgccctacgccagg
aaggagctgagcctgctggtgctgctgcctgacgacggcgtggagctcagcacggtggaa
aaaagtctcacttttgagaaactcacagcctggaccaagccagactgtatgaagagtact
gaggttgaagttctccttccaaaatttaaactacaagaggattatgacatggaatctgtg
cttcggcatttgggaattgttgatgccttccaacagggcaaggctgacttgtcggcaatg
tcagcggagagagacctgtgtctgtccaagttcgtgcacaagagttttgtggaggtgaat
gaagaaggcaccgaggcagcggcagcgtcgagctgctttgtagttgcagagtgctgcatg
gaatctggccccagactggaaaagaatggcccagatcctctggactcctcagatgagcgg
attcagagagaagcttttgagagcgtgctggcggagacatttttcacaaaagagcccttg
cggtgctggtgtccgtggcgtgcctgggggatcaccaagagccgcctcatcagcataagc
tcaggggacacctttcacattgggtatgactggagcatcatccttgaagactcaggagcc
cagttggcttcacccagtggaccccacactgggggtgcaggtggagctgcctgccagtcc
cgcaccgtgcgctcgcattcctcagcccttgggtggtcgatgggactgggcgctgtggag
cagggggtggtgcttgtcggggaggctcgggccgtacaggagcccatggagtgggtggga
ggctcaggcatggcgggctgcaggtcccgagccctgccccgcgggaaggcagctaaggcc
cggcgagaaatcgagcgcagcgccggtgggctggcactgctgcgggacccagtacaccct
ccgcagccgctggcccgggtgctaagtcccccattgcccggggccagcagggccggctgc
tccgagtgcgcggcccgccaagcccacgtccacccggaactccagctggccagcaagcgc
tgcacgcagccccggttcccgctcgtgcctctccctccacacctccctgcaagctga