GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:48:56 Sequence gi568815591r:23405745_23621840 : 216096 bp : 42.24% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 490 681 192 2 0 63 113 88 0.436 7.34 1.02 Intr + 732 854 123 1 0 41 44 102 0.016 0.74 1.03 Intr + 2959 3046 88 2 1 63 70 48 0.017 -1.39 1.04 Term + 9080 9866 787 2 1 78 36 872 0.029 72.81 1.05 PlyA + 10486 10491 6 1.05 2.05 PlyA - 16567 16562 6 1.05 2.04 Term - 50707 50640 68 2 2 56 48 89 0.634 -1.18 2.03 Intr - 55881 55664 218 2 2 49 96 155 0.642 9.62 2.02 Intr - 62798 62738 61 0 1 134 105 70 0.989 10.17 2.01 Init - 64366 64192 175 1 1 68 80 407 0.989 37.46 2.00 Prom - 67667 67628 40 -7.05 3.03 PlyA - 68100 68095 6 1.05 3.02 Term - 68925 68192 734 1 2 66 55 337 0.533 20.84 3.01 Init - 69053 68984 70 2 1 86 49 75 0.930 4.60 3.00 Prom - 70339 70300 40 -5.15 4.00 Prom + 71308 71347 40 -4.45 4.01 Sngl + 84667 85470 804 0 0 56 38 709 0.921 56.26 4.02 PlyA + 85638 85643 6 1.05 5.10 PlyA - 85801 85796 6 1.05 5.09 Term - 100069 99994 76 0 1 59 43 100 0.571 -1.07 5.08 Intr - 100522 100394 129 0 0 18 95 169 0.825 9.49 5.07 Intr - 101791 101676 116 1 2 107 52 98 0.999 6.43 5.06 Intr - 107338 107150 189 1 0 37 84 176 0.491 10.96 5.05 Intr - 110784 110619 166 2 1 58 83 109 0.207 6.44 5.04 Intr - 116096 115963 134 2 2 58 111 104 0.265 8.22 5.03 Intr - 126102 125976 127 2 1 44 0 192 0.236 5.76 5.02 Intr - 126535 126447 89 1 2 47 105 48 0.712 0.25 5.01 Init - 131900 131832 69 2 0 41 63 81 0.380 2.00 5.00 Prom - 132854 132815 40 -3.35 6.03 PlyA - 134817 134812 6 1.05 6.02 Term - 157461 156834 628 2 1 33 34 693 0.040 51.24 6.01 Init - 165656 165592 65 2 2 91 76 88 0.976 8.57 6.00 Prom - 165811 165772 40 -5.75 7.04 PlyA - 167037 167032 6 1.05 7.03 Term - 180830 179318 1513 1 1 15 47 1526 0.873 130.19 7.02 Intr - 191566 191373 194 1 2 -39 93 191 0.403 4.27 7.01 Init - 208553 208419 135 2 0 44 53 151 0.081 7.39 7.00 Prom - 214322 214283 40 -3.25 8.02 PlyA - 215082 215077 6 1.05 8.01 Term - 215867 215182 686 0 2 74 42 280 0.874 14.80 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 732 880 149 1 2 41 49 135 0.823 1.98 S.002 Init - 208553 208415 139 2 1 44 66 169 0.902 10.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:23405745_23621840|GENSCAN_predicted_peptide_1|396_aa XLDMYTVTGSAQPRKVSKVLAFLIGESKREASGKQNTNQIVSGIGFFQWVLGLADFKNEA VDSHDVQMCPEFLPSSGFVVLLTSGVKPQTFAVSVTALKGSVDPKSTYPKELKAESQRDI YSPTFTVALITIAKRSRPSVDISRPCPSVGIPTSAGPCPSVGITTSASPSPSVSITASIG PHPSVSITASAGPVLQLVSPHHQVPSVSQHHPTRRSPSVSRLYHISRSRPSVGFTALPGP VCQSASPHQQVPSVSRLHCNTRSRPSVGINASPGPYPSVGITASPSPCTSVSITTSAGPR SSASPHPQVPSISRHHRIARSPSISRHHRIRRSPSVSRHHRIRRSPSVSRHHRIRRSPSV SRHHCIRRSRPSVGITASAGPCLSAGVTESAVGYSL >gi568815591r:23405745_23621840|GENSCAN_predicted_CDS_1|1191_bp nccctggacatgtatacagttacaggaagtgcacaaccacgcaaggtatctaaggtcctt gcttttctgattggagaatcaaaaagggaagcctcagggaaacaaaatactaatcaaatt gtgtccggaattggtttcttccagtgggttctcggtctcgctgacttcaagaatgaagcc gtggactcccacgatgttcagatgtgtccagagtttcttccttccagtgggttcgtggtc ttgctgacttcaggagtgaagccgcagacctttgcagtgagtgttacagctcttaaaggt agtgtggacccaaagagtacatacccaaaagaattgaaagcagagtctcaaagagatatt tattcacctacattcacggtggcattgatcacaatagccaaaaggtcccgtccatcagtc gacatcagcaggccctgtccgtcagtcggcatccccacatccgcaggtccctgtccatca gtcggcatcaccacatctgcaagtccctctccgtcagtcagcatcactgcatccataggt ccccatccgtcagtcagcatcactgcatctgcaggtcccgtccttcagttggtatcaccg catcaccaggtcccatccgtcagtcagcatcaccccacccgcaggtccccatctgtcagt cggctttaccacattagcaggtcccgtccgtcagtcggcttcactgcattaccaggtcct gtctgtcagtcagcttcaccacatcagcaggtcccgtccgtcagtcggcttcactgcaac accaggtcccgtccgtcagtcggcatcaacgcatcaccaggtccctacccatcagtcggc atcaccgcatcaccaagtccctgtacgtcagtcagcatcaccacatccgcaggtccccgt tcatcggcttcaccgcatccgcaggtcccgtccatcagtcggcatcaccgcatcgccagg tccccctccatcagtcggcatcaccgcatccgcaggtccccctccgtcagccggcatcac cgcatccgcaggtccccctccgtcagtcggcatcaccgcatccgcaggtccccctccgtc agtcggcatcactgcatccgcaggtcccgtccgtcagtcggcatcaccgcatccgcaggt ccctgtctgtcagccggcgtcactgaatcagcagttggttacagcctctag >gi568815591r:23405745_23621840|GENSCAN_predicted_peptide_2|173_aa MNKLYIGNLSENAAPSDLESIFKDAKIPVSGPFLVKTGYAFVDCPDESWALKAIEALSGK IELHGKPIEVEHSVPKRQSCKEQPTGFADGLAEEYQKSEELQDFLARATGWKMVPFTKME NIGRGADSWVENRNFGFDMVFLILTRNLSEEETNSSFLRRKLQEGPHFEEEFD >gi568815591r:23405745_23621840|GENSCAN_predicted_CDS_2|522_bp atgaacaaactgtatatcggaaacctcagcgagaacgccgccccctcggacctagaaagt atcttcaaggacgccaagatcccggtgtcgggacccttcctggtgaagactggctacgcg ttcgtggactgcccggacgagagctgggccctcaaggccatcgaggcgctttcaggtaaa atagaactgcacgggaaacccatagaagttgagcactcggtcccaaaaaggcaaagctgt aaggaacagccaacaggatttgctgatggattagctgaggagtatcagaaaagtgaagaa ttgcaagattttttggccagagcaactgggtggaagatggtgccatttaccaaaatggag aatattgggagaggagcagattcttgggtagaaaataggaactttggttttgatatggtg tttctaatacttactagaaatttaagtgaagaagaaactaacagtagtttcttgcggagg aaacttcaagaaggtccacattttgaggaggaatttgattaa >gi568815591r:23405745_23621840|GENSCAN_predicted_peptide_3|267_aa MAQIQNLRALDAGGREGTVRAPCALPPSFFLGGWGWGRAVKGQEKRERAEGNTVPPLNLN KTNEGAKGYFRATKDGNPGAAPGPMNENQQTALAEAADGDRFPPSANRKRGEAPARSRGG VSLDAGCSTWTRQAGPGERGSPALEPLLRASLAARPADPQGSARDSPPWVPSIPGPYGTP RAHTQLRPEVREGRQTHAARCVPALAGGLRWGGAATARPHAPRIRTQPQEEPEARGPAAG PRGRAVQWSRPSEASRRPTKDPPHCEP >gi568815591r:23405745_23621840|GENSCAN_predicted_CDS_3|804_bp atggcgcaaatccagaacctccgggcgctggacgctgggggaagggaagggaccgtgcgg gcaccatgcgcgttacctccctctttcttccttgggggttggggctgggggcgagctgtt aaagggcaggaaaaacgagagcgagcagaggggaacacagtcccacccctcaacctcaac aaaaccaacgagggagcaaagggttattttcgcgccacaaaagatggaaatccaggggcg gctccggggccaatgaatgaaaaccagcaaaccgctctggcggaggcggccgacggcgat aggttccccccgtcagccaatcggaagcgaggagaggcccccgcccgcagcaggggcggt gtctctctcgacgccggttgctcaacttggactcggcaagccggccccggggagcgcggg agccccgccttggagccgcttttgcgcgccagtctcgcggcccgccccgcagacccgcag ggttcggcgagggattctcccccgtgggtaccaagcatccctgggccctatggaacccct cgtgcacacacgcaactgagaccagaggtgcgggagggtcgccaaacacacgctgcccgg tgcgtccccgcgctggccggcggactcaggtggggtggcgcggcgacggcccggccacac gctccgcgcatacggacgcagccccaggaggagccagaagcccgcggtcccgcggcaggg ccccggggaagggcggtccagtggtccaggccctccgaggcgagccggcggccgacgaaa gaccctccccactgtgagccgtga >gi568815591r:23405745_23621840|GENSCAN_predicted_peptide_4|267_aa MADDASAEGGGGRARAGGRVVWAPEAPGALGRGTDVTSAEGFSSGIRVRGHGRGWGRGRG RGRGWGRGQDRGVRGGKAQDKERMPVTKLDGLAKDMKIKSLEEIYLFSLPIKESEIIDFF LGASLKDEVLKIMPVRKETRAGQRTRFKAFVAIRDYNGYAGRVEVLQGGGRRHPRGHHPD QALHCHRAQRLLGEQDRQAPHRPLQGDRPLRLCAGALHPRAQGHWHRLSSCAQEAAHDSW YLRLLHLSQGLHCHPGQLRQRHLGCCL >gi568815591r:23405745_23621840|GENSCAN_predicted_CDS_4|804_bp atggcggatgacgccagtgcagagggcgggggcgggcgggcgcgggcgggcgggcgggtg gtgtgggcgccggaggccccgggggccctgggaaggggaactgatgtgacttccgctgag ggtttcagcagtggcatccgggtccggggtcacggccgtggatggggccggggccggggt cgcggccgtggatggggccggggccaagaccgcggagttcgcggaggcaaggcccaggat aaggagcggatgcccgtcaccaagctggacggcctggccaaggacatgaagatcaagtcc ctagaggagatttatctcttctcgctgcccatcaaggaatctgagatcattgactttttc ctgggggcctctctcaaggacgaggttttgaagattatgcctgtgcggaaggagacccgc gccggccagcgcaccaggttcaaggcgtttgttgccatcagggactacaatggctacgct ggtcgggttgaagtgctccaaggaggtggccgccgccatccgcggggccatcatcctgac caagctctccattgtcaccgtgcgcagaggctactgggggaacaagatcggcaagctcca caccgtcccttgcaaggtgacaggccgctgcggctctgtgctggtgcccttcatccccgc gcccaggggcactggcatcgtctcagctcctgtgcccaagaagctgctcatgatagctgg tatctacgactgctacacctcagccaggggctgcactgccaccctgggcaacttcgccaa cgccaccttggatgctgtctctaa >gi568815591r:23405745_23621840|GENSCAN_predicted_peptide_5|364_aa MRELETPPISNIAPHAGSSPKLPNPLGQYLKGLRSYKAHLATEKSSKRLTRPSSGERLVD MSDVEENNFEGRVSTKPREPSRRRPNARGFTGTSAESRSQSKSPTGTPARVKSESRSGSR SPSRVSKHSESHSRSRSKSRSRSRRHSHRRYTRSRSHSHSHRRRSRSRSYTPEYRRRRSR SHSPMSNRRRHTGSRANPDPNTCLGVFGLSLYTTERDLREVFSRYGPLSGVNVVYDQRTG RSRGFAFVYFERIDDSKEAMERANGMELDGRRIRVDYSITKRAHTPTPGIYMGRPTHSGG GGGGGGGGGGGGGGRRRDSYYDRGYDRGYDRYEDYDYRYRRRSPSPYYSRYRSRSRSRSY SPSM >gi568815591r:23405745_23621840|GENSCAN_predicted_CDS_5|1095_bp atgagggaactggagacacctcctatttctaatattgctcctcatgcaggatcctctcct aaactcccaaatccacttggacaatatttaaaggggctaaggtcatataaggctcatctg gcgactgaaaagtcttcaaagaggttgacgcgtccgagttctggggagcgcctcgtcgac atgagtgatgtggaggaaaacaacttcgagggcagagtgagtacaaagccgcgggagccg tctaggcggcggcccaatgctcggggtttcacgggaacaagcgcggagtctcgctctcag tcaaaatctccaacgggaactcctgctcgtgtaaaatcggagagcaggtcaggatctcgt agtccatcaagggtttccaaacactctgaatcccattctcgatcaagatcaaaatccagg tcgaggtcaaggagacattctcatagacgttacactcgatccagatcccactctcactct cataggagacgatctcgaagtagatcatatacaccagaataccggcggcgaaggagccga agccattctccaatgtctaaccggagaagacatactggcagcagggcaaatccagatccc aacacttgccttggagtgtttggcctcagtttgtacacaacagagagggatcttcgtgaa gtattttctcgatatggaccattgagtggtgtcaatgtggtttatgatcagcgaactggg cgatctcgaggatttgcttttgtgtattttgagagaatagatgactcaaaggaggctatg gaaagggcaaatggaatggagctggatggtagaagaattcgggtggattattctataacc aagagagcgcacacaccaacaccaggcatctacatgggcagaccaactcatagtggtggg ggtggtggaggaggcggcggcggtggaggtggaggtggtggcagacgtcgagattcttac tatgatagaggatatgatcgtgggtatgacagatatgaagactatgattaccgatacaga agacgatcaccttctccttattatagtcgatatagatcacgatcaagatctcgttcctac agcccaagtatgtga >gi568815591r:23405745_23621840|GENSCAN_predicted_peptide_6|230_aa MAEKNPPAIVPAQEPPIGQISTNELKMGEFELAIVAGEFTDSEIMVMMLGENGMGKTTFI RMLAGRLEPDEEGEVPVLNVSYKLQKISPKSTGSVRQLLREKIRDAYTHPQFVTNVMKPL QIENIIDQEVQTLSGGELQRVALALCLGKPADVYLIDEPSAYLDSEQRLMAARVVKRFIL HAKKTAFVVEHDFIMATYLADHVIIFDGVPSTKNTVANSPQTLLAGMNTF >gi568815591r:23405745_23621840|GENSCAN_predicted_CDS_6|693_bp atggctgaaaagaaccctccagcaatcgtccctgcacaggaacctccaattggacaaatc tccacgaatgaattaaaaatgggagagtttgagctagcaattgtagctggagagtttaca gattctgaaatcatggtgatgatgctaggggaaaatggaatgggtaaaacgacatttatc agaatgcttgctggaagacttgaacctgatgaagaaggagaagtaccagttctaaatgtc agttataagttacagaaaattagtcctaaatcaactggaagtgttcgccagttactacgt gaaaagataagagatgcttatacacatccacaatttgtgactaatgtaatgaagcctctg caaattgaaaacatcatcgatcaagaggtgcagacattatcgggtggtgaactacagcga gtagctttagccctttgtttgggcaaacctgctgatgtctatttaattgatgaaccatct gcatatttggattctgagcaaagactgatggcagctcgagttgtcaaacgtttcatactt catgcaaaaaagacagcctttgttgtggaacatgacttcatcatggccacctatctagcg gatcatgtcatcattttcgatggtgttccatctactaagaacacagttgcaaacagtcct caaacccttttggctggcatgaatacattttag >gi568815591r:23405745_23621840|GENSCAN_predicted_peptide_7|613_aa MESTGEDVVDTVEMTTKDLEYYVNFIDKAAVGFVWTDSNFERSSTNEEAAARASSRPAPT LGQNDAAARQPERAAGSSVLEDEPARGIPSALTRCRRWKKGSGVLFPTLRVFENANTGNR PTVKERRAGLKQSPRTGLEVPGSRVGGGGAALHRRGGAGTPAAVPPPSWTSCGTSARAWP PERCLILEGTIPQSEAAGGVTVNTIGAENKQRRSRSWSSSSDRTRRRRREDSYHVRRRCS RTFSRSSSQHSSRKAKSVEDDTEGHLIYHVGDWLQERYEIVSTLGKGTFGRVVQCVDHRR RGARVALKIIKNVEKYKEAARLEIKVLEKINEKDPGKNLCVQMFDWFDYHGHMCISLELL GLSTFDFLKDNNHLPYPIHQVHHMASQLCQAVKFLHDNKLTHTDLKPENILFVNSDYELT YNLEKKRHERSVKSTAVRVGDFGSATFDHEHHSTIVSTRHYRAPEVILELGWSQPCDVWS IGCIIFEYYVGFTLFQTHDNRQHLATMERILGPIPSRMIRKTRKQKYFYRGRLDWDENTS AGRYVRENCKPLRQYLTSEAEEDHQLFDLIESMLEYEPAQRLTLGEALQHPFFSRLWAEP PNKLWDSSQDISP >gi568815591r:23405745_23621840|GENSCAN_predicted_CDS_7|1842_bp atggaatctactggtgaagatgttgtggacactgttgaaatgacaacaaaggatttagaa tattatgtaaactttattgacaaggcagcagtaggatttgtgtggactgactccaatttt gaaagaagttctaccaatgaggaagcggcggcacgtgcttccagccgccctgcacctacc ctgggtcagaacgacgcagctgcgcggcaaccagagagggcggcggggtcttctgtgttg gaagatgagcccgccagaggcataccctcggccttaactcgctgcaggcgctggaagaag ggctctggcgtcttgttcccgaccctccgagtttttgaaaacgccaatactggaaacagg ccgacagttaaggagaggcgggcgggactgaagcagagcccgagaacggggctggaggtc ccagggtcccgggttggtgggggtggagcagcacttcatcgccgcgggggtgctgggact ccggccgcagtgccgccgccatcatggacttcctgtgggacaagcgcacgggcctggccg ccagaaagatgcctcatcctcgaaggtaccattcctcagagcgaggcagccgggggagtt actgtgaacactatcggagccgaaaacaagcaacgaagaagccgttcctggtcaagtagt agtgaccggacacgacggcgccggcgagaggacagctaccatgtccggaggaggtgcagc cggacatttagccgctcgtcttcgcagcacagcagccggaaagccaagagtgtagaggac gacactgagggccacctcatctaccatgtcggggactggctacaagagcgatatgaaatc gtcagcaccttaggaaaggggaccttcggccgagttgtacaatgtgttgaccatcgcagg cgtggggctcgagttgccctgaagatcattaagaatgtggagaagtataaggaagcagct cgacttgagatcaaagtgctggagaaaatcaacgagaaagaccctggcaagaacctctgt gtccagatgtttgactggttcgactaccatggccacatgtgtatctccttggagcttctg ggccttagcaccttcgatttcctcaaagacaacaaccacctgccctaccccatccaccaa gtgcaccacatggcctcccagctgtgccaggctgtcaagttcctccatgataacaagctg acacatacagacctcaagcctgaaaatattctgtttgtgaattcagactatgagctcacc tacaacctagagaagaagcgacatgagcgcagtgtgaagagcacagctgtgcgggtggga gactttggcagtgccacctttgaccatgagcaccatagcaccattgtctccactcgccat taccgagcaccagaagtcatccttgagttgggttggtcacagccttgtgatgtgtggagt ataggctgcatcatctttgagtactatgtgggcttcaccctcttccagacccatgacaac agacagcatctagccacgatggaaaggatcttgggtcctatcccttcccggatgatccga aagacaagaaaacagaaatatttttaccggggtcgcctggattgggatgagaacacatca gctggacgctatgttcgtgagaactgcaaaccgctgcggcagtatctgacctcagaggca gaggaagaccaccagctcttcgatctgattgaaagcatgctagagtatgaaccagctcag cggctgaccttgggtgaagcccttcagcatcctttcttctcccgcctttgggctgagcca cccaacaagttgtgggactccagtcaggatatcagtccgtga >gi568815591r:23405745_23621840|GENSCAN_predicted_peptide_8|228_aa XLEVLARAIRQEKEIKSIQLGKEEVQLSLFADDMIVYLENPIISAQNLLKVISNFSKVSG YNINVQKSQAFLYTNNTQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLN EIKEDTNKWKNIPCSWIGRINIVKMAILAKVIYRLNAIPIKLPMTFFTELEKTTLKFIWN QKRARIAKSVLSQKNKAGGITLPDFKLYYKATVTKTAWDWYQNRDTYQ >gi568815591r:23405745_23621840|GENSCAN_predicted_CDS_8|687_bp ntgttggaagttctggccagggcaatcaggcaggagaaagaaataaagagtattcaatta ggaaaagaggaagtccaattgtccctgtttgcagatgacatgattgtatatctagaaaac cccatcatctcagcccaaaatctccttaaggtgataagcaacttcagcaaagtctcagga tacaatatcaatgtgcaaaaatcacaagcattcttatacaccaataacacacaaacagag agccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacttagga atccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaac gaaataaaagaggatacaaacaaatggaagaacattccatgctcatggataggaagaatc aatatcgtgaaaatggccatactggccaaggtaatttatagattgaatgccatccccatc aagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaac caaaaaagagcccgcattgccaagtcagtcctaagccaaaagaacaaagctggaggcatc acactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatgggactgg taccaaaacagagatacataccaatag