GENSCAN 1.0	Date run: 3-Nov-116	Time: 15:48:34

Sequence gi568815576r:36411546_36626121 : 214576 bp : 49.97% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -    352    347    6                               1.05
 1.07 Term -  10122   9725  398  1  2    0   50   236 0.462   6.24
 1.06 Intr -  16311  16209  103  0  1   76   96    58 0.116   5.15
 1.05 Intr -  32474  32371  104  0  2  136   85   -11 0.027   3.29
 1.04 Intr -  46380  46292   89  2  2    8  105    27 0.140  -3.99
 1.03 Intr -  51149  50956  194  2  2   32  116    85 0.536   3.99
 1.02 Intr -  52558  52373  186  1  0   31  103    96 0.272   5.29
 1.01 Init -  53809  53615  195  1  0   84   47   195 0.473  13.26
 1.00 Prom -  54149  54110   40                              -6.86

 2.14 PlyA -  55524  55519    6                               1.05
 2.13 Term -  56372  56259  114  2  0  105   48   281 0.997  24.37
 2.12 Intr -  65311  65188  124  0  1   63   64   172 0.896  12.89
 2.11 Intr -  69292  69030  263  1  2  137   79   154 0.876  15.69
 2.10 Intr -  69659  69603   57  2  0  106   47    72 0.876   3.98
 2.09 Intr -  75444  75382   63  0  0   23  108    60 0.003   0.41
 2.08 Intr -  78722  78643   80  0  2   82   80    86 0.005   6.47
 2.07 Intr -  82258  82088  171  2  0  136   93   170 0.999  22.21
 2.06 Intr -  84663  84422  242  2  2   90  105   336 0.998  32.69
 2.05 Intr -  86611  86446  166  2  1   82   64   203 0.997  16.32
 2.04 Intr -  89862  89696  167  2  2   92   99   286 0.995  29.70
 2.03 Intr -  92822  92553  270  1  0   61   78   535 0.999  46.46
 2.02 Intr -  93221  92970  252  1  0   99   96   282 0.997  26.65
 2.01 Init -  94877  94351  527  2  2   72  110  1121 0.998 105.03
 2.00 Prom -  98513  98474   40                              -6.46

 3.19 PlyA -  99337  99332    6                               1.05
 3.18 Term -  99455  99442   14  0  2  103   40     2 0.555  -4.74
 3.17 Intr - 100241  99958  284  1  2   92   93   587 0.781  56.66
 3.16 Intr - 101057 100915  143  2  2   55   72   211 0.998  15.35
 3.15 Intr - 105062 104933  130  1  1   81   84   216 0.945  21.20
 3.14 Intr - 105245 105160   86  2  2   84   94   110 0.997   9.82
 3.13 Intr - 105886 105756  131  2  2   76   77    91 0.947   7.21
 3.12 Intr - 107365 107218  148  1  1   99   91   179 0.998  19.11
 3.11 Intr - 107992 107860  133  0  1   98   75   276 0.999  27.95
 3.10 Intr - 109143 109031  113  0  2   64   77    96 0.968   5.28
 3.09 Intr - 111736 111664   73  0  1   80   95    32 0.856   2.51
 3.08 Intr - 112435 112350   86  1  2   17   78    90 0.503  -0.48
 3.07 Intr - 113187 113051  137  1  2   62   96   174 0.998  15.89
 3.06 Intr - 114164 114119   46  2  1  121  108    21 0.998   5.38
 3.05 Intr - 114586 114454  133  0  1   63   83   137 0.102  11.35
 3.04 Intr - 117779 117661  119  2  2  -13  -21   161 0.021  -5.64
 3.03 Intr - 124854 124751  104  1  2   62  116    45 0.452   4.59
 3.02 Intr - 125215 125064  152  0  2   90   71   171 0.473  15.41
 3.01 Init - 126059 125734  326  2  2   89  -89   560 0.345  35.20
 3.00 Prom - 127387 127348   40                              -4.16

 4.06 PlyA - 128078 128073    6                               1.05
 4.05 Term - 153341 152806  536  0  2  124   43  1010 0.999  94.51
 4.04 Intr - 154948 154808  141  2  0  124   92   184 0.990  22.72
 4.03 Intr - 176003 175920   84  0  0  118   75    90 0.879  10.49
 4.02 Intr - 176534 176427  108  0  0   83   49    54 0.626   1.26
 4.01 Init - 177191 177125   67  2  1   80   94    23 0.811   3.34

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  78722  78463  260  0  2   82   42   233 0.991  13.81
S.002 Init - 114576 114454  123  0  0   70   83   120 0.869  10.11

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576r:36411546_36626121|GENSCAN_predicted_peptide_1|422_aa
MAVSHAGCAHGILLVILFDQFQDTDLTGDAAGSGLSAHGHSLGATLGPMTVDGVGRHLLE
AFSHQCLPLPGARASSTPTPAAWGKLLQEAKVVLAKLRKSRMCGLWAGNGPENNVGAVSM
EGRMLATVFTILSPSCTISKLGLMTVPANEVTAQQVEARWHRLEVPEVLSVWLSPFPYPQ
PHLKQETPGFPRKALRDQLQLPELTEKVQGQPKEGTGLRSAGSSKLYHLFLDNPGYSLYL
KILNKILSAKFLLPCKVAYAYFIHEEMLPERGPDPDPKRGFLDLRQEIIQGVSHCAWPLA
DALKSINNAKKRGKRQVLIRPCSKVIVRFLTVMIKHDYTGEFEIIDGHRAGKIVVTVTGR
LNKCGVISPRFDVQLKDLEKWQNNLLPSRQFGFIVLTTSAGIMDHEEARQNHTGGKILGF
FF

>gi568815576r:36411546_36626121|GENSCAN_predicted_CDS_1|1269_bp
atggccgtgagtcatgctggctgtgctcacggaatcctcctggtgatcctgtttgaccaa
ttccaggacactgacctcaccggggatgccgctggctctgggctgtcagcccacggtcat
tctctgggtgccacattgggccccatgactgtggatggggttggcagacacctgctggaa
gcctttagccaccagtgcctgcccctaccaggggccagggccagctcaacccccaccccg
gccgcctggggaaagctgctccaggaagcgaaggtagtgctggccaagctgcggaaatcc
cgaatgtgtgggctctgggcagggaacggcccagaaaacaatgttggggccgtgtccatg
gagggcaggatgctggcgacggttttcaccattttgagccccagttgcaccatctccaag
ttggggctcatgacagtccctgcaaatgaggtcacggctcagcaggtggaagcacgctgg
caccgattggaagttccagaagtgttatctgtgtggctttctcctttcccctacccccag
ccacatcttaaacaggaaacaccgggcttcccaaggaaggccctcagggaccagctccaa
ctgcctgagttgacagagaaggtccagggccagccaaaggaagggacaggcctgaggtca
gctgggtcttcgaaattatatcacctcttcctagataatccaggatactctctttatctc
aagattcttaacaaaatcctctcagcaaagttccttttgccatgtaaggtcgcttacgcc
tattttatacatgaagaaatgttaccagaaaggggtcctgatccagaccccaagagagga
ttcttggatctcaggcaagaaataattcagggcgtgagccactgcgcctggcccctggct
gatgccctcaagagcatcaacaatgccaaaaagagaggcaaacgccaggtgcttattagg
ccatgctccaaagtcatcgtccgatttctcactgtgatgataaagcatgattacactggc
gaatttgaaatcattgatggtcacagagctgggaaaattgttgtgactgtcacaggcagg
ctcaacaagtgtggagtgatcagccccagatttgatgtgcaactcaaagatctagaaaaa
tggcagaataatctgcttccatcccgccagtttggtttcattgtactgacaacctcagct
ggcatcatggaccatgaagaagcaagacaaaaccacacaggagggaaaatcctgggattc
tttttctag

>gi568815576r:36411546_36626121|GENSCAN_predicted_peptide_2|831_aa
MGLSAAAPLWGPPGLLLAIALHPALSVPPRRDYCVLGAGPAGLQMAYFLQRAGRDYAVFE
RAPRPGSFFTRYPRHRKLISINKRYTGKANAEFNLRHDWNSLLSHDPRLLFRHYSRAYFP
DARDMVRYLGDFADTLGLRVQYNTTIAHVTLDKDRQAWNGHYFILTDQKGQVHQCSVLFV
ATGLSVPNQVDFPGSEYAEGYESVSVDPEDFVGQNVLILGRGNSAFETAENILGVTNFIH
MLSRSRVRLSWATHYVGDLRAINNGLLDTYQLKSLDGLLESDLTDLAILKDSKGKFHVTP
KFFLEEANTNQSADSITLPQDDNDNFAMRVPYDRVIRCLGWNFDFSIFNKSLRLNSGNAF
GKKYPLIRASYESKGSRGLFILGTASHSVDYRKSAGGFIHGFRYTVRAVHRLLEHRHHSV
TWPATELPITQLTSSIVRRVNEASGLYQMFGVLADVILLKENSTAFEYLEEFPIQMLAQL
ETLTGRKAKHGLFVINMEYGRNFSGPDKDVFFDDRSVGHTEDAWQSNFLHPVIYYYRYLP
TEQEVRFRPAHWPLPRPTAIHHIVEDFLTDWTAPIGHILPLRRFLENCLDTDLRSFYAES
CFLFALTRQKLPPFCQQGYLRMQGLQGRKKRAGDLGERILMYHSDQCGQYNSNFLEVAVG
RPIMCMAQRLLLRRFLASVISRKPSQGQWPPLTSRALQTPQCSPGGLTVTPNPARTIYTT
RISLTTFNIQDGPDFQDRVVNSETPVVVDFHAQWCGPCKILGPRLEKMVAKQHGKVVMAK
VDIDDHTDLAIEYEVSAVPTVLAMKNGDVVDKFVGIKDEDQLEAFLKKLIG

>gi568815576r:36411546_36626121|GENSCAN_predicted_CDS_2|2496_bp
atgggcctctccgctgcggccccgttgtggggtcccccggggctgctcctggccatcgcc
ctgcacccagcgctgtcggtgcccccgcgccgggactactgcgtgctgggcgctgggccc
gcgggcctgcagatggcctacttcctgcagcgcgctggacgcgactacgcagtgttcgag
cgggccccgcggcccggcagcttcttcacacgctacccgcggcaccgcaagctcatcagc
atcaacaagcggtacacgggcaaggctaacgccgagttcaacctccgccacgactggaac
tctctgctcagccacgacccccggctgctcttcagacactactcgcgtgcctacttcccc
gacgcccgcgacatggtgcgctacctgggtgacttcgcggacacgctggggctccgtgtc
cagtacaacaccaccatcgcccacgtcactctggacaaggaccgacaggcctggaatggc
cactacttcatcctaactgaccagaagggccaggtgcatcagtgcagcgtcctctttgta
gccactggtttatcagtccccaaccaggttgacttccctggctccgaatatgcagagggt
tacgagtccgtgtccgtggaccctgaggactttgtaggccagaatgtgctgatcctgggt
cgtgggaactcggcctttgagacagcagagaacatcttgggtgtcacaaactttatccat
atgctcagccgctcccgggtccgtctgtcctgggccacccactacgttggagacctcaga
gccatcaacaatggcctgctggatacctaccagctcaagtccctggacgggctgctcgag
tctgacctgacggatctggccatcctgaaggacagcaaaggcaagttccatgtcaccccg
aaattcttcctggaagaagccaacaccaaccagagtgccgactccatcaccctcccccag
gacgacaatgacaactttgccatgcgcgtgccctatgaccgggtaatccgctgcctgggc
tggaactttgacttctccattttcaataagtccctcagacttaactcgggaaatgcattc
ggcaagaagtacccgctgattcgagctagctacgaatccaaaggaagccggggtctgttt
atcctgggtactgccagccactcggtggactaccggaaatctgctgggggcttcatccac
ggattccgatacacagtgcgtgctgttcaccggctcctggagcaccgccaccacagcgtc
acctggcccgccactgagctccccatcacacagctgaccagctccatcgtgcggcgcgtg
aatgaggcttctgggctctaccagatgttcggtgtgctggccgatgtcatcctgttgaag
gagaattccacggcctttgagtacctggaggagttccccatacagatgctggcccagctg
gagacactcacagggaggaaggcaaagcacgggctcttcgtcatcaacatggaatatggc
agaaatttctctggccccgacaaggacgtcttctttgatgaccggtctgtggggcacaca
gaagatgcctggcagtctaactttcttcatcctgtcatctactactatagatacctcccc
accgaacaggaggtgaggttccgccctgcacactggcccctgcctcggcccacggccatc
catcacatcgtggaagacttcttaacagactggactgccccgatcgggcacatcctacct
ctgaggcgcttcctggagaactgtttggacaccgatttgcgaagcttctatgcagagtcc
tgcttcctgttcgccctcacgcgccagaagttgccacccttttgccagcaggggtacctg
aggatgcagggactccaaggcagaaagaaacgggctggcgacctgggggaaaggatcctc
atgtaccacagcgaccaatgtgggcagtacaatagcaacttcctggaagttgctgtagga
agaccaataatgtgtatggctcagcgacttcttctgaggaggttcctggcctctgtcatc
tccaggaagccctctcagggtcagtggccacccctcacttccagagccctgcagacccca
caatgcagtcctggtggcctgactgtaacacccaacccagcccggacaatatacaccacg
aggatctccttgacaacctttaatatccaggatggacctgactttcaagaccgagtggtc
aacagtgagacaccagtggttgtggatttccacgcacagtggtgtggaccctgcaagatc
ctggggccgaggttagagaagatggtggccaagcagcacgggaaggtggtgatggccaag
gtggatattgatgaccacacagacctcgccattgagtatgaggtgtcagcggtgcccact
gtgctggccatgaagaatggggacgtggtggacaagtttgtgggcatcaaggatgaggat
cagttggaggccttcctgaagaagctgattggctga

>gi568815576r:36411546_36626121|GENSCAN_predicted_peptide_3|785_aa
MKCILVAIEGTEALFYWTDEEFEESLQLKFGQSENEEEELPALQDQLSPLLAPVIISSMT
MLEKLSDTYTCFSMENSNSLYVLHLFGECLFIAINGDHTKSEGDLQRKLLRPPDLGQRVQ
LWEHFQSLLWTYSRLREQEQCFAMEVITGRRSPQDLELSEIFEDSSFNPPLLLPQNYIEV
EQEGKAVEKNTALKANEDAWRNAVTDFRVDLRFTAREFFRPTQRKRESSLRQRRFWKMAK
FMTPVIQDNPSGWGPCAVPEQFRDMPYQPFSKGDRLGKVADWTGATYQDKRYTNKYSSQF
GGGSQYAYFHEEDESSFQLVDTARTQKTAYQRNRMRFAQRNLRRDKDRRNMLQFNLQILP
KSAKQKERERIRLQKKFQKQFGVRQKWDQKSQKPRDSSVEVRSDWEVKEEMDFPQLMKMR
YLEVSEPQDIECCGALEYYDKAFDRITTRSEKPLRSIKRIFHTVTTTDDPVIRKLAKTQG
NVFATDAILATLMSCTRSVYSWDIVVQRVGSKLFFDKRDNSDFDLLTVSETANEPPQDEG
NSFNSPRNLAMEATYINHNFSQQCLRMGKERYNFPNPNPFVEDDMDKNEIASVAYRYRRW
KLGDDIDLIVRCEHDGVMTGANGEVSFINIKTLNEWDSRHCNGVDWRQKLDSQRGAVIAT
ELKNNSYKLARWTCCALLAGSEYLKLGYVSRYHVKDSSRHVILGTQQFKPNEFASQINLS
VENAWGILRCVIDICMKLEEGKYLILKDPNKQVIRVYSLPDGTFSSDEDEEEEEEEEEEE
EEEET

>gi568815576r:36411546_36626121|GENSCAN_predicted_CDS_3|2358_bp
atgaagtgcatcttggtggccattgagggcacagaggccctcttctactggacggatgag
gagtttgaagagagtctccagctgaagttcgggcagtcagagaatgaggaagaagagctc
cctgccctacaggaccagctcagccccctcctagccccggtcatcatctcctccatgacg
atgctggagaagctctcggacacctacacctgcttctccatggaaaacagcaactccctg
tatgtccttcacctgtttggagaatgcctgttcattgccatcaatggcgaccacaccaag
agcgagggggacctgcagcggaagctgctgcggcccccagacctggggcagcgtgtccag
ctgtgggagcactttcagagcctgctgtggacctatagccgcctgcgggagcaggaacag
tgcttcgccatggaggtgattactgggcgcaggagtccgcaggacttggagctgtcagaa
atctttgaagacagttccttcaaccctcccctcctcctcccacaaaattatattgaggtg
gaacaggaggggaaagcagtggagaagaacacagcgctaaaggctaatgaggacgcctgg
cgaaacgcagtaacggatttccgggtggaccttcgctttacggctcgtgagttcttccgc
ccaacccagaggaagcgggagagcagtttacgacagcgccgattttggaagatggcaaag
ttcatgacacccgtgatccaggacaacccctcaggctggggtccctgtgcggttcccgag
cagtttcgggatatgccctaccagccgttcagcaaaggagatcggctaggaaaggttgca
gactggacaggagccacataccaagataagaggtacacaaataagtactcctctcagttt
ggtggtggaagtcaatatgcttatttccatgaggaggatgaaagtagcttccagctggtg
gatacagcgcgcacacagaagacggcctaccagcggaatcgaatgagatttgcccagagg
aacctccgcagagacaaagatcgtcggaacatgttgcagttcaacctgcagatcctgcct
aagagtgccaaacagaaagagagagaacgcattcgactgcagaaaaagttccagaaacaa
tttggggttaggcagaaatgggatcagaaatcacagaaaccccgagactcttcagttgaa
gttcgtagtgattgggaagtgaaagaggaaatggattttcctcagttgatgaagatgcgc
tacttggaagtatcagagccacaggacattgagtgttgtggggccctagaatactacgac
aaagcctttgaccgcatcaccacgaggagtgagaagccactgcggagcatcaagcgcatc
ttccacactgtcaccaccacagacgaccctgtcatccgcaagctggcaaaaactcagggg
aatgtgtttgccactgatgccatcctggccacgctgatgagctgtacccgctcagtgtat
tcctgggatattgtcgtccagagagttgggtccaaactcttctttgacaagagagacaac
tctgactttgacctcctgacagtgagtgagactgccaatgagccccctcaagatgaaggt
aattccttcaattcaccccgcaacctggccatggaggcaacctacatcaaccacaatttc
tcccagcagtgcttgagaatggggaaggaaagatacaacttccccaacccaaacccgttt
gtggaggacgacatggataagaatgaaatcgcctctgttgcgtaccgttaccgcaggtgg
aagcttggagatgatattgaccttattgtccgttgtgagcacgatggcgtcatgactgga
gccaacggggaagtgtccttcatcaacatcaagacactcaatgagtgggattccaggcac
tgtaatggcgttgactggcgtcagaagctggactctcagcgaggggctgtcattgccacg
gagctgaagaacaacagctacaagttggcccggtggacctgctgtgctttgctggctgga
tctgagtacctcaagcttggttatgtgtctcggtaccacgtgaaagactcctcacgccac
gtcatcctaggcacccagcagttcaagcctaatgagtttgccagccagatcaacctgagc
gtggagaatgcctggggcattttacgctgcgtcattgacatctgcatgaagctggaggag
ggcaaatacctcatcctcaaggaccccaacaagcaggtcatccgtgtctacagcctccct
gatggcaccttcagctctgatgaagatgaggaggaagaggaggaggaagaagaggaagaa
gaagaggaagaaacttaa

>gi568815576r:36411546_36626121|GENSCAN_predicted_peptide_4|311_aa
MHQLSEPPRQSAREEFLLSPLSEQLAAGFGEFRAELQMGKCFAGSSKGLWQHLAAPVRGN
FKGLCKQIDHFPEDADYEADTAEYFLRAVRASSIFPILSVILLFMGGLCIAASEFYKTRH
NIILSAGIFFVSAGLSNIIGIIVYISANAGDPSKSDSKKNSYSYGWSFYFGALSFIIAEM
VGVLAVHMFIDRHKQLRATARATDYLQASAITRIPSYRYRYQRRSRSSSRSTEPSHSRDA
SPVGIKGFNTLPSTEISMYTLSRDPLKAATTPTATYNSDRDNSFLQVHNCIQKENKDSLH
SNTANRRTTPV

>gi568815576r:36411546_36626121|GENSCAN_predicted_CDS_4|936_bp
atgcatcagctcagtgaacctccacgacaatctgccagggaagaattcttgttatcgcca
ctctcagagcagttggcggctgggtttggtgaattccgtgcggaattgcagatgggaaag
tgttttgcaggctcttctaaagggctgtggcagcatctggcagcaccagtgcgggggaat
ttcaaaggtctgtgcaagcaaattgatcacttcccagaggatgcagattacgaagctgac
acagcagaatatttcctccgggccgtgagggcctccagcattttcccaatcctgagtgtg
attctgcttttcatgggtggcctctgcatcgcagccagcgagttctacaaaactcgacac
aacatcatcctgagtgccggcatcttcttcgtgtctgcaggtctgagtaacatcattggc
atcatagtgtacatatctgccaatgccggagacccctccaagagcgactccaaaaagaat
agttactcatacggctggtccttctacttcggggccctgtccttcatcatcgccgagatg
gtcggggtgctggcggtgcacatgtttatcgaccggcacaaacagctgcgggccacggcc
cgcgccacggactacctccaggcctctgccatcacccgcatccccagctaccgctaccgc
taccagcgccgcagccgctccagctcgcgctccacggagccctcacactccagggacgcc
tcccccgtgggcatcaagggcttcaacaccctgccgtccacggagatctccatgtacacg
ctcagcagggaccccctgaaggccgccaccacgcccaccgccacctacaactccgacagg
gataacagcttcctccaggttcacaactgtatccagaaggagaacaaggactctctccac
tccaacacagccaaccgccggaccacccccgtataa