GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:09:30 Sequence gi568815576r:36390011_36606422 : 216412 bp : 50.11% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 20017 20149 133 0 1 74 94 45 0.156 3.92 1.02 Term + 20286 20398 113 1 2 27 37 132 0.199 0.72 1.03 PlyA + 22278 22283 6 1.05 2.08 PlyA - 24575 24570 6 1.05 2.07 Term - 31657 31260 398 2 2 0 50 236 0.462 6.24 2.06 Intr - 37846 37744 103 1 1 76 96 58 0.116 5.15 2.05 Intr - 54009 53906 104 1 2 136 85 -11 0.027 3.29 2.04 Intr - 67915 67827 89 0 2 8 105 27 0.140 -3.99 2.03 Intr - 72684 72491 194 0 2 32 116 85 0.536 3.99 2.02 Intr - 74093 73908 186 2 0 31 103 96 0.272 5.29 2.01 Init - 75344 75150 195 2 0 84 47 195 0.473 13.26 2.00 Prom - 75684 75645 40 -6.86 3.14 PlyA - 77059 77054 6 1.05 3.13 Term - 77907 77794 114 0 0 105 48 281 0.997 24.37 3.12 Intr - 86846 86723 124 1 1 63 64 172 0.896 12.89 3.11 Intr - 90827 90565 263 2 2 137 79 154 0.876 15.69 3.10 Intr - 91194 91138 57 0 0 106 47 72 0.876 3.98 3.09 Intr - 96979 96917 63 1 0 23 108 60 0.003 0.41 3.08 Intr - 100257 100178 80 1 2 82 80 86 0.005 6.47 3.07 Intr - 103793 103623 171 0 0 136 93 170 0.999 22.21 3.06 Intr - 106198 105957 242 0 2 90 105 336 0.998 32.69 3.05 Intr - 108146 107981 166 0 1 82 64 203 0.997 16.32 3.04 Intr - 111397 111231 167 0 2 92 99 286 0.995 29.70 3.03 Intr - 114357 114088 270 2 0 61 78 535 0.999 46.46 3.02 Intr - 114756 114505 252 2 0 99 96 282 0.997 26.65 3.01 Init - 116412 115886 527 0 2 72 110 1121 0.998 105.03 3.00 Prom - 120048 120009 40 -6.46 4.19 PlyA - 120872 120867 6 1.05 4.18 Term - 120990 120977 14 1 2 103 40 2 0.555 -4.74 4.17 Intr - 121776 121493 284 2 2 92 93 587 0.781 56.66 4.16 Intr - 122592 122450 143 0 2 55 72 211 0.998 15.35 4.15 Intr - 126597 126468 130 2 1 81 84 216 0.945 21.20 4.14 Intr - 126780 126695 86 0 2 84 94 110 0.997 9.82 4.13 Intr - 127421 127291 131 0 2 76 77 91 0.947 7.21 4.12 Intr - 128900 128753 148 2 1 99 91 179 0.998 19.11 4.11 Intr - 129527 129395 133 1 1 98 75 276 0.999 27.95 4.10 Intr - 130678 130566 113 1 2 64 77 96 0.968 5.28 4.09 Intr - 133271 133199 73 1 1 80 95 32 0.856 2.51 4.08 Intr - 133970 133885 86 2 2 17 78 90 0.503 -0.48 4.07 Intr - 134722 134586 137 2 2 62 96 174 0.998 15.89 4.06 Intr - 135699 135654 46 0 1 121 108 21 0.998 5.38 4.05 Intr - 136121 135989 133 1 1 63 83 137 0.102 11.35 4.04 Intr - 139314 139196 119 0 2 -13 -21 161 0.021 -5.64 4.03 Intr - 146389 146286 104 2 2 62 116 45 0.452 4.59 4.02 Intr - 146750 146599 152 1 2 90 71 171 0.473 15.41 4.01 Init - 147594 147269 326 0 2 89 -89 560 0.345 35.20 4.00 Prom - 148922 148883 40 -4.16 5.06 PlyA - 149613 149608 6 1.05 5.05 Term - 174876 174341 536 1 2 124 43 1010 0.999 94.51 5.04 Intr - 176483 176343 141 0 0 124 92 184 0.990 22.72 5.03 Intr - 197538 197455 84 1 0 118 75 90 0.888 10.49 5.02 Intr - 198069 197962 108 1 0 83 49 54 0.631 1.26 5.01 Init - 198726 198660 67 0 1 80 94 23 0.808 3.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 6187 6035 153 1 0 63 43 127 0.928 3.62 S.002 Init - 8567 8514 54 2 0 79 86 51 0.911 3.28 S.003 Term - 100257 99998 260 1 2 82 42 233 0.991 13.81 S.004 Init - 136111 135989 123 1 0 70 83 120 0.869 10.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:36390011_36606422|GENSCAN_predicted_peptide_1|81_aa VTYTGGAAESAAAKFQAPAQPHSPQPGSPSTLPARKSCALHPRTGCSGSSATFLLWQEEL KFQKPEKQAAVKTAEARPTAL >gi568815576r:36390011_36606422|GENSCAN_predicted_CDS_1|246_bp gttacatacacgggcggtgcagcggagagcgcggcggcgaagtttcaggcccctgcccag ccccactcgccgcagccaggtagcccctccacccttcctgcccggaagtcctgcgcgctc cacccaaggacaggatgctctgggagctctgccacgtttctgctgtggcaggaagaactg aaattccagaagccggagaaacaagccgccgtcaagactgctgaagcgaggcctacggcc ctgtaa >gi568815576r:36390011_36606422|GENSCAN_predicted_peptide_2|422_aa MAVSHAGCAHGILLVILFDQFQDTDLTGDAAGSGLSAHGHSLGATLGPMTVDGVGRHLLE AFSHQCLPLPGARASSTPTPAAWGKLLQEAKVVLAKLRKSRMCGLWAGNGPENNVGAVSM EGRMLATVFTILSPSCTISKLGLMTVPANEVTAQQVEARWHRLEVPEVLSVWLSPFPYPQ PHLKQETPGFPRKALRDQLQLPELTEKVQGQPKEGTGLRSAGSSKLYHLFLDNPGYSLYL KILNKILSAKFLLPCKVAYAYFIHEEMLPERGPDPDPKRGFLDLRQEIIQGVSHCAWPLA DALKSINNAKKRGKRQVLIRPCSKVIVRFLTVMIKHDYTGEFEIIDGHRAGKIVVTVTGR LNKCGVISPRFDVQLKDLEKWQNNLLPSRQFGFIVLTTSAGIMDHEEARQNHTGGKILGF FF >gi568815576r:36390011_36606422|GENSCAN_predicted_CDS_2|1269_bp atggccgtgagtcatgctggctgtgctcacggaatcctcctggtgatcctgtttgaccaa ttccaggacactgacctcaccggggatgccgctggctctgggctgtcagcccacggtcat tctctgggtgccacattgggccccatgactgtggatggggttggcagacacctgctggaa gcctttagccaccagtgcctgcccctaccaggggccagggccagctcaacccccaccccg gccgcctggggaaagctgctccaggaagcgaaggtagtgctggccaagctgcggaaatcc cgaatgtgtgggctctgggcagggaacggcccagaaaacaatgttggggccgtgtccatg gagggcaggatgctggcgacggttttcaccattttgagccccagttgcaccatctccaag ttggggctcatgacagtccctgcaaatgaggtcacggctcagcaggtggaagcacgctgg caccgattggaagttccagaagtgttatctgtgtggctttctcctttcccctacccccag ccacatcttaaacaggaaacaccgggcttcccaaggaaggccctcagggaccagctccaa ctgcctgagttgacagagaaggtccagggccagccaaaggaagggacaggcctgaggtca gctgggtcttcgaaattatatcacctcttcctagataatccaggatactctctttatctc aagattcttaacaaaatcctctcagcaaagttccttttgccatgtaaggtcgcttacgcc tattttatacatgaagaaatgttaccagaaaggggtcctgatccagaccccaagagagga ttcttggatctcaggcaagaaataattcagggcgtgagccactgcgcctggcccctggct gatgccctcaagagcatcaacaatgccaaaaagagaggcaaacgccaggtgcttattagg ccatgctccaaagtcatcgtccgatttctcactgtgatgataaagcatgattacactggc gaatttgaaatcattgatggtcacagagctgggaaaattgttgtgactgtcacaggcagg ctcaacaagtgtggagtgatcagccccagatttgatgtgcaactcaaagatctagaaaaa tggcagaataatctgcttccatcccgccagtttggtttcattgtactgacaacctcagct ggcatcatggaccatgaagaagcaagacaaaaccacacaggagggaaaatcctgggattc tttttctag >gi568815576r:36390011_36606422|GENSCAN_predicted_peptide_3|831_aa MGLSAAAPLWGPPGLLLAIALHPALSVPPRRDYCVLGAGPAGLQMAYFLQRAGRDYAVFE RAPRPGSFFTRYPRHRKLISINKRYTGKANAEFNLRHDWNSLLSHDPRLLFRHYSRAYFP DARDMVRYLGDFADTLGLRVQYNTTIAHVTLDKDRQAWNGHYFILTDQKGQVHQCSVLFV ATGLSVPNQVDFPGSEYAEGYESVSVDPEDFVGQNVLILGRGNSAFETAENILGVTNFIH MLSRSRVRLSWATHYVGDLRAINNGLLDTYQLKSLDGLLESDLTDLAILKDSKGKFHVTP KFFLEEANTNQSADSITLPQDDNDNFAMRVPYDRVIRCLGWNFDFSIFNKSLRLNSGNAF GKKYPLIRASYESKGSRGLFILGTASHSVDYRKSAGGFIHGFRYTVRAVHRLLEHRHHSV TWPATELPITQLTSSIVRRVNEASGLYQMFGVLADVILLKENSTAFEYLEEFPIQMLAQL ETLTGRKAKHGLFVINMEYGRNFSGPDKDVFFDDRSVGHTEDAWQSNFLHPVIYYYRYLP TEQEVRFRPAHWPLPRPTAIHHIVEDFLTDWTAPIGHILPLRRFLENCLDTDLRSFYAES CFLFALTRQKLPPFCQQGYLRMQGLQGRKKRAGDLGERILMYHSDQCGQYNSNFLEVAVG RPIMCMAQRLLLRRFLASVISRKPSQGQWPPLTSRALQTPQCSPGGLTVTPNPARTIYTT RISLTTFNIQDGPDFQDRVVNSETPVVVDFHAQWCGPCKILGPRLEKMVAKQHGKVVMAK VDIDDHTDLAIEYEVSAVPTVLAMKNGDVVDKFVGIKDEDQLEAFLKKLIG >gi568815576r:36390011_36606422|GENSCAN_predicted_CDS_3|2496_bp atgggcctctccgctgcggccccgttgtggggtcccccggggctgctcctggccatcgcc ctgcacccagcgctgtcggtgcccccgcgccgggactactgcgtgctgggcgctgggccc gcgggcctgcagatggcctacttcctgcagcgcgctggacgcgactacgcagtgttcgag cgggccccgcggcccggcagcttcttcacacgctacccgcggcaccgcaagctcatcagc atcaacaagcggtacacgggcaaggctaacgccgagttcaacctccgccacgactggaac tctctgctcagccacgacccccggctgctcttcagacactactcgcgtgcctacttcccc gacgcccgcgacatggtgcgctacctgggtgacttcgcggacacgctggggctccgtgtc cagtacaacaccaccatcgcccacgtcactctggacaaggaccgacaggcctggaatggc cactacttcatcctaactgaccagaagggccaggtgcatcagtgcagcgtcctctttgta gccactggtttatcagtccccaaccaggttgacttccctggctccgaatatgcagagggt tacgagtccgtgtccgtggaccctgaggactttgtaggccagaatgtgctgatcctgggt cgtgggaactcggcctttgagacagcagagaacatcttgggtgtcacaaactttatccat atgctcagccgctcccgggtccgtctgtcctgggccacccactacgttggagacctcaga gccatcaacaatggcctgctggatacctaccagctcaagtccctggacgggctgctcgag tctgacctgacggatctggccatcctgaaggacagcaaaggcaagttccatgtcaccccg aaattcttcctggaagaagccaacaccaaccagagtgccgactccatcaccctcccccag gacgacaatgacaactttgccatgcgcgtgccctatgaccgggtaatccgctgcctgggc tggaactttgacttctccattttcaataagtccctcagacttaactcgggaaatgcattc ggcaagaagtacccgctgattcgagctagctacgaatccaaaggaagccggggtctgttt atcctgggtactgccagccactcggtggactaccggaaatctgctgggggcttcatccac ggattccgatacacagtgcgtgctgttcaccggctcctggagcaccgccaccacagcgtc acctggcccgccactgagctccccatcacacagctgaccagctccatcgtgcggcgcgtg aatgaggcttctgggctctaccagatgttcggtgtgctggccgatgtcatcctgttgaag gagaattccacggcctttgagtacctggaggagttccccatacagatgctggcccagctg gagacactcacagggaggaaggcaaagcacgggctcttcgtcatcaacatggaatatggc agaaatttctctggccccgacaaggacgtcttctttgatgaccggtctgtggggcacaca gaagatgcctggcagtctaactttcttcatcctgtcatctactactatagatacctcccc accgaacaggaggtgaggttccgccctgcacactggcccctgcctcggcccacggccatc catcacatcgtggaagacttcttaacagactggactgccccgatcgggcacatcctacct ctgaggcgcttcctggagaactgtttggacaccgatttgcgaagcttctatgcagagtcc tgcttcctgttcgccctcacgcgccagaagttgccacccttttgccagcaggggtacctg aggatgcagggactccaaggcagaaagaaacgggctggcgacctgggggaaaggatcctc atgtaccacagcgaccaatgtgggcagtacaatagcaacttcctggaagttgctgtagga agaccaataatgtgtatggctcagcgacttcttctgaggaggttcctggcctctgtcatc tccaggaagccctctcagggtcagtggccacccctcacttccagagccctgcagacccca caatgcagtcctggtggcctgactgtaacacccaacccagcccggacaatatacaccacg aggatctccttgacaacctttaatatccaggatggacctgactttcaagaccgagtggtc aacagtgagacaccagtggttgtggatttccacgcacagtggtgtggaccctgcaagatc ctggggccgaggttagagaagatggtggccaagcagcacgggaaggtggtgatggccaag gtggatattgatgaccacacagacctcgccattgagtatgaggtgtcagcggtgcccact gtgctggccatgaagaatggggacgtggtggacaagtttgtgggcatcaaggatgaggat cagttggaggccttcctgaagaagctgattggctga >gi568815576r:36390011_36606422|GENSCAN_predicted_peptide_4|785_aa MKCILVAIEGTEALFYWTDEEFEESLQLKFGQSENEEEELPALQDQLSPLLAPVIISSMT MLEKLSDTYTCFSMENSNSLYVLHLFGECLFIAINGDHTKSEGDLQRKLLRPPDLGQRVQ LWEHFQSLLWTYSRLREQEQCFAMEVITGRRSPQDLELSEIFEDSSFNPPLLLPQNYIEV EQEGKAVEKNTALKANEDAWRNAVTDFRVDLRFTAREFFRPTQRKRESSLRQRRFWKMAK FMTPVIQDNPSGWGPCAVPEQFRDMPYQPFSKGDRLGKVADWTGATYQDKRYTNKYSSQF GGGSQYAYFHEEDESSFQLVDTARTQKTAYQRNRMRFAQRNLRRDKDRRNMLQFNLQILP KSAKQKERERIRLQKKFQKQFGVRQKWDQKSQKPRDSSVEVRSDWEVKEEMDFPQLMKMR YLEVSEPQDIECCGALEYYDKAFDRITTRSEKPLRSIKRIFHTVTTTDDPVIRKLAKTQG NVFATDAILATLMSCTRSVYSWDIVVQRVGSKLFFDKRDNSDFDLLTVSETANEPPQDEG NSFNSPRNLAMEATYINHNFSQQCLRMGKERYNFPNPNPFVEDDMDKNEIASVAYRYRRW KLGDDIDLIVRCEHDGVMTGANGEVSFINIKTLNEWDSRHCNGVDWRQKLDSQRGAVIAT ELKNNSYKLARWTCCALLAGSEYLKLGYVSRYHVKDSSRHVILGTQQFKPNEFASQINLS VENAWGILRCVIDICMKLEEGKYLILKDPNKQVIRVYSLPDGTFSSDEDEEEEEEEEEEE EEEET >gi568815576r:36390011_36606422|GENSCAN_predicted_CDS_4|2358_bp atgaagtgcatcttggtggccattgagggcacagaggccctcttctactggacggatgag gagtttgaagagagtctccagctgaagttcgggcagtcagagaatgaggaagaagagctc cctgccctacaggaccagctcagccccctcctagccccggtcatcatctcctccatgacg atgctggagaagctctcggacacctacacctgcttctccatggaaaacagcaactccctg tatgtccttcacctgtttggagaatgcctgttcattgccatcaatggcgaccacaccaag agcgagggggacctgcagcggaagctgctgcggcccccagacctggggcagcgtgtccag ctgtgggagcactttcagagcctgctgtggacctatagccgcctgcgggagcaggaacag tgcttcgccatggaggtgattactgggcgcaggagtccgcaggacttggagctgtcagaa atctttgaagacagttccttcaaccctcccctcctcctcccacaaaattatattgaggtg gaacaggaggggaaagcagtggagaagaacacagcgctaaaggctaatgaggacgcctgg cgaaacgcagtaacggatttccgggtggaccttcgctttacggctcgtgagttcttccgc ccaacccagaggaagcgggagagcagtttacgacagcgccgattttggaagatggcaaag ttcatgacacccgtgatccaggacaacccctcaggctggggtccctgtgcggttcccgag cagtttcgggatatgccctaccagccgttcagcaaaggagatcggctaggaaaggttgca gactggacaggagccacataccaagataagaggtacacaaataagtactcctctcagttt ggtggtggaagtcaatatgcttatttccatgaggaggatgaaagtagcttccagctggtg gatacagcgcgcacacagaagacggcctaccagcggaatcgaatgagatttgcccagagg aacctccgcagagacaaagatcgtcggaacatgttgcagttcaacctgcagatcctgcct aagagtgccaaacagaaagagagagaacgcattcgactgcagaaaaagttccagaaacaa tttggggttaggcagaaatgggatcagaaatcacagaaaccccgagactcttcagttgaa gttcgtagtgattgggaagtgaaagaggaaatggattttcctcagttgatgaagatgcgc tacttggaagtatcagagccacaggacattgagtgttgtggggccctagaatactacgac aaagcctttgaccgcatcaccacgaggagtgagaagccactgcggagcatcaagcgcatc ttccacactgtcaccaccacagacgaccctgtcatccgcaagctggcaaaaactcagggg aatgtgtttgccactgatgccatcctggccacgctgatgagctgtacccgctcagtgtat tcctgggatattgtcgtccagagagttgggtccaaactcttctttgacaagagagacaac tctgactttgacctcctgacagtgagtgagactgccaatgagccccctcaagatgaaggt aattccttcaattcaccccgcaacctggccatggaggcaacctacatcaaccacaatttc tcccagcagtgcttgagaatggggaaggaaagatacaacttccccaacccaaacccgttt gtggaggacgacatggataagaatgaaatcgcctctgttgcgtaccgttaccgcaggtgg aagcttggagatgatattgaccttattgtccgttgtgagcacgatggcgtcatgactgga gccaacggggaagtgtccttcatcaacatcaagacactcaatgagtgggattccaggcac tgtaatggcgttgactggcgtcagaagctggactctcagcgaggggctgtcattgccacg gagctgaagaacaacagctacaagttggcccggtggacctgctgtgctttgctggctgga tctgagtacctcaagcttggttatgtgtctcggtaccacgtgaaagactcctcacgccac gtcatcctaggcacccagcagttcaagcctaatgagtttgccagccagatcaacctgagc gtggagaatgcctggggcattttacgctgcgtcattgacatctgcatgaagctggaggag ggcaaatacctcatcctcaaggaccccaacaagcaggtcatccgtgtctacagcctccct gatggcaccttcagctctgatgaagatgaggaggaagaggaggaggaagaagaggaagaa gaagaggaagaaacttaa >gi568815576r:36390011_36606422|GENSCAN_predicted_peptide_5|311_aa MHQLSEPPRQSAREEFLLSPLSEQLAAGFGEFRAELQMGKCFAGSSKGLWQHLAAPVRGN FKGLCKQIDHFPEDADYEADTAEYFLRAVRASSIFPILSVILLFMGGLCIAASEFYKTRH NIILSAGIFFVSAGLSNIIGIIVYISANAGDPSKSDSKKNSYSYGWSFYFGALSFIIAEM VGVLAVHMFIDRHKQLRATARATDYLQASAITRIPSYRYRYQRRSRSSSRSTEPSHSRDA SPVGIKGFNTLPSTEISMYTLSRDPLKAATTPTATYNSDRDNSFLQVHNCIQKENKDSLH SNTANRRTTPV >gi568815576r:36390011_36606422|GENSCAN_predicted_CDS_5|936_bp atgcatcagctcagtgaacctccacgacaatctgccagggaagaattcttgttatcgcca ctctcagagcagttggcggctgggtttggtgaattccgtgcggaattgcagatgggaaag tgttttgcaggctcttctaaagggctgtggcagcatctggcagcaccagtgcgggggaat ttcaaaggtctgtgcaagcaaattgatcacttcccagaggatgcagattacgaagctgac acagcagaatatttcctccgggccgtgagggcctccagcattttcccaatcctgagtgtg attctgcttttcatgggtggcctctgcatcgcagccagcgagttctacaaaactcgacac aacatcatcctgagtgccggcatcttcttcgtgtctgcaggtctgagtaacatcattggc atcatagtgtacatatctgccaatgccggagacccctccaagagcgactccaaaaagaat agttactcatacggctggtccttctacttcggggccctgtccttcatcatcgccgagatg gtcggggtgctggcggtgcacatgtttatcgaccggcacaaacagctgcgggccacggcc cgcgccacggactacctccaggcctctgccatcacccgcatccccagctaccgctaccgc taccagcgccgcagccgctccagctcgcgctccacggagccctcacactccagggacgcc tcccccgtgggcatcaagggcttcaacaccctgccgtccacggagatctccatgtacacg ctcagcagggaccccctgaaggccgccaccacgcccaccgccacctacaactccgacagg gataacagcttcctccaggttcacaactgtatccagaaggagaacaaggactctctccac tccaacacagccaaccgccggaccacccccgtataa