GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:03:31 Sequence gi568815582r:52339228_52646723 : 307496 bp : 39.58% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8319 8350 32 2 2 51 100 44 0.346 1.16 1.02 Intr + 20588 20750 163 2 1 57 86 120 0.533 7.76 1.03 Term + 27493 27657 165 0 0 68 48 152 0.982 6.13 1.04 PlyA + 29263 29268 6 1.05 2.00 Prom + 31119 31158 40 -9.05 2.01 Init + 33860 34144 285 1 0 72 35 143 0.471 4.56 2.02 Intr + 34349 34524 176 1 2 86 82 95 0.692 6.52 2.03 Intr + 34788 34915 128 0 2 112 -21 41 0.336 -5.00 2.04 Term + 35296 35582 287 2 2 15 47 221 0.559 5.28 2.05 PlyA + 36032 36037 6 1.05 3.00 Prom + 36127 36166 40 -9.35 3.01 Init + 37086 37185 100 2 1 64 94 45 0.716 3.17 3.02 Intr + 39939 40101 163 1 1 68 22 200 0.756 9.51 3.03 Intr + 41036 41169 134 2 2 53 66 99 0.557 3.57 3.04 Term + 70156 70244 89 2 2 86 44 116 0.302 3.84 3.05 PlyA + 70705 70710 6 1.05 4.14 PlyA - 71405 71400 6 1.05 4.13 Term - 84153 84017 137 1 2 106 42 25 0.136 -3.10 4.12 Intr - 85027 84889 139 1 1 40 80 88 0.287 2.32 4.11 Intr - 95351 95175 177 2 0 -30 105 139 0.027 2.99 4.10 Intr - 100768 100013 756 1 0 51 82 801 0.036 66.53 4.09 Intr - 105129 105049 81 0 0 32 92 117 0.924 5.42 4.08 Intr - 106994 106767 228 2 0 101 94 206 0.985 19.64 4.07 Intr - 111319 111050 270 1 0 93 89 205 0.914 17.92 4.06 Intr - 124961 124707 255 2 0 68 66 110 0.255 3.52 4.05 Intr - 129429 129282 148 2 1 42 92 32 0.008 -1.68 4.04 Intr - 134354 134248 107 2 2 54 75 93 0.007 2.69 4.03 Intr - 182470 182296 175 0 1 18 32 128 0.042 -0.78 4.02 Intr - 187338 187166 173 0 2 76 51 129 0.107 5.82 4.01 Init - 204120 204067 54 0 0 45 119 55 0.540 5.63 4.00 Prom - 204964 204925 40 -6.45 5.04 PlyA - 206293 206288 6 1.05 5.03 Term - 207164 207051 114 2 0 93 48 59 0.491 -0.01 5.02 Intr - 208408 208141 268 0 1 28 75 235 0.938 12.81 5.01 Init - 208616 208423 194 2 2 29 75 190 0.341 10.29 5.00 Prom - 214039 214000 40 -7.05 6.00 Prom + 225832 225871 40 -6.45 6.01 Init + 226243 226334 92 0 2 36 110 76 0.367 4.62 6.02 Intr + 228095 228173 79 2 1 28 105 57 0.538 0.03 6.03 Intr + 228766 228906 141 0 0 58 55 88 0.123 2.13 6.04 Intr + 235959 236022 64 2 1 79 94 95 0.325 6.57 6.05 Intr + 244656 244693 38 1 2 63 84 52 0.039 -0.64 6.06 Intr + 245938 246108 171 0 0 80 21 130 0.339 4.72 6.07 Intr + 248583 248694 112 2 1 58 59 90 0.143 2.13 6.08 Intr + 260735 260864 130 0 1 61 56 112 0.008 4.13 6.09 Intr + 268804 268884 81 1 0 83 59 100 0.051 4.43 6.10 Intr + 289961 290069 109 2 1 71 92 38 0.723 1.87 6.11 Term + 290711 290827 117 1 0 67 47 147 0.874 6.06 6.12 PlyA + 290977 290982 6 1.05 7.05 PlyA - 291728 291723 6 1.05 7.04 Term - 298532 298361 172 1 1 66 42 105 0.414 0.02 7.03 Intr - 301450 301379 72 0 0 96 44 85 0.504 2.50 7.02 Intr - 303197 303017 181 0 1 35 40 161 0.705 4.00 7.01 Init - 304186 303892 295 1 1 62 86 161 0.349 10.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 134187 134335 149 1 2 73 49 159 0.830 7.58 S.002 Intr - 251840 251709 132 1 0 107 115 97 0.901 13.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:52339228_52646723|GENSCAN_predicted_peptide_1|119_aa MIFDKGAKIIQTLTNTDIGTEKWSGATANTQKCGGSFGTWYRVEAGRVLRCMQEKANIAV KGLLKTIEHGMEKNKKAGRFAQKSITVATAYLRIWNPNEKPSEEEKVHSKNENRIGNVN >gi568815582r:52339228_52646723|GENSCAN_predicted_CDS_1|360_bp atgattttcgacaaaggtgccaagataatccaaaccctgacaaatactgatattggtact gagaagtggagtggtgctacagcaaatacccaaaaatgtggaggcagttttggaacttgg tatagggtagaggctggaagagttttgagatgcatgcaagaaaaagccaatattgctgtg aaaggacttttaaagacaatagaacatggcatggagaagaataaaaaagctggtcgattt gctcagaaatctataactgtcgcaacagcatatttgagaatttggaaccccaatgaaaag cccagtgaagaagagaaagttcattccaaaaatgagaacagaataggaaatgtgaactga >gi568815582r:52339228_52646723|GENSCAN_predicted_peptide_2|291_aa MMLASAQLLGRPQEAYNHGRDKEGASPSHSQSSKEGKRRYYTLLNNWISQELTITDDRTK EDVVKPLETTCPHDPITSHEAPSPTLRITIQHEIWSFSLCQKREVGSGSLGSFQLCCPMF QCCCRCWTHLEPPKARSLPISKMKMLMKIVLAPRTQGLTCSKQMDWLPFSICLKAKKQGP TFTLSEKPRTLVTCSCNDLQKGEATGLKFQSWEVTKLGLDSMTYESQSCSDPTSWAPLMR HYSVTGTGKLRSEFQANEASTKLPFSDPPPSSTPAVCRVFQPSRKPNAAKQ >gi568815582r:52339228_52646723|GENSCAN_predicted_CDS_2|876_bp atgatgctggcatctgctcagcttctggggaggcctcaggaagcttacaatcatggcaga gacaaagagggagccagcccttcacatagccaaagcagcaaggaggggaagaggaggtac tacacacttttaaacaactggatctcacaagaactcactatcaccgatgaccgcaccaag gaggatgttgttaaaccattagaaaccacctgcccccatgatcccattacctcccatgag gccccatctccaacattgaggattacaattcaacatgagatttggtctttctccctctgc caaaagagggaagtgggctcagggtccctgggatccttccagctctgctgtccaatgttc caatgctgctgccgatgttggacgcacctggagccacccaaggccaggtctcttccaatc agtaaaatgaagatgctaatgaagattgttctggcacctagaactcaaggcctcacctgt tctaagcaaatggattggctgcctttcagcatctgtcttaaagccaagaaacaaggaccc acattcactttatcagaaaaacccagaaccctggtcacttgtagctgcaatgaccttcaa aaaggtgaagcaacaggcctcaagttccaaagctgggaggtcacaaagctgggtcttgac tccatgacttatgagagtcaatcctgttctgaccccacatcatgggcccctctgatgcgt cattactcagtaactggcaccgggaaactgaggagtgaatttcaggccaatgaagcatcc acgaaacttcctttttcagatccccctccaagcagtacccctgctgtctgcagggtcttc cagccttccaggaagccaaatgcagcaaagcaatga >gi568815582r:52339228_52646723|GENSCAN_predicted_peptide_3|161_aa MGPSQIQRYRNSALPVHGKTCKDLTATFSPPPRDNCEAFVKTICTKPETPFSLGFGTNYP VIGTTPQNLMTALSEGPISKEFLPEANIPKLSSSVRLIQDELVVSVGAKAIAKLKFLYIQ RPEKGQMCYKATGLDPQKEDLKPEIFAISEEKFHYGYYTIY >gi568815582r:52339228_52646723|GENSCAN_predicted_CDS_3|486_bp atgggcccatcccagattcaaaggtacagaaattcggctctacctgttcatggaaagaca tgcaaagacttgacggcaacatttagtccaccacccagagacaattgtgaagcttttgtg aagaccatctgcactaaaccagagactccctttagtttgggatttggaaccaactaccct gtaattggcactactccccagaatcttatgaccgccctctcagagggcccaatttctaaa gaatttctccctgaagccaacattccgaagctgtcttccagtgtcaggttaattcaagat gaacttgttgtatcagttggtgcaaaagcaattgcaaaactgaagtttttatatatccaa aggccagaaaagggacaaatgtgttataaggctacaggcctcgaccctcagaaagaggac ctgaagcctgagatatttgccatcagtgaagaaaaatttcattatggttactacacaatt tactaa >gi568815582r:52339228_52646723|GENSCAN_predicted_peptide_4|899_aa MLIGTFRISDFWIRDAELPSPTTPIPCRPTISRLYGNEKLLFFTKKIGGDKVINTYFVQE SIRATILEHSFGLLEGTIQRPLKGPRIIGEIILASTEAVYAQLITALIFRYSSGNSVIGT DGHAQRQNIFKGHKGVVNRAFTCVWTLQSGLCQHVTVRGTLDPSWHSSFVNTSILSKSQP ECFIMPHINVKQSVFFQFGNNNNYMNMAEANNAFFAASEQTFHTPSLGDEEFEIPPITPP PESDPALGMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISRNLVEQDGVLHSS GLHMDQSHTQVSQYRQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQSQLSAQLGLNLGG ASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIGEKRAAPDSGKKPKTPKKKKKKDP NEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEEQKQVYKRKTEAAK KEYLKALAAYRASLVSKSGIFSPKMQAAAESAEAQTIRSVQQTLASTNLTSSLLLNTPLS QHGTVSASPQTLQQSLPRSIAPKPLTMRLPMNQIVTSVTIAANMPSNIGAPLISSMGTTM VGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQQIQQQMQQQHFQ HHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHMQHQSQPSPRQHSPVASQITSP IPAIGSPQPASQQHQSQIQSQTQTQVLSQMCLGGCVGSEQAKIKVILVDVEDISRQGESR ENPFRLKTQAPQNSRRESNKIGQTAFVETLLSSDSLLLVISPSLALARDTLDSVFQYTVF LAIADGLRGRHLSVETCQCPKPCWSISPLCKDALCSSLDVTLIYEINEAKLEKHHDPLS >gi568815582r:52339228_52646723|GENSCAN_predicted_CDS_4|2700_bp atgctcattggaacatttcggatttcagatttttggattagggatgctgaactgccctcc cccaccacccccatcccctgccgaccaacgatttcaaggctttatggaaatgaaaagttg ctcttcttcacaaaaaagattgggggagataaagtgattaacacttattttgttcaggaa tccatccgagctaccatccttgaacactcatttggcttgttggaaggaaccatccaaagg cctctgaagggacccaggatcatcggagagatcatacttgcatccactgaagctgtgtac gcacagttgatcactgcactcatatttaggtactctagtggtaacagtgtcatcggcaca gatggccatgcacaaaggcaaaacatatttaaggggcataaaggggtagttaaccgagcc ttcacgtgcgtgtggactttgcagtccggcctctgccagcatgtgaccgtgcgtggcaca cttgatccatcatggcacagttcatttgtcaacaccagcattctttccaaatcacagcca gaatgctttattatgccgcatattaacgtaaaacaatctgttttctttcagtttggaaat aataataactatatgaatatggctgaggcgaacaatgcgttcttcgctgccagtgagcag acattccacacaccaagccttggggacgaggaattcgaaattccaccaatcacgcctcct ccagagtcagaccctgccctaggcatgccggatgtactgctaccctttcaagccctcagc gatccattgccttcccagggaagtgaattcacaccccagtttccccctcaaagcctggac ctcccttccattacaatctcaagaaatctcgtggaacaagatggcgtgcttcatagcagt gggttgcatatggatcagagccacacacaagtgtcccagtaccggcaggatccctccctg atcatgcggtccatcgtccacatgaccgatgctgcgcgttctggggtcatgcctcctgcc cagctcaccaccatcaaccagtctcagctcagcgcccagttggggttgaatttgggaggt gccagtatgcctcacacatctccttcacctccagcaagcaaatcagccactccctcccct tccagctccatcaatgaagaggatgctgatgaagccaacagagccattggagagaaaaga gctgctccagactctggcaagaagcccaagactccaaagaaaaagaaaaagaaagatccc aatgagccacagaagccagtgtcagcatatgccctgtttttcagagacacacaggctgca attaaaggtcaaaaccccaatgcaacctttggagaggtctcaaaaattgtagcatctatg tgggacagccttggagaagaacaaaagcaggtatataaaaggaaaacagaagctgccaaa aaagaatacctgaaggccctggcggcatacagggccagcctcgtttctaagtctggaatt ttctcccccaaaatgcaggctgctgctgagtcagcagaagcccagaccatccgttctgtt cagcagaccctggcgtcgaccaatctaacatcctctctccttctcaacactccactgtct caacatggaacagtgtcagcatcacctcagactctccagcaatccctccctaggtcaatc gctcccaaacccttaaccatgagactccccatgaaccagattgtcacatcagtcaccatt gcagccaacatgccctcgaacattggggctccactgataagctccatgggaacgaccatg gttggctcagcaccctccacccaagtgagtccttcggtgcaaacccagcagcatcagatg caattgcagcagcagcagcagcagcaacaacaacagatgcaacagatgcagcagcagcaa ctccagcagcaccaaatgcatcagcaaatccagcagcagatgcagcagcagcatttccag caccacatgcagcagcacctgcagcagcagcagcagcatctccagcagcaaattaatcaa cagcagctgcagcagcagctgcagcagcgcctccagctgcagcagctgcaacacatgcag caccagtctcagccttctcctcggcagcactcccctgtcgcctctcagataacatccccc atccctgccatcgggagcccccagccagcctctcagcagcaccagtcgcaaatacagtct cagacacagactcaagtattatcgcagatgtgccttggaggttgtgtaggatctgaacag gccaagataaaagtcatcctggtagatgtagaagacatctcccgccaaggggagagcaga gaaaatccattcaggttgaaaacacaggcacctcaaaacagcagaagagaatcaaacaaa attggacaaacagcttttgtggagacactacttagcagtgattctctgctcttggtgatt tctccttctctggcactggcccgtgacacactggacagtgtttttcaatacacagtcttt ctggccatagctgatggactcaggggcagacacctgtccgtggaaacttgccaatgcccc aagccttgttggtcaatatcccctttgtgcaaggatgccctgtgttcatctttagatgta acattgatatatgagataaatgaggctaaattggaaaagcaccatgatcccctgtcatga >gi568815582r:52339228_52646723|GENSCAN_predicted_peptide_5|191_aa MPFEYKQEQREYEENRHEASPGKLQVCLPVKGQAPTPLLGNPTGELGVPAPGYRETLVQV GVWVRGHSKGKAGSRRGARGQAPMKKEGGHGEESDRKGADGLGMLGPGEAVVAQEEGGTH GTQGWRPLRPRKGGERGALRGSGDKEDRSGKGGETLLCCPQKAKRARGLKEGLACDCLLL LLVNTIAAPLD >gi568815582r:52339228_52646723|GENSCAN_predicted_CDS_5|576_bp atgcctttcgaatacaaacaagagcaaagagaatacgaagagaaccgacacgaggcttca cctgggaagcttcaagtctgcctacctgtgaaaggtcaggccccaacaccccttctggga aatcctacaggtgaacttggggtcccggctcccggctaccgggagacactcgtgcaggtt ggcgtctgggttaggggacattcgaaaggcaaggcggggtcgaggagaggggcccgggga caggcacccatgaagaaggaagggggtcatggggaggaaagcgacaggaagggagcagac ggcttggggatgctggggcccggggaagctgtggtcgcgcaggaggagggtgggacacac gggacgcagggctggcgtccgcttcggccgaggaaaggaggggagagaggagcgctccga ggcagtggggacaaagaggaccggagcgggaaggggggcgagactctcctttgctgcccc caaaaagcgaagagagcccgaggactgaaagaagggctggcgtgtgattgtttgttgctg ttattagtaaacacgatcgcagcccccctggactga >gi568815582r:52339228_52646723|GENSCAN_predicted_peptide_6|377_aa MPGTQQVLNKCLLIQELTVLKPTDGKSVEGSVSSQLCLLAEIPEQVLVWGSCQRVEEVLI EYGLWCLAPWWQQESPGAGSEWYIMQERGEEVPDNPESLGAAAPVPHLAMWKAEPLRVGG SCGIPPCRRPIVGPHPVTLPKAFAGNKRIVKLTNISRDKEVWGDNALVNGLVFESLLILQ HLEDDHEEPMLPSIRMTRLQHEEDAILWNWVSRRRHSKVHEQSEGGASKCRGVLNAGTSK IMGPASDKSVLAASFHGGMAREGEGEKQKGAKLIPLIGSVLTIQKAVVNGLELSADKEAF LEFYLRPDPSCQTPTESDPQIARRSLSKLSPPDQKRRSYNKGSALKEADAMLKPITLKLT PHLGRAGSTITGFPVQE >gi568815582r:52339228_52646723|GENSCAN_predicted_CDS_6|1134_bp atgcctggaacacagcaggtgctgaataaatgtttgttgatccaggaactgactgtgttg aagcccacagatgggaaatcagtagaaggcagtgtttccagtcagctctgcttgctggct gagataccagagcaagtgctggtctggggaagttgtcagcgtgtagaggaggtgcttatt gagtatggattgtggtgtcttgcaccttggtggcagcaggaaagtcccggggcaggaagt gagtggtacataatgcaggagaggggagaggaggtgcctgataaccctgagagtctggga gctgctgccccagttcctcatcttgcaatgtggaaagcagaacctctccgggttggagga tcctgtgggatcccaccttgcagacggcctattgtgggacctcatcctgtgacccttccc aaagcttttgctggtaataaaaggattgtaaagttgacaaatataagcagggacaaagaa gtttggggagataatgctctagtgaatgggctagtatttgaaagtttattgatcctgcaa cacctagaagatgatcacgaagagccaatgctgccatcaatcaggatgacaaggctgcag catgaagaagatgccatattgtggaattgggtgtcaaggagaagacattcgaaggtgcat gaacagagtgaaggtggtgcaagcaagtgcagaggagttctaaatgctgggacatccaag atcatgggacccgcatccgacaagagtgttctagctgcatcattccatggtggaatggcc agagagggtgagggagagaagcaaaagggggccaaactcatccctctgatagggagcgtc ctcacaattcagaaagcagttgtcaatgggctggagctttcagctgataaggaggcattt ctggaattctacctcaggcctgatcccagctgccagactccaacggagtcagatcctcaa attgccaggagatcgctatccaaactgtcaccccctgatcagaaaagaaggagttataac aaaggatcggcccttaaggaggctgatgcgatgttaaagcccatcactctaaaattgaca ccacacctgggaagagcaggcagcaccatcactggctttcctgttcaagaatga >gi568815582r:52339228_52646723|GENSCAN_predicted_peptide_7|239_aa MDRNQDNSEGLENRNYRKYLVAEQRDEKATTTSKHENTSLLVSLSPNTVCDWLSLSHVPW WLPGCAEERRGKDRAFLASIMGNEHLICYLIQLTHNGQGSEKYNPTWESCFLAMLCIMEW EAGTFGGWRLTFATAEEMGDGDGLTAPPMDKGPLWSLGRSTDPQDGNPALPVLSLTLQKK ERRRKRFVPQATPPLPDLERTVASRAVPSYLLLLHTGTKGPEASPEQRPNWDPTQKELM >gi568815582r:52339228_52646723|GENSCAN_predicted_CDS_7|720_bp atggacaggaaccaggacaactctgaagggctagaaaataggaattacagaaagtatctt gtagcagagcagagggacgagaaagctactaccacctccaaacatgaaaatacctctctc ctggtttcactctctcctaacacagtatgcgattggttgagtttaagtcatgttccctgg tggctccctggctgtgctgaggagaggagaggaaaagatagggcctttttggcttccatc atgggaaatgagcacctgatttgctatctcatccagttgacacacaacggacaaggaagt gagaaatacaaccccacctgggagagctgtttcctagcaatgctctgcatcatggaatgg gaagcaggaacctttggaggatggagactaacctttgccacagcagaggaaatgggagat ggagatggattgacagcccctccaatggataaaggtccactatggagtttgggaagatcc acagatccacaggatggaaatcctgcattacctgtcctgtctctcactctacaaaagaag gaacggaggagaaaaaggtttgtcccccaagcaactccccctctgccagacctggagagg actgtggctagcagggcagtgccgtcttaccttctgctactgcacactgggactaagggg ccagaggcctcaccagaacaaaggcctaactgggacccaacacagaaggagctcatgtga