GENSCAN 1.0    Date run: 5-Nov-116    Time: 21:11:32

Sequence gi568815596f:186386383_186608730 : 222348 bp : 37.80% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +   4361   4573  213  1  0   39   43   195 0.664   5.03
 1.02 PlyA +   4818   4823    6                               1.05

 2.00 Prom +   5437   5476   40                              -1.55
 2.01 Init +   6266   6308   43  1  1   68   92    38 0.178   2.83
 2.02 Term +  19027  19136  110  2  2  106   45    92 0.234   4.39
 2.03 PlyA +  20226  20231    6                               1.05

 3.00 Prom +  20702  20741   40                              -7.45
 3.01 Init +  24640  24703   64  0  1   82   95    39 0.202   5.36
 3.02 Intr +  25794  26081  288  1  0   44   81   129 0.210   4.09
 3.03 Intr +  31794  31983  190  1  1  -27   61   146 0.004  -1.88
 3.04 Intr +  45323  45553  231  2  0   61   47   226 0.803  11.77
 3.05 Intr +  45572  45864  293  2  2  -28   42   204 0.405   0.05
 3.06 Intr +  46450  46754  305  2  2   63  -25   231 0.299   4.68
 3.07 Intr +  47289  47505  217  2  1   20    3   208 0.185   2.65
 3.08 Intr +  47778  47897  120  1  0   91   94    81 0.655   8.55
 3.09 Intr +  47954  48405  452  0  2   -5   57   374 0.207  17.09
 3.10 Intr +  49766  49953  188  1  2   47    8   127 0.309  -1.83
 3.11 Intr +  50348  50514  167  2  2   64   10   175 0.284   5.98
 3.12 Intr +  51650  51783  134  0  2   52   47   103 0.113   2.04
 3.13 Term +  71637  71882  246  2  0  -27   35   255 0.670   3.51
 3.14 PlyA +  72002  72007    6                               1.05

 4.00 Prom +  80383  80422   40                              -6.35
 4.01 Init +  89130  89807  678  2  0   42   86   238 0.359  14.18
 4.02 Intr +  99764  99894  131  1  2   84   50   193 0.949  13.67
 4.03 Intr +  99939 100075  137  0  2  100   81   163 0.985  16.09
 4.04 Intr + 100219 100409  191  2  2   77  -17   108 0.028  -2.42
 4.05 Intr + 108851 108952  102  1  0   88  111   126 0.999  14.35
 4.06 Intr + 113800 113911  112  0  1   -1   92   117 0.973   2.23
 4.07 Intr + 114891 115043  153  1  0   42   86   111 0.949   5.42
 4.08 Intr + 116114 116205   92  0  2   65   39    82 0.981  -0.21
 4.09 Intr + 117650 117832  183  1  0   67   78   193 0.987  15.26
 4.10 Intr + 119069 119215  147  1  0   45   68    80 0.469   1.21
 4.11 Intr + 119358 119459  102  2  0   77   94   252 0.999  24.15
 4.12 Intr + 120331 120454  124  0  1   64  115    60 0.989   5.54
 4.13 Term + 122161 122351  191  2  2   45   47   306 0.826  18.83
 4.14 PlyA + 122938 122943    6                               1.05

 5.00 Prom + 125510 125549   40                              -6.45
 5.01 Sngl + 134795 135202  408  1  0   57   41   294 0.602  17.64
 5.02 PlyA + 135900 135905    6                               1.05

 6.00 Prom + 146646 146685   40                              -1.05
 6.01 Sngl + 175715 175948  234  1  0   90   48   158 0.788   6.85
 6.02 PlyA + 176062 176067    6                               1.05

 7.00 Prom + 181210 181249   40                              -4.85
 7.01 Init + 186551 186605   55  1  1   50   81    65 0.203   3.73
 7.02 Intr + 192257 192343   87  0  0   95   47    63 0.296   1.92
 7.03 Term + 200141 200277  137  0  2   87   46    54 0.312  -1.70
 7.04 PlyA + 201357 201362    6                               1.05

 8.00 Prom + 201645 201684   40                              -4.55
 8.01 Init + 203957 204141  185  1  2   83   88   316 0.927  27.74
 8.02 Intr + 211364 211537  174  2  0   66   53   131 0.835   5.53
 8.03 Intr + 215639 215769  131  2  2   68   99   115 0.991  10.02
 8.04 Term + 216924 217030  107  1  2   50   46   107 0.977   0.29
 8.05 PlyA + 217260 217265    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  45063  45121   59  2  2   65   66    45 0.820   0.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:186386383_186608730|GENSCAN_predicted_peptide_1|70_aa
MSTFNVNKLVVKEAYLKIIRAIYDIPTANIILNRQKLKAFLLSMTRTRMPTLTTPIRHST
GSPSHGNQAR
>gi568815596f:186386383_186608730|GENSCAN_predicted_CDS_1|213_bp
atgtcaaccttcaatgtcaacaaacttgtcgtcaaagaggcatacctcaaaataataaga
gccatctatgacatacccacagccaacatcatactgaacaggcaaaagctgaaagcgttc
ctcctttcgatgaccagaacaaggatgcccactctcaccactcccattcgacatagtact
ggaagtcctagccacggcaatcaagcaagataa
>gi568815596f:186386383_186608730|GENSCAN_predicted_peptide_2|50_aa
MDTAGGPYPKEINPAEFMQVQVAGKKHTIMLATLKFKMINLKKVLHAAKE
>gi568815596f:186386383_186608730|GENSCAN_predicted_CDS_2|153_bp
atggatacagctggaggcccttatcctaaggaaattaacccagctgaattcatgcaggta
caagtggctgggaagaaacacacaatcatgctggctactcttaaattcaaaatgattaac
ctcaagaaggttcttcatgctgccaaggaatga
>gi568815596f:186386383_186608730|GENSCAN_predicted_peptide_3|964_aa
MTSKGHSTVLSIQLLQDDGEHVNDGMLLCMTHFMARWVRKHPVHDRQFRSHGDLMNHSQT
FSFHQSPVTDQELSLKPLRQDSKDILLALPAEEKLLLLGLMDRNGEKGIGQVNGCLSEKS
EKKKEEQRYQQTRVRHQNNVKMHQFLSNFCKVTDVALSFLEQGKEENPIHASIHHFSDKN
RPQLESDNWHHKRDPEVSEPSVPAASGWVMWPQHGLWYPVAAVLLRWALVETWVLVAGSL
MSMEKALKQLEVQSTKKQVGWAFLTALQEVHTKSLRDAAQVRALQVQAQSLEAWLHSSEK
ELKAAMNGDFQMQTGHLETWLQSLEKELEAAVNAGLGPWSQPEIPTLIRMRKNPHWCCLH
EVTTAMAALREVEGHRWDREVCAVKKGKMFRLRGPPIPWEKRGPQQVTYLQMWIDVILAR
ADQEKINKQPNDVLLTLWTKLSLEHQFQKMPKGKDISYRLPDGHTEITETIKKVEKVQIV
RGTHILYDSAMLPARKPDGTWWMMADYQELNKVTPPLQATVPSVMDLMNYLEGAAPLLQQ
HLAACGWTVNKSKVQGPGLSAKFLEVIWSALYGEAVANFSGPPGILMDICAPFSSNDKKP
LHLLTKKDSTWDSDDAAETTFLAAKWAIQQAQILWVVDQGCLFELDVHVTTDGFGWGLWQ
CTEWLRMQVGFWPQLWKEAELQFTLTEKQLAAVYASLQACESVTGWAAVVMQMTYLIAGW
VSTNDDLLRSGRGTNGNLLLPVPMPLKVGEQKTWLWPWTLQPPPPPPPMPVVGHCISLEK
GLQWICTALPEAAAVDGLPWHTQSGPADNSTQLETWGLMADTWNTMQQALDKGCRKTHEL
AHAQRKGKAMSKLSTIPRVVFQLPPFAPLTPLRHQLEPDTEVAKDYNRRESGLITRALKR
KQVLPDKRDSKHQRTPRQGDYLMLALNIEGATGQRMQVASRSGKQPLDDCQQGNVDLSTT
TTRN
>gi568815596f:186386383_186608730|GENSCAN_predicted_CDS_3|2895_bp
atgacttcaaaaggccattctactgttctatcaatccagctgcttcaggatgatggggaa
catgttaatgatggaatgctgctatgcatgacccattttatggctagatgggtcagaaag
cacccagttcatgataggcaattcaggtcgcatggtgacttgatgaaccatagtcaaacg
ttcagtttccaccaaagcccagtaacagaccaagagctgtctctcaaacctctgagacag
gacagtaaagatatattgctggccttgccagctgaagagaaattgcttctgctgggcctt
atggacaggaatggagaaaaaggcattggccaagtcaatggctgcctatcagaaaaaagt
gaaaagaagaaagaagaacaacgctaccaacagaccagagtcagacatcagaataatgta
aaaatgcatcaatttctgagcaatttttgcaaggtcacagatgtggctctaagctttctg
gaacaaggcaaagaagaaaatccgattcatgcctcaattcatcatttctcagataaaaac
agacctcagttagaatctgacaattggcatcacaaacgggatcctgaggtgagtgagcct
tcagtccctgctgcctctggctgggtcatgtggccacaacatgggctgtggtacccagtg
gcagctgtgctgctcagatgggctctggtagaaacctgggtgttggtagctgggtccctc
atgagcatggagaaggcactgaagcagctggaagtgcagagcaccaagaagcaagttgga
tgggcatttttgactgcactacaagaagtacacaccaagtccctgagggatgcagcacag
gtaagggccctccaggtgcaggcacagagcctggaggcctggctacacagctcagaaaaa
gagttaaaagctgccatgaatggggatttccagatgcagacagggcacctggagacctgg
ctacagagcttggaaaaggaattagaggctgctgtgaatgcaggcctgggtccctggtct
cagccagagattcccactctgatacggatgaggaagaacccccattggtgctgcctacat
gaagtgaccactgctatggcagctctccgggaagtagaaggccatcggtgggaccgagag
gtctgtgcagtaaagaaggggaagatgttccgcttgcggggcccccccatcccatgggag
aaaaggggaccccaacaagtgacatacttacagatgtggatagatgtgattctggccaga
gctgaccaagagaaaatcaataagcagcccaatgatgtactcttaactttgtggacaaag
ttgtccctggagcaccaattccagaaaatgcccaaggggaaggacatttcttatcgtttg
ccagatgggcatacagagataactgagacaattaaaaaggtggagaaggtgcagatagtg
cgaggaacccacatcctatatgattctgcaatgttgccagctagaaagcctgatggaact
tggtggatgatggcagactatcaggaactgaataaagtaacaccccctttgcaagcaact
gtaccatcagtcatggatttgatgaactatttagaaggagcagcgcccctcttgcagcaa
catttggcagcatgtggttggactgtcaataaatccaaggtccaaggacctggattatct
gccaaatttttggaagttatctggtcggccctctatggtgaagcagttgcaaacttttct
gggcctcctgggatactgatggacatttgtgccccatttagctcaaatgataaaaaacca
ttgcatctgttaacaaagaaggactctacctgggattcagatgatgcagctgagaccacc
ttcctggcagccaagtgggctattcagcaggcacaaatcctatgggtagttgaccagggg
tgcctgtttgagctggatgtgcatgtgaccacagatggttttggttggggcctatggcag
tgcacagagtggctgagaatgcaagtaggcttttggccccaactatggaaggaagctgag
ctgcagtttaccttgacagagaaacagctagcagctgtatatgcctcccttcaggcttgt
gagagtgtgacaggatgggctgcagttgtcatgcagatgacttacctgatagcgggatgg
gtaagtaccaatgatgacctcctccgatcaggtagggggacaaatggtaacctgttgttg
cctgtcccaatgcccctgaaggtaggagaacagaaaacctggctgtggccatggaccctg
caaccaccacccccgcctccaccaatgccggtggttggccattgtatctccctggagaag
ggcttacaatggatctgcactgcccttccagaagcagctgctgtggatggcttaccttgg
cacacacaatcagggcctgcagataactcaacacagctggagacttggggtctgatggct
gacacctggaatacaatgcagcaagctttggacaaaggatgccgcaaaacccatgagctt
gctcatgcccagagaaagggtaaagccatgtcaaaactgtctactattcctcgagtggtt
ttccagctacctccatttgccccgctgactcccctcagacatcagttagaacctgacaca
gaggttgcaaaagattacaatcgcagagagtctggcctaatcacacgagccctaaaaagg
aaacaggttcttcctgataaaagagattcaaagcatcagaggactccacgtcagggagat
tatctgatgctggctttgaatatagagggggccacagggcaaagaatgcaggtggcctct
aggagtggaaagcagcccctagatgactgtcagcaaggaaatgtggacctcagtactaca
accacaagaaattaa
>gi568815596f:186386383_186608730|GENSCAN_predicted_peptide_4|780_aa
MIISIDAEKAFDKIQQHFMLKTLNKLGTDGTYLKIIRAIYDKPTANIILNGQKLEAFPLK
TGTRQGYPFSPLLFNIVLKVLARAIRQEKEIKGIQLGKKEVKLSLFADDMIVYLENPIVS
AQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSEFPFTIASKRIKYLGIQLT
RDVKDLFKENYKPLFNEIKEDTNKWKNIPCSWVGRINIVKMAILPKWDAVGAHAQLRFRE
PLLATHYALRHRPVAKAAFAGPMSDSLSVRARTPDGIQLRVSLAGAICLRNAPQETGSGR
GQQKGGAKKEGEDYRTPDDASSLITIVTLLAPPHLGFASPPTRPCSPTHAPGCEMTPRDW
HRVLSFASPLRIEEIFQGYDKTFGLKNKKGAKQQKFIKAVTHQVKFGQQNPRQVAQSEAE
KKLKKDDKKKELQELNELFKPVVAAQKISKGADPKSVVCAFFKQGQCTKGDKCKFSHDLT
LERKCEKRSVYIDARDEELEKDTMDNWDEKKLEEVVNKKHGEAEKKKPKTQIVCKHFLEA
IENNKYGWFWVCPGGGDICMYRHALPPGFVLKKDKKKEEKEDEISLEDLIERERSALGPN
VTKITLESFLAWKKRKRQEKIDKLEQDMERRKADFKAGKALVISGREVFEFRPELVNDDD
EEADDTRYTQGTGGDEVDDSVSVNDIDLSLYIPRDVDETGITVASLERFSTYTSDKDENK
LSEASGGRAENGERSDLEEDNEREGTENGAIDAVPVDENLFTGEDLDELEEELNTLDLEE
>gi568815596f:186386383_186608730|GENSCAN_predicted_CDS_4|2343_bp
atgattatatcaatagatgcagaaaaggcctttgacaaaattcaacaacacttcatgcta
aaaactctcaataaattaggtactgatgggacgtatctcaaaataataagagctatttat
gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa
actggcacaagacagggataccctttctcaccactcctattcaacatagtgctgaaagtt
ctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaaaaggaa
gtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaaccccattgtctca
gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat
gtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatg
agtgaattcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca
agggatgtgaaggacctcttcaaggagaactacaaaccactgttcaatgaaataaaagag
gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa
atggccatactgcccaagtgggacgcagtgggcgcgcacgcgcagctccgcttccgggag
ccgctcctcgctacccactatgctctgcgacatcgacctgtcgcaaaggccgcgtttgcg
gggccaatgagcgactcgctttccgtgcgggccagaacccctgacggtattcagctgcgc
gtaagtctggccggtgccatctgtctccgcaatgccccccaagaaacaggctcaggccgg
gggcagcaaaaaggcggagcaaaaaaagaaggagaagattatcgaacccctgatgacgct
agctctctcattactatagtgacgctgctggcgcccccgcacctgggctttgcctctccg
cccacacgcccgtgttcacccacccatgcacccgggtgcgaaatgacccctcgggactgg
cacagggttttatccttcgctagtcctctcagaattgaggagattttccaaggttatgac
aaaactttcggtttgaagaataagaaaggagcaaagcaacagaagtttatcaaggctgtc
acacatcaagttaaatttggtcaacaaaatccacgtcaggtagcacagagtgaagctgaa
aagaaattgaagaaggatgacaagaagaaagaattgcaggagctaaatgagctgttcaaa
cctgtagttgctgctcaaaaaataagtaaaggtgcagatcccaagtctgtagtatgtgca
ttcttcaagcaaggacagtgtactaaaggagataagtgtaagttctcccatgacttgact
ctggagagaaaatgtgaaaagcgaagtgtttacattgatgcaagagatgaagaacttgaa
aaagatactatggataattgggatgagaaaaagctggaagaagtagtgaacaagaagcac
ggtgaggcggaaaagaaaaaaccaaaaactcaaatagtgtgcaagcatttcctggaagct
attgaaaacaacaagtatggctggttttgggtatgccctggagggggtgatatttgcatg
tatcgtcatgcacttcctcctggatttgtgttgaaaaaagataaaaagaaagaagagaaa
gaagatgaaatttcattagaagatctaattgagagagagcgttctgccctaggtccaaat
gttaccaaaatcactctagaatcttttcttgcctggaagaaaaggaaaagacaagaaaag
attgataaacttgaacaagatatggaaagaaggaaagctgacttcaaagcagggaaagca
ctagtgatcagtggtcgtgaagtgtttgaatttcgtcctgaactggtcaatgatgatgat
gaggaagcagatgatacccgctacacccagggaacaggtggtgatgaggttgatgattca
gtgagtgtaaatgacatagatttaagcctgtacatcccaagagatgtagatgaaacaggt
attactgtagccagtcttgaaagattcagcacatatacttcagataaagatgaaaacaaa
ttaagtgaagcttctggaggtagggctgaaaatggtgaaagaagtgacttggaagaggac
aacgagagggagggaacggaaaatggagccattgatgctgttcctgttgatgaaaatctt
ttcactggagaggatttggatgaactagaagaagaattaaatacacttgatttagaagaa
tga
>gi568815596f:186386383_186608730|GENSCAN_predicted_peptide_5|135_aa
MNPSEMQRKAPPRRQRHRSRAPSAHKMNRMVMSEEQMKLPSTKKAEPPTWAQLKKLTQLA
KKKPREHKGDTNSREHAACSFEDCINSVCRCTQQLRRERPSRTSHDDNGGFVEKKGEMWG
KERDIRLLLCLCRKK
>gi568815596f:186386383_186608730|GENSCAN_predicted_CDS_5|408_bp
atgaacccgtcggagatgcaaagaaaagcacctccacggagacagagacaccgcagtcga
gcaccatcggctcacaagatgaacagaatggtgatgtcagaagaacagatgaagttgcca
tccaccaagaaggcggagccgccgacatgggcacaattaaagaagctgacacagttagct
aaaaaaaagcctagagaacacaaaggtgacacaaactccagagaacatgctgcttgcagc
tttgaagactgtatcaacagtgtctgcaggtgtacccagcagctccgaagagagcgacca
tcgagaacgagccatgatgacaacggtggttttgtcgaaaagaagggggaaatgtgggga
aaagaaagagatatcagactgttactgtgtctatgtagaaagaagtaa
>gi568815596f:186386383_186608730|GENSCAN_predicted_peptide_6|77_aa
MTSPNKLNKAPVTNPREPEICDLSQREFKIGVLRKLSEIQDNTEKEFRVLADKFNQEIGI
TLKNEAEILKLKNSVDK
>gi568815596f:186386383_186608730|GENSCAN_predicted_CDS_6|234_bp
atgacctcaccaaacaaactaaacaaggcaccagtgacaaaccccagagagccagaaata
tgtgacctttcacagagagaattcaaaataggtgttttgaggaagctcagtgaaattcaa
gataacacagagaaggaattcagagttctagcagataaatttaaccaagagattggaata
actttaaaaaatgaagcagaaattctgaagctgaaaaattcagttgacaaataa
>gi568815596f:186386383_186608730|GENSCAN_predicted_peptide_7|92_aa
MYGLLKVVSQGMVAAAGAGLTRRYYSIKLTQRSWQLNAIKAVFHPIAYKKTEAQIGWIDF
PRVIQLVVGKARNGTQVLYFTETTGIKIFTKV
>gi568815596f:186386383_186608730|GENSCAN_predicted_CDS_7|279_bp
atgtatggactccttaaggtggtttcccaaggcatggtagcagctgctggagctggattg
actaggcggtattatagtatcaaattaacacagagatcttggcagcttaatgcaataaaa
gctgtatttcaccctattgcatataagaaaactgaggcacagattggttggattgatttc
cccagggtcatacaactagtagttggcaaagccaggaatggaacccaggtgctttatttt
acagaaaccactggaatcaagatttttactaaagtatag
>gi568815596f:186386383_186608730|GENSCAN_predicted_peptide_8|198_aa
MAFPPRRRLRLGPRGLPLLLSGLLLPLCRAFNLDVDSPAEYSGPEGSYFGFAVDFFVPSA
SSSVVALDSHRSKPYCVNPAVKCACKGSRLHASYENLIPGDPSWTVSSRKHPPLCPGKES
RMFLLVGAPKANTTQPGIVEGGQVLKCDWSSTRRCQPIEFDATVLKYIDSGKYVSETGIM
MWTVPFDIVISAKMIVSH
>gi568815596f:186386383_186608730|GENSCAN_predicted_CDS_8|597_bp
atggcttttccgccgcggcgacggctgcgcctcggtccccgcggcctcccgcttcttctc
tcgggactcctgctacctctgtgccgcgccttcaacctagacgtggacagtcctgccgag
tactctggccccgagggaagttacttcggcttcgccgtggatttcttcgtgcccagcgcg
tcttcatcagtggtggcattagattctcataggagcaaaccctattgtgtgaaccctgct
gtgaagtgtgcatgcaagggatctaggttacatgcttcttatgagaacctaatacctggt
gatccgagttggacagtttcatcccgaaagcatccccctctatgtccagggaaggaaagc
cggatgtttcttctcgtgggagctcccaaagcaaacaccacccagcctgggattgtggaa
ggagggcaggtcctcaaatgtgactggtcttctacccgccggtgccagccaattgaattt
gatgcaacagtactaaaatatattgattctgggaaatatgtcagtgaaacaggaatcatg
atgtggactgttccatttgacattgtgatcagtgccaaaatgattgttagtcattaa