GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:42:36 Sequence gi568815588r:117032212_117238056 : 205845 bp : 44.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 1092 1307 216 2 0 55 42 180 0.821 5.27 1.02 PlyA + 1323 1328 6 1.05 2.00 Prom + 18209 18248 40 -3.56 2.01 Sngl + 40630 41019 390 0 0 73 48 149 0.861 5.62 2.02 PlyA + 42039 42044 6 1.05 3.04 PlyA - 42450 42445 6 1.05 3.03 Term - 46075 46015 61 0 1 59 43 72 0.452 -2.92 3.02 Intr - 48578 47485 1094 2 2 15 53 352 0.052 13.21 3.01 Init - 50440 49403 1038 1 0 44 41 365 0.056 21.40 3.00 Prom - 50533 50494 40 -4.96 4.02 PlyA - 50702 50697 6 1.05 4.01 Sngl - 51954 50929 1026 0 0 88 43 555 0.998 48.09 4.00 Prom - 53653 53614 40 -3.36 5.06 PlyA - 53886 53881 6 1.05 5.05 Term - 57404 57280 125 0 2 64 48 72 0.063 -0.65 5.04 Intr - 73428 73280 149 2 2 61 106 74 0.323 6.38 5.03 Intr - 77922 77810 113 0 2 86 74 9 0.198 -1.52 5.02 Intr - 81062 81030 33 2 0 133 113 8 0.476 6.22 5.01 Init - 86665 86630 36 1 0 66 75 39 0.404 0.41 5.00 Prom - 87958 87919 40 -8.06 6.00 Prom + 91061 91100 40 -0.86 6.01 Init + 94602 94677 76 2 1 75 70 62 0.784 2.35 6.02 Term + 96539 96657 119 0 2 95 49 40 0.310 -0.50 6.03 PlyA + 99184 99189 6 1.05 7.04 PlyA - 101105 101100 6 1.05 7.03 Term - 102372 101797 576 0 0 78 34 666 0.993 54.67 7.02 Intr - 104448 104261 188 1 2 67 80 347 0.710 31.21 7.01 Init - 105824 105605 220 2 1 85 75 251 0.964 22.29 7.00 Prom - 107381 107342 40 -3.26 8.00 Prom + 126399 126438 40 -2.46 8.01 Init + 131156 131270 115 1 1 51 18 122 0.119 -0.12 8.02 Intr + 133485 133627 143 1 2 107 -14 89 0.102 0.67 8.03 Intr + 142751 142877 127 1 1 73 102 66 0.397 6.75 8.04 Intr + 143130 143254 125 1 2 82 41 32 0.369 -1.80 8.05 Term + 143805 143936 132 2 0 16 32 174 0.456 2.69 8.06 PlyA + 144234 144239 6 1.05 9.00 Prom + 146537 146576 40 -3.76 9.01 Init + 152203 152319 117 0 0 51 46 188 0.607 9.11 9.02 Intr + 165302 165500 199 1 1 42 96 202 0.038 15.32 9.03 Intr + 168948 169076 129 1 0 80 106 93 0.997 10.97 9.04 Term + 177286 178088 803 2 2 107 49 653 0.993 56.71 9.05 PlyA + 179330 179335 6 1.05 10.00 Prom + 180501 180540 40 -2.76 10.01 Init + 182272 182419 148 0 1 43 84 93 0.646 4.65 10.02 Intr + 184099 184279 181 2 1 49 115 65 0.341 4.23 10.03 Term + 196454 196484 31 2 1 122 43 42 0.032 0.43 10.04 PlyA + 197306 197311 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:117032212_117238056|GENSCAN_predicted_peptide_1|71_aa MREDPNKQNQNEKGDITTDTAEIQRIISGYYEQLHASKLENLEEMDKFLDTYNLPRLNHE EIQNLNRPNNK >gi568815588r:117032212_117238056|GENSCAN_predicted_CDS_1|216_bp atgagagaagatccaaataaacaaaatcagaatgaaaaaggagacattacaactgatact gcagaaattcaaaggatcattagtggatactatgagcaactacatgccagtaaattggaa aatctagaagaaatggacaaattcctagacacatacaatctaccaagactgaatcatgaa gaaatccaaaacctgaacagaccaaataacaagtaa >gi568815588r:117032212_117238056|GENSCAN_predicted_peptide_2|129_aa MDPNQEEIPDLLEKEFRRLVIKLIWEAPEKGKAQCKEIQKMIQEVKGEKFKEIDSINKKI SKVQDTLDTLTEMQNALESLSNRIEQVKERNSELKDKVFELTQSNKDKEKRIRKYKGSKK FGIMLNDQT >gi568815588r:117032212_117238056|GENSCAN_predicted_CDS_2|390_bp atggatccaaaccaagaagaaatccctgatttacttgaaaaagaatttaggaggttagtt attaagctaatctgggaggcaccagagaaaggcaaagcccaatgcaaggaaatccaaaaa atgatacaagaagtgaagggagaaaaatttaaggaaatagatagcataaataaaaaaata tcaaaagttcaggacacattggacacacttacagaaatgcaaaatgctctggaaagtctc agcaatagaattgaacaagtaaaagaaagaaattcagagctcaaagacaaggtcttcgaa ttaacccaatccaacaaagacaaagaaaaaagaataagaaaatacaaaggctccaagaaa tttgggattatgttaaacgaccaaacctaa >gi568815588r:117032212_117238056|GENSCAN_predicted_peptide_3|730_aa MGDFNTPLSTLDRSMRQKFNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDAIKNDKGDVTTDPTEVQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITGSEIVAIINSLPPKKSPGPDGFTAEFYQRYKEELRIKYLGIQLTRDVK DLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFT ELEKTTLKFIWNQKRARITKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQ WNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTK INSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKDKIDKWDLIKLKS FCTAKETTIRVNRQPTKWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNR HFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRWLTDGKRVF DCALSSESYR >gi568815588r:117032212_117238056|GENSCAN_predicted_CDS_3|2193_bp atgggagactttaacaccccactgtcaacattagacagatcaatgagacagaaattcaac aaggatacccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaacagagtatacatttttttcagcaccacaccacacctattcc aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagcgcaatcaaactagaactcaggattaagaatctc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagacctctagcaagactaataaag aaaaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggatgtcaccacc gatcccacagaagtacaaactaccatcagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaatggataaattccttgacacatacactctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggatctgaaattgtggcaataatc aatagcttaccacccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactgagaataaaatacctaggaatccaacttacaagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaa tggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactg cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcaccaag tcaatcctaagccaaaagaacaaagctggaggcatcacactgcctgacttcaaactatac tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa tggaacagaacagagccctcagaaataacgccgcatatctacaactatctgatctttgac aaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa atcaattcaagatggattaaagacttaaacgttagacctaaaaccataaaaaccctagaa gaaaacctaggcattaccattcaggacataggcatgggcaaggacttcatgtctaaaaca ccaaaagcaatggcaacaaaagacaaaattgacaaatgggatctaattaaactaaagagc ttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaaaatgggagaaa attttcacaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaa caaatttacaagaaaaagacaaacaaccccatcaaaaagtgggcgaaggacatgaacaga cacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaagtgctcatcatca ctggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcacaccagttaga atggcaatcattaaaaagtcaggaaacaacagatggcttacagatgggaagcgtgtcttc gattgtgctttgtcttcagagtcataccggtag >gi568815588r:117032212_117238056|GENSCAN_predicted_peptide_4|341_aa MGKKQNRKTGNSKNKSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELWEDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA MEDEMNEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLTGVPESDGENGTKLENT LQDIIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKILRAAREK GRVTLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISCPAKLSFISEGEIKYF TDKQMLRDFVTTRTALKELLKEALNMERNNRYQPLQNHAKM >gi568815588r:117032212_117238056|GENSCAN_predicted_CDS_4|1026_bp atggggaaaaaacagaacagaaaaactggaaactctaaaaataagagcgcctctcctcct ccaaaggaacgcagctcctcaccagcaacggaacaaagctggatggagaatgactttgac gagctgagagaagaaggcttcagaagatcaaattactctgagctatgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaacacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca atggaagatgaaatgaatgaaatgaatgaaatgaagcgagaaggaaagtttagagaaaaa agaataaaaagaaacgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaat ctacgtctgactggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacact ctgcaggatattatccaggagaacttccccaatctagcaaggcaggccaatgttcagatt caggaaatacagagaacaccacaaagatactcctcgagaagagcaactccaagacacata attgtcagattcaccaaagttgaaatgaaggaaaaaatattaagggcagccagagagaaa ggtcgggttaccctcaaagggaagcccatcagactaacagcggatctctcggcagaaacc ctacaagccagaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaa cccagaatttcatgtccagccaaactaagcttcataagtgaaggagaaataaaatacttt acagacaagcaaatgctgagagattttgtcaccaccaggactgccctaaaagagctcctg aaggaagcgctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaa atgtaa >gi568815588r:117032212_117238056|GENSCAN_predicted_peptide_5|151_aa MVVCCTDHRITQGKTEFVLASFKKEYGSEGYERCYREQKLVAVIILAILTLTTSRIQYKL RKYTVIMVGLSLDYVCKTYSSKRVLCLLEIGVKKTRILVLALPNASCVTVEVLNLDVVQF IFTFVACAFDALSKKIIAKSSVMKLLLYVVF >gi568815588r:117032212_117238056|GENSCAN_predicted_CDS_5|456_bp atggtggtttgctgcactgatcatcgcatcacccagggcaaaaccgaatttgtgttagca tctttcaagaaagaatacggttctgagggatatgagagatgttatagagaacagaaatta gttgcagttatcatattagctattttgaccttaactaccagcagaatccagtacaaactc agaaaatatactgtcatcatggttggtttatctttggactacgtttgtaaaacctacagt agcaaacgggttctatgcttattagaaataggagtaaaaaagaccaggattctggtgctg gctctgccaaatgccagctgtgtgactgtggaagttcttaatcttgatgtagttcagttt atttttacttttgttgcctgtgcttttgacgccctatccaaaaaaatcattgccaaatcc agcgtcatgaagcttttgctctatgttgtcttttaa >gi568815588r:117032212_117238056|GENSCAN_predicted_peptide_6|64_aa MAQAPRLSRPWALLPSGLRPLPQFTELMSLQTEAFSSNQEGCASECFSANLKGSPPPFPK LPGL >gi568815588r:117032212_117238056|GENSCAN_predicted_CDS_6|195_bp atggcgcaggctccacgcctttcccgcccttgggccctcctgccttccgggctgaggcct ctgccccaattcacagaactcatgtctcttcaaacagaagctttcagttccaaccaagaa gggtgtgcatctgaatgcttttcagcaaacctgaagggctctcctcctcctttccccaaa ctaccaggtctgtaa >gi568815588r:117032212_117238056|GENSCAN_predicted_peptide_7|327_aa MDVRCHSDAEAARVSKNAHKESRESKGAEGNLPAAFLKEPQGAFSASGAAEDCNKSKSNS AADPDYCRRILVRDAKGSIREIILPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVG RERTELARQLNLSETQVKVWFQNRRTKQKKDQGKDSELRSVVSETAATCSVLRLLEQGRL LSPPGLPALLPPCATGALGSALRGPSLPALGAGAAAGSAAAAAAAAPGPAGAASPHPPAV GGAPGPGPAGPGGLHAGAPAAGHSLFSLPVPSLLGSVASRLSSAPLTMAGSLAGNLQELS ARYLSSSAFEPYSRTNNKEGAEKKALD >gi568815588r:117032212_117238056|GENSCAN_predicted_CDS_7|984_bp atggacgttcgatgccactcggacgccgaggctgcccgggtctcgaagaacgcgcacaag gagagtcgggagagcaagggcgcggaggggaacctcccagccgccttcctcaaggagccg cagggcgccttctcagcgtcgggcgctgctgaggattgtaacaaaagtaaatccaattcc gcagcggacccggattactgccgccggatcctggtccgagatgccaaggggtccatccga gagatcatcctgcccaagggcctggacttggaccggcctaagaggacgcgcacgtccttc accgcggagcagctctatcggctggagatggagttccagcgctgccagtacgtggtgggc cgcgagaggaccgagctcgcccggcagcttaacctctccgagacccaggtgaaggtctgg ttccagaaccggcgcaccaagcagaagaaggaccagggcaaggactcggagctacgctcg gtggtgtcggagaccgcggccacgtgcagcgtgctacggctgctggagcagggccgcctg ttgtcgccgcccggcctgcctgcgctgctgccgccttgcgccacgggcgctctcggctca gcgctgcgcgggcccagcttgccggccctgggcgcgggcgccgctgcaggctcggccgcc gcagccgccgccgccgccccgggcccagcgggcgctgcatccccgcacccgccggctgtg ggcggtgctccaggtcccgggcccgccgggccggggggattgcacgcaggcgccccggcc gcgggccacagcctcttcagcctgccggtgccctcgctgctcggctccgtcgccagccgc ctgtcctccgccccgttaacaatggctggttcgctagctgggaatttgcaagaactctcc gcccgatatctgagctcctcggccttcgagccttactcccggaccaacaataaagaaggg gccgagaaaaaagcgctggactga >gi568815588r:117032212_117238056|GENSCAN_predicted_peptide_8|213_aa MQPRGPRLLPPVLSFLDAREASRSPLVAMLSGPRRGPARPSQWSKTNPVVAAFTKERQPG RPHRRQRADREDRALFRMLRGEGAVGRTAFPPPPTSFMSWEEAKSSSIVGAPSGGSFRKR SSPEDFAEDRSQPRKQIGSFRGAGTAPPPAFPLASGRDWSASWRPVGRGQWTGDNCYRFQ REKLENLKLSISTNLRNMGGHRLDTTAPLRQLI >gi568815588r:117032212_117238056|GENSCAN_predicted_CDS_8|642_bp atgcaaccgcgtgggccccgcctgctccccccggtcctgtctttcctcgacgctcgggag gcgtcccgcagccccctagtcgcgatgctctcaggtccccggcgtggaccagcacggcca agtcagtggtcaaaaaccaacccggtggtagctgcctttaccaaggaaagacagccagga cgcccgcaccgacggcaacgcgcggacagagaggaccgtgccctctttcgcatgttacgg ggcgagggggctgttggcagaacagcatttcccccgcccccgacttcctttatgagctgg gaggaagcaaaatccagctctattgttggagctccctctggtggcagtttccgaaaacga tccagccctgaggattttgcggaagatcggtctcagccccgcaagcagattggcagcttc cggggtgctgggacggcgccccctcctgccttcccgctagcatctggcagggactggagt gcttcctggagacccgtaggccggggacagtggactggggataattgctaccgctttcaa cgagagaaactcgagaatctgaaactcagtatttctacgaatttgcgcaacatgggaggt catcgcctggacaccactgcccccttgcggcaactcatctaa >gi568815588r:117032212_117238056|GENSCAN_predicted_peptide_9|415_aa MFSSLLTALPLPLPLPLRVLVNSVVPAPASEVAALLENQARRCCPEALGKLFPGLCFLCF LVTYALVGAVVFSAIEDGQVLVAADDGEFEKFLEELCRILNCSETVVEDRKQDLQGHLQK VKPQWFNRTTHWSFLSSLFFCCTVFSTVGYGYIYPVTRLGKYLCMLYALFGIPLMFLVLT DTGDILATILSTSYNRFRKFPFFTRPLLSKWCPKSLFKKKPDPKPADEAVPQIIISAEEL PGPKLGTCPSRPSCSMELFERSHALEKQNTLQLPPQAMERSNSCPELVLGRLSYSIISNL DEVGQQVERLDIPLPIIALIVFAYISCAAAILPFWETQLDFENAFYFCFVTLTTIGFGDT VLEHPNFFLFFSIYIIVGMEIVFIAFKLVQNRLIDIYKNVMLFFAKGKFYHLVKK >gi568815588r:117032212_117238056|GENSCAN_predicted_CDS_9|1248_bp atgttctccagcctcctgactgcgctgccgctgccgctgccgctaccgctgcgtgtcctc gtaaactcagtggttccagcgccagcctctgaggtagcagctctcctggagaaccaggcc aggagatgctgcccagaggccctgggaaagctcttccctggcctctgcttcctctgcttt ctggtgacctacgccctggtgggtgctgtggtcttctctgccattgaggacggccaggtc ctggtggcagcagatgatggagagtttgagaagttcttggaggagctctgcagaatcttg aactgcagtgaaacagtggtggaagacagaaaacaggatctccaggggcatctgcagaag gtgaagcctcagtggtttaacaggaccacacactggtccttcctgagctcgctctttttc tgctgcacggtgttcagcaccgtgggctatggctacatctaccccgtcaccaggcttggc aagtacttgtgcatgctctatgctctctttggtatccccctgatgttcctcgttctcacg gacacaggcgacatcctggcaaccatcttatctacatcttataatcggttccgaaaattc cctttctttacccgccccctcctctccaagtggtgccccaaatctctcttcaagaaaaaa ccggaccccaagcccgcagatgaagctgtccctcagatcatcatcagtgctgaagagctt ccaggccccaaacttggcacatgtccttcacgcccaagctgcagcatggagctgtttgag agatctcatgcgctagagaaacagaacacactgcaactgcccccacaagccatggagagg agtaactcgtgtcccgaactggtgttgggaagactctcatactccatcatcagcaacctg gatgaagttggacagcaggtggagaggttggacatccccctccccatcattgcccttatt gtttttgcctacatttcctgtgcagctgccatcctccccttctgggagacacagttggat ttcgagaatgccttctatttctgctttgtcacactcaccaccattgggtttggggatact gttttagaacaccctaacttcttcctgttcttctccatttatatcatcgttggaatggag attgtgttcattgctttcaagttggtgcaaaacaggctgattgacatatacaaaaatgtt atgctattctttgcaaaagggaagttttaccaccttgttaaaaagtga >gi568815588r:117032212_117238056|GENSCAN_predicted_peptide_10|119_aa MQSTLATQVGAHDRTEGLFFQFNKHSLNAGYLPGSEYPKMNQQHLQKHREVPCSACTSHL QPPGTWTKAFGATHPAAVGVVPFSGPPALNAVLLVPSPPLEYEETEDQGRKPKNLESDV >gi568815588r:117032212_117238056|GENSCAN_predicted_CDS_10|360_bp atgcaatcaacactggccactcaagtaggagcccatgatcgcactgaagggctgttcttt cagttcaacaaacattcactgaatgctggctatcttccaggctcagagtatccaaagatg aatcagcagcatcttcagaaacacagagaagtgccctgctccgcttgcacatctcatctc cagcctcccggtacctggaccaaagcctttggtgcgactcacccagccgctgtgggggtc gtcccgttcagtggtccaccagccctgaatgctgtattgttagtgccctccccgccttta gaatacgaagaaactgaagatcagggacggaagccaaagaacctggagtctgatgtctaa