GENSCAN 1.0 Date run: 7-Nov-116 Time: 20:18:41 Sequence gi568815577r:33803529_34014502 : 210974 bp : 45.22% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 7447 7694 248 0 2 97 75 120 0.889 7.86 1.02 Intr + 10385 10544 160 2 1 104 103 72 0.694 10.29 1.03 Intr + 14739 14877 139 2 1 0 77 96 0.222 -0.26 1.04 Intr + 19959 20125 167 1 2 66 82 132 0.893 10.08 1.05 Intr + 23290 23335 46 0 1 76 119 3 0.629 0.18 1.06 Intr + 26096 26191 96 0 0 86 45 97 0.362 5.18 1.07 Intr + 32913 33104 192 1 0 96 101 405 0.997 42.06 1.08 Intr + 38216 38325 110 0 2 76 62 81 0.045 4.30 1.09 Intr + 40582 40777 196 0 1 19 52 73 0.013 -4.21 1.10 Intr + 53208 53329 122 1 2 82 94 208 0.956 21.01 1.11 Intr + 55158 55264 107 2 2 82 80 66 0.230 4.21 1.12 Intr + 61623 61806 184 0 1 98 78 462 0.840 45.99 1.13 Term + 63060 63167 108 2 0 57 42 98 0.751 0.61 1.14 PlyA + 68540 68545 6 1.05 2.00 Prom + 70697 70736 40 -6.46 2.01 Init + 71597 71728 132 1 0 58 4 196 0.807 8.34 2.02 Intr + 71826 71993 168 2 0 46 53 196 0.973 12.04 2.03 Intr + 77402 77515 114 1 0 104 100 33 0.709 6.74 2.04 Intr + 78715 78927 213 0 0 67 98 235 0.984 21.31 2.05 Intr + 80022 80143 122 2 2 80 100 118 0.998 11.49 2.06 Intr + 81513 81595 83 0 2 73 30 113 0.839 3.28 2.07 Intr + 81911 81994 84 0 0 57 91 57 0.726 2.69 2.08 Intr + 82759 82932 174 2 0 81 101 319 0.999 32.41 2.09 Intr + 84624 84768 145 1 1 77 21 214 0.792 12.94 2.10 Term + 87588 87699 112 0 1 74 43 114 0.734 3.53 2.11 PlyA + 92783 92788 6 1.05 3.05 PlyA - 95427 95422 6 1.05 3.04 Term - 98406 98165 242 1 2 106 34 138 0.351 6.39 3.03 Intr - 105683 105554 130 2 1 45 70 68 0.618 0.97 3.02 Intr - 108871 108761 111 1 0 68 92 98 0.632 8.78 3.01 Init - 112118 112014 105 2 0 59 16 144 0.428 4.62 3.00 Prom - 115286 115247 40 -3.86 4.02 PlyA - 115293 115288 6 1.05 4.01 Sngl - 124517 122895 1623 2 0 37 47 522 0.973 38.33 4.00 Prom - 125765 125726 40 -4.96 5.02 PlyA - 125934 125929 6 1.05 5.01 Sngl - 127178 126162 1017 2 0 88 43 554 0.976 47.94 5.00 Prom - 136814 136775 40 -3.86 6.07 PlyA - 137442 137437 6 1.05 6.06 Term - 145385 145270 116 0 2 94 41 50 0.076 -0.37 6.05 Intr - 158672 158591 82 2 1 64 80 42 0.059 0.21 6.04 Intr - 166022 165874 149 0 2 83 41 125 0.472 7.25 6.03 Intr - 177809 177721 89 1 2 101 73 36 0.198 3.01 6.02 Intr - 206221 206104 118 2 1 47 109 25 0.060 0.02 6.01 Init - 210285 210234 52 0 1 65 74 39 0.198 1.52 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577r:33803529_34014502|GENSCAN_predicted_peptide_1|624_aa VDESQTGEPGWLGGELKGKTGWFPANYAEKIPENEVPAPVKPVTDSTSAPAPKLALRETP APLAVTSSEPSTTPNNWADFSSTWPTSTNEKPETDNWDAWAAQPSLTVPSAGQLRQRSAF TPATATGSSPSPVLGQGEKVEGLQAQALYPWRAKKDNHLNFNKNDVITVLEQQDMWWFGE VQEFIAMYTYESSEQGDLTFQQGDVILVTKKDGDWWTGTVGDKAGVFPSNYVRLKDSEGS GTAGKTGSLGKKPEIAQVIASYTATGPEQLTLAPGQLILIRKKNPVCQVIGMYDYTAQND DELAFNKGQIINVLNKEDPDWWKGEVNGQVGLFPSNYVKLTTDMDPSQQFPKIPKRIFME MTLDANGPKNARKNSSFFEMSGIRGLICPTNPCTNPGPLPDSRRPTGGSKQLFKRLRCTP AMCRDLWTKDQSLLSRLLVDVDKLPVFHTVDGCSDLHLLDMLTPTERKRQGYIHELIVTE ENYVNDLQLVTEIFQKPLMESELLTEKEVAMIFVNWKELIMCNIKLLKALRVRKKMSGEK MPVKMIGDILSAQLPHMQPYIRFCSRQLNGAALIQQKTDEAPDFKEFVKLEQVPVPGDSY QVSGLQSMQLPVSKALNQTTAFDS >gi568815577r:33803529_34014502|GENSCAN_predicted_CDS_1|1875_bp gtggatgaaagccaaactggagaacccggctggcttggaggagaattaaaaggaaagaca gggtggttccctgcaaactatgcagagaaaatcccagaaaatgaggttcccgctccagtg aaaccagtgactgattcaacatctgcccctgcccccaaactggccttgcgtgagaccccc gcccctttggcagtaacctcttcagagccctccacgacccctaataactgggccgacttc agctccacgtggcccaccagcacgaatgagaaaccagaaacggataactgggatgcatgg gcagcccagccctctctcaccgttccaagtgccggccagttaaggcagaggtccgccttt actccagccacggccactggctcctccccgtctcctgtgctaggccagggtgaaaaggtg gaggggctacaagctcaagccctatatccttggagagccaaaaaagacaaccacttaaat tttaacaaaaatgatgtcatcaccgtcctggaacagcaagacatgtggtggtttggagaa gttcaagaatttattgccatgtacacttacgagagttctgagcaaggagatttaaccttt cagcaaggggatgtgattttggttaccaagaaagatggtgactggtggacaggaacagtg ggcgacaaggccggagtcttcccttctaactatgtgaggcttaaagattcagagggctct ggaactgctgggaaaacagggagtttaggaaaaaaacctgaaattgcccaggttattgcc tcatacaccgccaccggccccgagcagctcactctcgcccctggtcagctgattttgatc cgaaaaaagaacccagtgtgccaggtgattgggatgtacgactacaccgcgcagaatgac gatgagctggccttcaacaagggccagatcatcaacgtcctcaacaaggaggaccctgac tggtggaaaggagaagtcaatggacaagtggggctcttcccatccaattatgtgaagctg accacagacatggacccaagccagcaatttccaaaaatacccaagagaatttttatggaa atgacattggatgccaatggtcccaaaaatgcaaggaaaaatagctccttctttgaaatg tctggtattcggggtctgatttgccctaccaatccgtgcacgaaccctggcccccttcct gatagcaggaggcccaccggtggaagcaagcagctttttaaacgccttcgctgcactcct gcaatgtgcagagacctctggaccaaagaccagagcctactcagcagactcttggttgat gtggacaagctgcctgtgtttcacacggtcgacgggtgttcagacttacatctcttggat atgttgaccccaactgaaagaaagcgacaaggatacatccacgagctcattgtcaccgag gagaactatgtgaatgacctgcagctggtcacagagatttttcaaaaacccctgatggag tctgagctgctgacagaaaaagaggttgctatgatttttgtgaactggaaggagctgatt atgtgtaatatcaaactactaaaagcgctgagagtccgcaagaagatgtccggggagaag atgcctgtgaagatgattggagacatcctgagcgcacagctgccgcacatgcagccctac atccgcttctgcagccgccagctcaacggggctgccctgatccagcagaagacggatgag gccccagacttcaaggagttcgtcaaactggagcaggtcccggtccctggagacagttat caggtctcagggctccagtcaatgcagctgcccgtttctaaggcattaaaccagaccaca gcctttgacagctaa >gi568815577r:33803529_34014502|GENSCAN_predicted_peptide_2|448_aa MGPFTASRNATHWSHQGVAVICYVIAMWVLRAAIAQVGAVHERMILENTPENHPDHSHLK HALEKAEELCSQVNEGVREKENSDRLEWIQAHVQCEGLSENLVEVMDPREVLEPRFSAHR VSKGQWREGAVPFLFAERQLVFNSVTNCLGPRKFLHSGKLYKAKSNKELYGFLFNDFLLL TQITKPLGSSGTDKVFSPKSNLQYKMYKTPIFLNEVLVKLPTDPSGDEPIFHISHIDRVY TLRAESINERTAWVQKIKAASELYIETEKKKREKAYLVRSQRATGIGRLMVNVVEGIELK PCRSHGKSNPYCEVTMGSQCHITKTIQDTLNPKWNSNCQFFIRDLEQEVLCITVFERDQF SPDDFLGRTEIRVADIKKDQGSKGPVTKCLLLHEVPTGEIVVRLDLQLFDEPWMDSIISW NQVCCDHESLKMWLIRIQAWTNLERGTF >gi568815577r:33803529_34014502|GENSCAN_predicted_CDS_2|1347_bp atgggaccattcactgcttcccgcaatgccacccactggagccatcaaggagtggccgtc atctgctacgtgattgccatgtgggtgctcagagctgcgattgctcaagtgggcgccgtc catgaaaggatgatcctggaaaacacccctgaaaaccacccggaccacagccacttgaag cacgccctggagaaggcggaagagctctgttcccaggtgaacgaaggggtgcgggagaag gagaactctgaccggctggagtggatccaggcccacgtgcagtgtgaaggcctgtctgag aacctagtggaggtgatggacccccgagaggtgttagagccgcgtttcagcgcccaccgt gtctctaagggtcagtggagggagggagctgttcccttcctgtttgctgagcggcaactt gtgttcaattcagtgaccaattgcttggggccgcgcaaatttctgcacagtgggaagctc tacaaggccaagagcaacaaggagctgtatggcttccttttcaacgacttcctcctgctg actcagatcacgaagcctttggggtcttctggcaccgacaaagtcttcagccccaaatca aacctgcagtataaaatgtataaaacacctattttcctaaatgaggttctagtaaaatta cccaccgacccttctggagacgagcccatcttccacatctcccacattgaccgcgtctat actctccgagcagaaagcataaatgaaaggactgcctgggtgcagaaaatcaaagctgct tctgaactctacatagagactgagaaaaagaagcgcgagaaagcgtacctggtccgttcc caaagggcaacaggcattggaaggttgatggtgaacgtggttgaaggcatcgagttgaaa ccctgtcggtcacatggaaagagcaacccgtactgtgaggtgaccatgggttcccagtgc cacatcaccaagacgatccaggacactctgaaccccaagtggaattccaactgccagttc ttcatccgagacctggagcaggaagtcctctgcatcactgtgttcgagagggaccagttc tcaccagatgattttttgggtcggacggagatccgtgtggcggacatcaagaaagaccag ggctccaaaggtccagttacgaagtgtcttctgctgcacgaagtccccacgggagagatt gtggtccgcttggacctgcagttgtttgatgagccgtggatggacagcatcatctcctgg aaccaggtttgctgtgaccatgagagcctgaagatgtggctcattcggatccaggcctgg actaatctggaaagagggactttctag >gi568815577r:33803529_34014502|GENSCAN_predicted_peptide_3|195_aa MLRLGKSGSLQGYGTWRVGATQDDPAFVPRDAYGAPPVQVYGIEGRYATALYSAASKQNK LEQVEKELLRVAQILKEPKVAASVLNPYVKRSIKVKSLNDITAKERFSPLTTNLIMRPAL QKVASYTQSQLLQTCDCQWTNRLCGSLMLLMLPRSFRVKLSKLPVDQAHTILGNDEALHM SPYEVTRENLEIYVQ >gi568815577r:33803529_34014502|GENSCAN_predicted_CDS_3|588_bp atgcttcggctggggaagtctggcagtcttcaaggttatgggacctggcgagtgggagcg actcaggacgacccagcgtttgtcccccgggatgcctacggggcacctcctgttcaggta tacggtattgaaggtcgctatgccacagctctttattctgctgcatcaaaacagaataag ctggagcaagtagaaaaggagttgttgagagtagcacaaatcctgaaggaacccaaagtg gctgcttctgttttgaatccctatgtgaagcgttccattaaagtgaaaagcctaaatgac atcacagcaaaagagaggttctctcccctcactaccaatctgatcatgaggcctgctctg cagaaggtggcgtcatacacacaatcacagctcctacagacatgtgactgccaatggaca aaccgtctttgcgggagcctcatgctgctgatgctgcccagatccttcagagtcaaactc agcaagcttcctgttgatcaagcacatacaatcttggggaatgatgaggcccttcatatg tcgccttacgaggtcaccagagagaacctagagatctatgtacaatga >gi568815577r:33803529_34014502|GENSCAN_predicted_peptide_4|540_aa MNIDAKILNKILANGIQQHIKKLIHHDQVGFIPGMQGCFNIRKSINLIQHINRAKDKNHM IISIDAEKAFDKIQQPFMLKTLNKLGIDGMYFKIIRAIYDKPTANIILNGQKLEAFPLKT GTRQGWPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDIIVYLENPIVSA QNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIVSELPFTIASKRIKYLGIQLTR DVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMP FFTELEKTTLKFIWNQKRARIAKSILSQKDKAGGITLPDFKLYYKATVTKTAWYWYQNRD IDQWNRTEPSEITPHIYNYLIFDQPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTP YTKLNSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMCKTPKAMATKAKIDKWDLIK LKSFCTAKETTIRVNRQPAKLEKIFATYSSDKGLISRIYNELKQMYKKKQTTPSKSGRRT >gi568815577r:33803529_34014502|GENSCAN_predicted_CDS_4|1623_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaacggaatccagcagcacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctgcttcaat atacgcaaatcaataaatctaatccagcatataaacagagccaaagacaaaaaccacatg attatctcaatagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaa actctcaataaattaggtattgatgggatgtatttcaaaataataagagctatctatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacagggatggcctctctcaccactcctattcaacatagtgttggaagttctg gccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtc aaattgtccctgtttgcagacgacataattgtatatctagaaaaccccattgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgta caaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcgtgagt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggat acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatg gccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatgcct ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgc atcgccaagtcaatcctaagccaaaaggacaaagctggaggcatcacactacctgacttc aaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagat atagatcaatggaacagaacagagccctcagaaataacgccgcatatctacaactatctg atctttgaccaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatgg tgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacacct tatacaaaactcaattcaagatggattaaagacttaaacgttagacctaaaaccataaaa accctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttcatg tgtaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaa ctaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctgcaaaa ttggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatctacaat gaactcaaacaaatgtacaagaaaaaacaaacaaccccatcaaaaagtgggcgaaggaca tga >gi568815577r:33803529_34014502|GENSCAN_predicted_peptide_5|338_aa MGKKQNRKTGSSKMQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEEYITRITNAEKCLKELMELKTKARELREECRSLRSRWDQLEERVSV MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV TLKGKPIRLTADLLAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK QMLSDFVTTRPALKELLKEALNMERNNRYQPLQNHAKM >gi568815577r:33803529_34014502|GENSCAN_predicted_CDS_5|1017_bp atggggaaaaaacagaacagaaaaactggaagctctaaaatgcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggtaaagaagttgaaaactttgaaaaaaatttagaagaatatataactagaata accaatgcagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgggatcaactggaagaaagggtatcagtg atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatttgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaatactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt accctcaaagggaagcccatcagactaacagcggatctcttggcagaaaccctacaagcc agaagagagtgggggccgatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagcgaaggagaaataaaatactttacagacaag caaatgctgagcgattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcg ctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815577r:33803529_34014502|GENSCAN_predicted_peptide_6|201_aa MPFAFHHDYEASPAKWNSRSRSCRIPYGPLSQSDPVLLPLCWSVGLTHADGVFLASRISH RAISFYRKNPNTLRFNHVKSACVWKYMDSLVAADIIFYNHIYILAFLFQVVLLRNCSNFQ ELKMYCLKSNSCRDFQKNFQINLFLSNSTPSGPGSSAVAETTTGSKRHRQMVLWWSPAQK MATSGAKLPQQQALSTAASNR >gi568815577r:33803529_34014502|GENSCAN_predicted_CDS_6|606_bp atgccttttgccttccaccatgattatgaggcctccccagccaagtggaactccaggagt cggtcctgccgtattccttacggaccactgagccagtcagatccagtgcttcttcccctg tgctggtcagtggggctgacccatgcagatggtgttttcctggcttccaggattagtcac cgggccatatccttttatcgcaagaacccaaatactcttagattcaaccacgtgaaatct gcatgtgtgtggaagtatatggacagtttggtggccgcagacatcatcttctacaaccac atctacatccttgcctttctgtttcaagttgttctcttgagaaactgttcaaattttcag gagctgaaaatgtactgccttaaatccaactcatgccgagatttccagaagaacttccaa atcaacctgtttctctccaactccactcccagcggcccaggttcttctgctgttgctgag acaaccacaggaagtaaaaggcacagacagatggtgctctggtggagccctgcccagaaa atggccacttccggagcaaaactgccacagcaacaggccttgtcaacagcagcctccaac cgttaa