GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:50:18 Sequence gi568815594f:155113940_155315082 : 201143 bp : 35.66% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14138 14201 64 1 1 58 33 117 0.446 4.56 1.02 Intr + 15430 15510 81 2 0 54 98 53 0.346 1.59 1.03 Intr + 20394 20534 141 1 0 38 83 75 0.440 1.40 1.04 Term + 21565 21782 218 2 2 44 55 127 0.759 1.32 1.05 PlyA + 22236 22241 6 1.05 2.00 Prom + 28209 28248 40 -2.65 2.01 Init + 42710 42793 84 1 0 75 86 37 0.246 3.07 2.02 Term + 43278 43454 177 2 0 33 49 156 0.325 2.90 2.03 PlyA + 44516 44521 6 1.05 3.06 PlyA - 45441 45436 6 1.05 3.05 Term - 49350 49247 104 1 2 108 42 54 0.472 0.26 3.04 Intr - 53780 53584 197 1 2 57 83 153 0.396 9.84 3.03 Intr - 61185 61127 59 0 2 55 98 15 0.092 -4.04 3.02 Intr - 62925 62806 120 0 0 57 37 183 0.771 9.87 3.01 Init - 64027 64016 12 1 0 72 107 -8 0.623 -0.27 3.00 Prom - 80658 80619 40 -5.85 4.00 Prom + 80912 80951 40 -4.85 4.01 Init + 90208 90255 48 0 0 56 100 48 0.807 3.80 4.02 Intr + 94032 94157 126 2 0 47 45 149 0.187 6.46 4.03 Intr + 95001 95126 126 2 0 25 82 118 0.146 4.86 4.04 Term + 99953 101146 1194 1 0 122 49 1281 0.778 118.32 4.05 PlyA + 102178 102183 6 1.05 5.04 PlyA - 102395 102390 6 1.05 5.03 Term - 119350 119181 170 2 2 48 43 99 0.057 -1.54 5.02 Intr - 123030 122782 249 1 0 -23 75 220 0.106 6.09 5.01 Init - 135510 135396 115 0 1 28 54 118 0.150 2.82 5.00 Prom - 137953 137914 40 -5.55 6.05 PlyA - 138015 138010 6 1.05 6.04 Term - 139432 139125 308 2 2 71 36 173 0.715 4.69 6.03 Intr - 139807 139584 224 0 2 22 57 207 0.296 7.85 6.02 Intr - 170397 168569 1829 0 2 -7 53 538 0.456 27.17 6.01 Init - 171659 170490 1170 2 0 44 41 505 0.461 34.97 6.00 Prom - 175685 175646 40 -4.45 7.02 PlyA - 176214 176209 6 1.05 7.01 Term - 185302 184884 419 2 2 82 52 197 0.824 9.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 58662 58604 59 0 2 83 45 62 0.861 2.23 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:155113940_155315082|GENSCAN_predicted_peptide_1|167_aa MPARENEEDAKAESPDPTIRSCLYLTYSEYPIHVCTDILEIDYGWLSAGDEGSSAAREMK QGLVMTNSAPNNKHLGNSPKGFLKGIGNTQRVTSRGTALYLPTVHVCTSNIARQEGVVKS LGKQEENICMTRVGTCKEKSEVLVGLKTMVAADKGVRSAGVDTDFLM >gi568815594f:155113940_155315082|GENSCAN_predicted_CDS_1|504_bp atgcctgcaagagaaaatgaggaagatgcaaaagcggaaagccctgatccaaccatcaga tcttgtctctacctcacatatagtgagtacccaatacatgtttgtacggacatcttagaa attgattatggttggctgagtgcaggggatgaagggtcaagtgcagcaagagaaatgaag caaggattggttatgacaaactctgctcccaacaacaagcatcttgggaattctccaaaa ggctttctaaaaggcattgggaatactcagagggtaacaagcagagggacagccttatac ttaccaactgtccatgtatgtacttccaatattgcaaggcaggaaggggtagtgaaatcc ttgggtaaacaggaagaaaacatttgtatgaccagggttgggacctgtaaggagaagagt gaagtcttggtcggattaaaaacaatggtggctgcagacaaaggtgtcagaagtgctggg gttgatacagatttcttgatgtga >gi568815594f:155113940_155315082|GENSCAN_predicted_peptide_2|86_aa MLVEMETAKARLMRSQVKMKTLLGIGVKDTAPCIPAAVAPALALQAPDTDWAAALRTQAA GNLDSFHVELDRECKSKGGFNASNWI >gi568815594f:155113940_155315082|GENSCAN_predicted_CDS_2|261_bp atgctggtagaaatggagacagcaaaggccaggctgatgaggtctcaggtgaaaatgaag acgttattgggaattggagtaaaggacacagctccttgcatcccagcagctgtggctcca gccttggctctacaggccccagatacagattgggctgctgctttgagaacacaagcagct ggaaaccttgatagcttccatgtggagttagacagagaatgcaagagtaaaggaggcttc aatgcttccaactggatttag >gi568815594f:155113940_155315082|GENSCAN_predicted_peptide_3|163_aa MPYEVQLTPLEDAAARHHFGRDTGSPPDTKSAGALILDFVAVTTSYFVQDSLKLTMENYD RRASFLQSVETVGLKFCVVPDSARLIVHYDGVSDLEPLLWGKVELCSEKKLKIRAKNFKK EELVEKEHEHIIMCDERLTWWFMLFSQVAAMKGKSTRTTENST >gi568815594f:155113940_155315082|GENSCAN_predicted_CDS_3|492_bp atgccatatgaggtacagctcacccctctggaagatgcagcagcaaggcaccattttgga agggacactgggtccccgccagataccaagtctgccggtgccctgatcttggactttgta gccgtcaccacttcatattttgtgcaagactctctcaaactaactatggaaaattatgat agaagagcaagttttctgcaaagtgtagagaccgtggggctaaagttttgtgttgttcct gattctgctagactcatagtgcattatgatggagtgagtgacttagaaccactgttgtgg ggaaaagtggaactttgcagtgaaaagaagctgaagataagagcaaagaattttaagaag gaggaattggtggagaaagaacatgaacacattatcatgtgtgacgagcggcttacatgg tggtttatgttgttcagccaggttgctgctatgaagggaaagtccacaagaaccacagag aactcaacctga >gi568815594f:155113940_155315082|GENSCAN_predicted_peptide_4|497_aa MAYKLLNNLSGTADIKTPVREIADHGQQDLNSLYLLVWSTGTAQLEEHQRTAPQPWAREG TEPDLPLGTFQGPLQVGWLIIGQTDCTHLVSASPQKREVQVVDSCAGCRPSGPVLKMGPI GAEADENQTVEEMKVEQYGPQTTPRGELVPDPEPELIDSTKLIEVQVVLILAYCSIILLG VIGNSLVIHVVIKFKSMRTVTNFFIANLAVADLLVNTLCLPFTLTYTLMGEWKMGPVLCH LVPYAQGLAVQVSTITLTVIALDRHRCIVYHLESKISKRISFLIIGLAWGISALLASPLA IFREYSLIEIIPDFEIVACTEKWPGEEKSIYGTVYSLSSLLILYVLPLGIISFSYTRIWS KLKNHVSPGAANDHYHQRRQKTTKMLVCVVVVFAVSWLPLHAFQLAVDIDSQVLDLKEYK LIFTVFHIIAMCSTFANPLLYGWMNSNYRKAFLSAFRCEQRLDAIHSEVSVTFKAKKNLE VRKNSGPNDSFTEATNV >gi568815594f:155113940_155315082|GENSCAN_predicted_CDS_4|1494_bp atggcttacaagttactcaataatctctcagggaccgctgatattaagacacctgttagg gaaattgctgatcatgggcagcaggatctgaactcgctttaccttctcgtttggagcaca gggaccgcccagctagaggagcaccagcgcactgcgccccagccctgggcgagggagggg acggaaccggacttgcctttgggcaccttccagggccctctccaggtcggctggctaatc atcggacagacggactgcacacatcttgtttccgcgtctccgcaaaaacgcgaggtccag gttgtagactcttgtgctggttgcaggccaagtggacctgtactgaaaatgggtccaata ggtgcagaggctgatgagaaccagacagtggaagaaatgaaggtggaacaatacgggcca caaacaactcctagaggtgaactggtccctgaccctgagccagagcttatagatagtacc aagctgattgaggtacaagttgttctcatattggcctactgctccatcatcttgcttggg gtaattggcaactccttggtgatccatgtggtgatcaaattcaagagcatgcgcacagta accaactttttcattgccaatctggctgtggcagatcttttggtgaacactctgtgtcta ccgttcactcttacctataccttaatgggggagtggaaaatgggtcctgtcctgtgccac ctggtgccctatgcccagggcctggcagtacaagtatccacaatcaccttgacagtaatt gccctggaccggcacaggtgcatcgtctaccacctagagagcaagatctccaagcgaatc agcttcctgattattggcttggcctggggcatcagtgccctgctggcaagtcccctggcc atcttccgggagtattcgctgattgagatcatcccggactttgagattgtggcctgtact gaaaagtggcctggcgaggagaagagcatctatggcactgtctatagtctttcttccttg ttgatcttgtatgttttgcctctgggcattatatcattttcctacactcgcatttggagt aaattgaagaaccatgtcagtcctggagctgcaaatgaccactaccatcagcgaaggcaa aaaaccaccaaaatgctggtgtgtgtggtggtggtgtttgcggtcagctggctgcctctc catgccttccagcttgccgttgacattgacagccaggtcctggacctgaaggagtacaaa ctcatcttcacagtgttccacatcatcgccatgtgctccacttttgccaatccccttctc tatggctggatgaacagcaactacagaaaggctttcctctcggccttccgctgtgagcag cggttggatgccattcactctgaggtgtccgtgacattcaaggctaaaaagaacctggag gtcagaaagaacagtggccccaatgactctttcacagaggctaccaatgtctaa >gi568815594f:155113940_155315082|GENSCAN_predicted_peptide_5|177_aa MFAQEQMAFLALYSESEVTHLTLHLNSSWVTPDETIGKAVTLTKRDCGFTAEVSETTNPP GGANNSGREEQTTPDAPPLRAVTLTVKVFGFTPEISETTNPPGGANNSRRATFKNCNTHG EGLLPIIVTKLTLPLEFKGFYLAVSEPSCTIFLLTVEKKHLSIAIAGLKELKTKLKI >gi568815594f:155113940_155315082|GENSCAN_predicted_CDS_5|534_bp atgtttgctcaagagcaaatggctttccttgctctttacagtgaatcagaagtaactcac ctaacccttcacttaaatagctcatgggtaactccagacgagacaattggaaaggctgta acactcaccaagagggactgcggcttcactgctgaagtcagcgagaccacgaatccacca ggaggagcaaacaattccggaagggaggaacaaacaactccggacgcaccacctttaaga gctgtaacactcactgtgaaggtttttggcttcactcctgaaatcagtgagaccacgaac ccaccaggaggagcaaacaactccagacgcgccacctttaagaactgtaacactcacggc gaaggtctgttgcctatcatagtaaccaaactcactttgcctctggaattcaaaggcttt tatttggcagtctctgagccttcatgtaccatcttcctattgactgtggaaaagaaacac ctgtctattgccattgcagggttaaaggagctaaaaaccaaactcaaaatttga >gi568815594f:155113940_155315082|GENSCAN_predicted_peptide_6|1176_aa MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTR QKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHILGSKALLSKCK RTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE NKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ EITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDTTKNDKG DITTDPTEIQTTIREYYKHLYANQPENLEEMDTFLDTYTLPRLNQEEVESLNRPITGAEI VAIINSLPTKKSPGPDGFTAEFYQRYKEELPGRDTTKKQNFRPISLMNIDAKILNKILAN RIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQ QPFMLKTLNKLCIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFN IVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVS GYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLRRDVKDLFKENYKPLL NEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIW NQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITP HTYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAIWRKLKLDPFLTPYTKINSRWIKDLNV RPKTIKTLEENLDITIQDIGVGKDFMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRV NRQPTTWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAK KYMKKCSTSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRKSCESHQAPNSKFLLEKTGS RCSAQLHKIHNRANQGNHETDCGYDKKGGSEGFQLMDLGEIKALIDTTPEELAKGFSIIW ALKLKQMVEERLVPYRNIFREIKRQKTQTKTMLYFREVTLSVPASSAFTSNSSTSSTSAP PDTVRAIPSLPPPLPIQHEEDEDEDLIDDPFPLNEY >gi568815594f:155113940_155315082|GENSCAN_predicted_CDS_6|3531_bp atggtaaagggatcaattcaacaagaggagctaactatcctaaatatatatgcacccaat acaggagcacccagattcataaagcaagtcctgagtgacctacaaagagacttagactcc cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaacgaga cagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcggacctaata gacatctacagaactctccaccccaaatcaacagaatatacatttttttcagcaccacac cacacctattccaaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaa agaacagaaattataacaaactatctctcagaccacagtgcaatcaaactagaactcagg attaagaatctcactcaaaaccgctcaactacatggaaactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag aacaaagacacaacataccagaatctctgggacgcattcaaagcagtgtgtagagggaaa tttatagcactaaatgcccacaagagaaagcaggaaagatccaaaattgacaccctaaca tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaactcttcaaaaa atcaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagaccgctagca agactaataaagaaaaaaagagagaagaatcaaatagacacaacaaaaaatgataaaggg gatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacacctc tacgcaaatcaaccagaaaatctagaagaaatggatacattcctcgacacatacactctc ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggagctgaaatt gtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagcc gaattctaccagaggtacaaggaggaactgccgggcagagacacaaccaaaaaacagaat tttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaac cgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggctttatccctggg atgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacagagcc aaagacaaaaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaattcaa caacccttcatgctaaaaactctcaataaattatgtattgatgggacgtatttcaaaata ataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactggaa gcattccctttgaaaactggcacaagacagggatgccctctctcaccgctcctattcaac atagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtattcaa ttaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtttatctagaa aaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctca ggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaaaca gagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaataccta ggaatccaacttagaagggatgtgaaggacctcttcaaggagaactacaaaccactgctc aacgaaataaaagaagatacaaacaaatggaagaacattccatgctcatgggtaggaaga atcaatatcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatcccc atcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatgg aaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggc atcacactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtac tggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacacca catacctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggat tccctatttaataaatggtgctgggaaaactggctagccatatggagaaagctgaaactg gatcccttccttacaccttatacaaaaatcaattcaagatggattaaagatttaaacgtt agacctaaaaccataaaaaccctagaagaaaacctagacattaccattcaggacataggc gtgggcaaggacttcatgtccaaaacaccaaaagcaatggcaacaaaagacaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctacaacatgggagaaaattttcgcaacctactcatctgacaaagggcta atatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggcgaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaa aaatacatgaaaaaatgctcaacatcactggccatcagagaaatgcaaatcaaaaccact atgagatatcatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacaga aaaagctgtgaaagtcaccaagcaccaaacagcaaattcctgctggagaaaactgggtcc agatgtagtgcacaacttcacaagattcacaacagagccaatcaagggaatcatgaaaca gattgtggatatgacaaaaaaggtgggagtgaaggctttcagcttatggatcttggagaa attaaagcactaatagacaccaccccagaggaattagcaaaaggtttttctattatatgg gcactgaaactaaagcaaatggtggaagaaaggttggtaccatatagaaatatttttaga gaaataaaaaggcaaaaaactcagacaaaaacaatgctgtatttccgtgaagtcacactg agtgtgcctgcctcttctgccttcacttccaactcctccacctcttccacttctgcccct cctgatacagtaagagcaatcccttctcttcctcctcctctgcctattcaacatgaagaa gatgaggatgaagaccttattgatgatccatttccacttaatgaatactaa >gi568815594f:155113940_155315082|GENSCAN_predicted_peptide_7|139_aa XLWNFELESDDLGYLVEEISKPQSIQDVPCLLLTTYAHMHEERNDLKLELIFKREAEHKS LENLWPGHVVEKKNPFSQEEFKQAAEICISKELPSADSQDNEKKASKAFQRPWRQPSPSQ MQRPRREEWLHGPGPALLP >gi568815594f:155113940_155315082|GENSCAN_predicted_CDS_7|420_bp natctgtggaactttgaacttgagagtgatgatttagggtatctggtggaagaaatttct aagccgcaaagcattcaagatgtgccctgcctacttctaacaacctatgctcatatgcat gaggaaagaaatgacctgaaactggaacttatatttaaaagggaagcagaacataaaagc ttggaaaatttgtggcctggccatgtggtagaaaagaaaaacccattttctcaggaggaa ttcaagcaggctgcagaaatttgcataagtaaggagttgccaagtgctgatagccaagac aatgagaaaaaggcctccaaggcatttcagagaccctggaggcagcccagcccatcacag atgcagaggcctaggagggaagaatggcttcatgggccaggaccagccctgctgccttga