GENSCAN 1.0 Date run: 2-Nov-116 Time: 18:54:40 Sequence gi568815595f:152200194_152559324 : 359131 bp : 35.96% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 10 5 6 1.05 1.03 Term - 17260 17088 173 2 2 40 55 134 0.470 2.31 1.02 Intr - 18209 18087 123 0 0 41 55 133 0.446 4.94 1.01 Init - 22977 22842 136 0 1 95 81 97 0.913 10.05 1.00 Prom - 48909 48870 40 -3.65 2.02 PlyA - 49673 49668 6 1.05 2.01 Sngl - 50687 49893 795 2 0 42 35 302 0.512 15.92 2.00 Prom - 58937 58898 40 -6.15 3.04 PlyA - 59298 59293 6 1.05 3.03 Term - 68545 68265 281 2 2 63 49 221 0.619 10.32 3.02 Intr - 69064 68702 363 2 0 106 25 266 0.940 16.33 3.01 Init - 93142 93055 88 1 1 46 107 62 0.949 4.75 3.00 Prom - 97091 97052 40 -4.35 4.00 Prom + 98401 98440 40 -6.35 4.01 Init + 100001 100174 174 1 0 89 87 93 0.978 8.69 4.02 Intr + 100688 100780 93 1 0 72 84 58 0.863 3.04 4.03 Intr + 101338 101407 70 0 1 98 52 57 0.255 0.94 4.04 Term + 117536 117642 107 0 2 78 49 50 0.037 -2.31 4.05 PlyA + 118267 118272 6 1.05 5.04 PlyA - 118285 118280 6 1.05 5.03 Term - 134264 134215 50 0 2 103 47 50 0.369 -1.11 5.02 Intr - 134865 134722 144 1 0 61 37 128 0.415 4.33 5.01 Init - 138122 138068 55 2 1 96 72 49 0.902 5.61 5.00 Prom - 139617 139578 40 -7.45 6.02 PlyA - 139630 139625 6 1.05 6.01 Sngl - 140711 140334 378 2 0 72 42 226 0.921 12.45 6.00 Prom - 142193 142154 40 -5.55 7.04 PlyA - 143278 143273 6 1.05 7.03 Term - 148719 148618 102 0 0 58 36 129 0.483 2.00 7.02 Intr - 169540 169396 145 0 1 26 41 119 0.023 0.36 7.01 Init - 172229 172084 146 2 2 70 98 97 0.548 8.54 7.00 Prom - 176591 176552 40 -4.15 8.00 Prom + 182485 182524 40 -3.35 8.01 Init + 203033 203047 15 1 0 87 48 36 0.040 -0.15 8.02 Intr + 214748 214918 171 1 0 115 91 115 0.849 13.72 8.03 Intr + 220527 220606 80 2 2 72 113 57 0.077 4.03 8.04 Intr + 221336 221505 170 2 2 15 77 160 0.819 6.27 8.05 Intr + 223075 223324 250 2 1 42 41 150 0.032 1.27 8.06 Intr + 229248 229404 157 0 1 91 71 22 0.397 -0.11 8.07 Intr + 232524 232727 204 2 0 93 75 204 0.895 18.07 8.08 Intr + 245089 245346 258 0 0 80 101 280 0.974 25.24 8.09 Intr + 247427 247580 154 1 1 50 110 87 0.699 5.82 8.10 Term + 256074 256177 104 1 2 43 28 141 0.446 1.06 8.11 PlyA + 256679 256684 6 1.05 9.00 Prom + 257755 257794 40 -6.45 9.01 Init + 257852 257922 71 1 2 68 76 73 0.152 4.67 9.02 Intr + 284036 284164 129 2 0 103 110 -10 0.050 1.59 9.03 Intr + 300800 300876 77 2 2 74 115 58 0.340 5.34 9.04 Term + 300987 301381 395 1 2 42 48 266 0.303 12.21 9.05 PlyA + 302151 302156 6 -0.45 10.00 Prom + 302594 302633 40 -8.95 10.01 Init + 305092 305178 87 0 0 65 51 74 0.432 2.10 10.02 Intr + 305780 305907 128 1 2 82 73 102 0.936 6.66 10.03 Intr + 321660 321867 208 0 1 54 69 114 0.003 4.26 10.04 Term + 327676 327810 135 0 0 106 42 35 0.004 -2.26 10.05 PlyA + 328363 328368 6 1.05 11.02 PlyA - 328634 328629 6 1.05 11.01 Sngl - 338144 337791 354 2 0 72 53 217 0.470 12.40 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 91943 91834 110 0 2 50 54 96 0.930 0.09 S.002 Init + 220461 220606 146 2 2 78 113 123 0.912 13.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:152200194_152559324|GENSCAN_predicted_peptide_1|143_aa MDAAGCHQPKSINAGKGNKIPHVLTCKLETYKRGPTDTGAYLRVEGVKLQNFVVSVTAHK GSVDPMSEQQQDLLQRVKEQSFHSVEAAGPDAKPLIARGGRASQLLRVWGPPSPHPPGTP VGLQVQAQPQFPLAPLPPHLPGS >gi568815595f:152200194_152559324|GENSCAN_predicted_CDS_1|432_bp atggatgcagctggatgccatcaacctaagtcaattaatgcagggaaagggaacaaaata ccgcacgttctcacttgtaagttggaaacttataagaggggaccaacagacaccggagct tacttgagggtagaaggagtgaagctacagaacttcgtggtgagtgttacagctcataaa ggcagtgtggacccaatgagtgagcagcagcaagatttattgcaaagagtgaaagaacaa agcttccacagtgtggaagccgctggcccagatgctaagcccctcattgcccggggcggc agggccagccagctgctccgagtgtggggcccgccaagcccacacccacctggaactcca gttggcctgcaagtgcaggcgcagccccagttcccgctggcgcctctccctccacacctc cctggaagctga >gi568815595f:152200194_152559324|GENSCAN_predicted_peptide_2|264_aa MIISTDAEKAFDKIQQPFMPKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDLIVYLENPIVS AQNLLKLISNFSKVSGYKINAQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLT RDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPM TFFTELEKTTSKFIWIQKRARIAK >gi568815595f:152200194_152559324|GENSCAN_predicted_CDS_2|795_bp atgattatctcaacagatgcagaaaaggcctttgacaaaattcaacaacccttcatgcca aaaactctcaataaattaggtattgatgggacgtatctcaaaataataagagctatctat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagacgacttgattgtatatctagaaaaccccattgtctca gctcaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gcacaaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagagaataaaatacttaggaatccaacttaca agggacgtgaaggacctcttcaaggagaactacaaaccactgcttaatgaaataaaagag gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa atggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactacttcaaagttcatatggatccaaaaaagagcc cgcatcgccaagtaa >gi568815595f:152200194_152559324|GENSCAN_predicted_peptide_3|243_aa MLYIIWEELPSERVNAAFSTIQLHIAKSRGGAEGRETNYRERPAPARENRGRRAPPRTSS FSHGQTRRSPNNEAARGNCENGNLPLTFFFQRRPHCRPSAQGIVGVMIPRCPRRKMELVS SLSWEHCGRHNSFSCSEKTVACRCAASYRRGSPVNRTEPARAGHGRSPSGAESRRKPEPA RAPWIPSASRWYLPSHAFSIVTSRRACRPDALPLLLPTARHTSGFILKEALAPGFARKPA RLD >gi568815595f:152200194_152559324|GENSCAN_predicted_CDS_3|732_bp atgttatacattatatgggaagaactgccatctgaacgagttaatgcagccttcagcact attcaactccacatagcaaaaagcagaggaggtgctgagggacgcgaaaccaactaccgc gagcggcccgcccctgcccgggagaaccgcggccggcgagcacctcctcgcacttcctcc ttcagccacgggcagacccgccggagtccgaacaatgaagcggcgcggggaaactgtgaa aatggaaacttacctctcacatttttttttcagaggcggccccactgtcgaccaagtgcc caaggcattgtgggagtcatgattccaagatgcccgcgacggaaaatggaacttgtcagc tccttgtcatgggagcattgtgggcgccacaactcattcagctgcagtgaaaagactgtt gcatgtcgctgtgcagcctcttatcgccgagggtcgcctgtcaaccggacagagcccgca agagcgggtcatgggcggtcgcccagcggcgccgagagcaggaggaagccggagcccgcg cgggcgccgtggatcccctcggcctcccgttggtatcttccatctcatgcgttcagcatt gtaacctcgcggcgggcgtgcagaccggacgccctcccgcttcttctaccgactgctagg cacaccagtgggttcattttgaaagaggcccttgcaccgggctttgcaaggaagccggca aggttagattga >gi568815595f:152200194_152559324|GENSCAN_predicted_peptide_4|147_aa MAVSVTPIRDTKWLTLEVCREFQRGTCSRPDTECKFAHPSKSCQVENGRVIACFDSLKAR ATHFMENVTRSKVLNCCYSTLPEAQTLLHAVHFAKVFAKDFENMIFRVEQKGESVNPQTN DSSSIKFCFQIDVQRDVQKDDKGYIVT >gi568815595f:152200194_152559324|GENSCAN_predicted_CDS_4|444_bp atggctgttagtgtcacaccaattcgggacacaaaatggctaacactggaagtatgtaga gagttccagagggggacttgctcacggccagacacggaatgtaaatttgcacatccttcg aaaagctgccaagttgaaaatggacgagtaatcgcctgctttgattcattgaaagccaga gccacgcattttatggaaaatgtaactagaagtaaggtgctgaactgctgctacagcacc ttacctgaggcacaaactttgttgcatgctgttcattttgctaaagtttttgccaaagat ttcgagaatatgatctttagagttgaacagaaaggagaatctgtaaacccacaaacaaat gattcttcatcgatcaagttctgttttcaaatagatgttcaacgagatgttcaaaaagac gataaaggatacattgtaacctaa >gi568815595f:152200194_152559324|GENSCAN_predicted_peptide_5|82_aa MTTQHNTLGEEEQLAQSSAILAAAAAAEAGADHLDPSPCRMLAKAFLFTPTESLPVRLHL FQSVKTVAFDLTSGYNVAGQQS >gi568815595f:152200194_152559324|GENSCAN_predicted_CDS_5|249_bp atgaccacgcagcacaacaccctgggagaagaggaacaattggcacagagctcagcaata ttagcagcagcagcagcagcagaagctggagcagaccatttagacccatctccttgcagg atgctagccaaagcattcttatttacacctacagagagtttgcccgtacgtctgcacctt tttcagagtgtcaaaacagtggcgttcgacttaaccagtggctacaatgtggctggacaa cagagttaa >gi568815595f:152200194_152559324|GENSCAN_predicted_peptide_6|125_aa MQMDPGPQVPLYWLGFVYAALAALGGISGYAKVGSVQSPSAGFFFSELAGLDASQPSRNP KEHLSSPVYIWDLARYYANKILTLWNIYACGFSCRCLLIVSKLGSMYGEQILSVVAMSQL GLMKN >gi568815595f:152200194_152559324|GENSCAN_predicted_CDS_6|378_bp atgcagatggaccctggcccacaggtgcctttatactggcttggctttgtctatgcagca ctggctgctttaggtgggatcagtggttatgcaaaagtaggttctgtccagtccccttct gctggattcttcttcagtgagttagcaggcctggatgcttctcagccatcacggaatcca aaggaacatttgagttctcccgtttacatctgggatcttgctaggtattatgcgaataag attctaacactctggaacatttacgcttgtgggtttagttgcaggtgcctgctgatagtt tccaaacttggaagtatgtatggtgaacagatcctgtcagttgtagctatgtctcagctt ggactcatgaagaattaa >gi568815595f:152200194_152559324|GENSCAN_predicted_peptide_7|130_aa MRKNQHKKAENSKNQNTSSPPRDHNSSPARKQNWMENEFDELTEVGFGRLNQKEVKSLNR PITSSEIEAVIKSLSTKKSPGPDGFTAKFYQRYKEELHLKTFYDHIKSSAFFYTKDIQYT LDIAAIAAAG >gi568815595f:152200194_152559324|GENSCAN_predicted_CDS_7|393_bp atgaggaaaaaccagcacaagaaggctgaaaattccaaaaaccagaacacctcttctcct ccaagggatcataactcctcgccagcaaggaaacaaaactggatggaaaatgagtttgac gaattgacagaagtaggcttcggaagactaaaccagaaagaagtcaaatccctgaataga ccaataacaagttctgaaattgaagcagtaattaagagcctatcaaccaaaaaaagccca ggaccagatggattcacagccaaattctaccagaggtacaaagaggagctgcacttaaaa accttctatgatcatatcaagtccagtgcctttttctacaccaaagacattcaatacact ttggatattgctgcaattgctgctgccggctaa >gi568815595f:152200194_152559324|GENSCAN_predicted_peptide_8|520_aa MADAPGRCSRENCKYLHPPPHLKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQ PVDHNSSPAREQNWTESEFDKLTEVGFRSVNKAADKFKLGAENSHSGNTENDTNILLEKS NPKTYNRQIHQGLNEGKMLRAAREKEIQTTSREYYKHLYANKVENLKEIDKFLDTYSLPR LNQEEVESLNRTITSAEIEAVIIIAYQPEKAQDQTDSQANSTRGTKRSWNRDTYKSLSQE YSQYQIYQLHCYMLSFRENRMPLDNNIALISVFEILLNPLQPMFSVAPSLATNASAAAFN PYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAAAAAAQKLMRTDRLEVCREYQRGNCN RGENDCRFAHPADSTMIDTNDNTVTVCMDYIKGRCSREKCKYFHPPAHLQAKIKAAQYQV NQAAAAQAAATAAAMGIPQAVLPPLPKRPALEKTNGATAVFNTGIFQYQQALANMQLQQH TAFLPPVPMVHGATPATVSAATTSATSVPFAATATANQVC >gi568815595f:152200194_152559324|GENSCAN_predicted_CDS_8|1563_bp atggctgacgctcctggccgttgctccagggagaactgcaaatatcttcatccaccccca catttaaaaacgcagttggagataaatggacgcaataacttgattcagcagaagaacatg gccatgttggcccagcaaatgcaactagccaatgccatgatgcctggtgccccattacaa cccgtggatcacaactcctcgccagcaagggaacaaaactggacagaaagtgagtttgac aaattgacagaagtaggcttcagaagtgtaaacaaagccgctgataagtttaaactgggt gcagaaaattcacattcaggaaatacagagaacgacacaaatatactcctcgagaagagc aaccccaagacatataatcgtcagattcaccaaggtttaaatgaaggaaaaatgttaagg gcagccagagagaaagaaatacaaactaccagcagagaatactataaacacctctatgca aataaagtagaaaacctaaaagaaatagataaattcctggacacgtacagcctcccaaga ctcaaccaggaagaagtcgaatccctgaatagaacaataacaagtgctgaaattgaggca gtaattataatagcctatcaaccagaaaaagcccaggaccagacggattcacaggcgaat tctaccagaggtacaaagaggagctggaacagagatacttacaaatcattatcacaagaa tatagtcagtatcaaatataccaattacattgttatatgttatccttcagagaaaacagg atgccacttgataacaacattgcattaatatctgtctttgagattctgctgaaccccctg cagccaatgttttcagttgcaccaagcttagccaccaatgcatcagcagccgcctttaat ccctatctgggacctgtttctccaagcctggtcccggcagagatcttgccgactgcacca atgttggttacagggaatccgggtgtccctgtacctgcagctgctgcagctgctgcacag aaattaatgcgaacagacagacttgaggtatgtcgagagtaccaacgtggcaattgcaac cgaggagaaaatgattgtcggtttgctcatcctgctgacagcacaatgattgacaccaat gacaacacagtcactgtgtgtatggattacatcaaagggagatgctctcgggaaaagtgc aaatactttcatccccctgcacatttgcaagccaagatcaaggctgcccaataccaggtc aaccaggctgcagctgcacaggctgcagccaccgcagctgccatgggaattcctcaagct gtacttcccccattaccaaagaggcctgctcttgaaaaaaccaacggtgccaccgcagtc tttaacactggtattttccaataccaacaggctctagccaacatgcagttacaacagcat acagcatttctcccaccagttcccatggtgcacggtgctacgccagccactgtgtccgca gcaacaacatctgccacaagtgttcccttcgctgcaacagccacagccaaccaggtttgc taa >gi568815595f:152200194_152559324|GENSCAN_predicted_peptide_9|223_aa MQPACMQKFIAGPSSASIGVEGPWGCDCRESWVGYYSFASIALYTSFSRFYIALWSTIYK LADGTYRFKRFCVRTWWVLALTDFKNEATDPRGVKLQIIVVSVTALNAARLELFIPPGGF MVLFIPPGGFLVSQASGVKLWTFPVSVIAHNDSADTKVSSSKIYCKEQRNKASTVWNGTR GGCRCWLGQPAYIPLSDPTHILLIGPFYRELIGALYRELIGPF >gi568815595f:152200194_152559324|GENSCAN_predicted_CDS_9|672_bp atgcagcctgcatgcatgcagaagtttatagctggcccttcatctgcatcgattggtgtt gaaggtccttggggttgtgactgtagagaaagctgggtagggtattatagctttgcttcc atagccctatacacttcttttagcaggttttatattgcattgtggagtacaatctacaag cttgcagatggcacttacaggtttaagcgattctgtgtccggacttggtgggttcttgct ctcactgacttcaagaatgaagccacagaccctcgcggagtgaagctgcagatcatcgtg gtgagtgttacagctcttaacgcagcacgtctggagttgttcattcctcccggtgggttc atggtcttgttcattcctcccggtgggttcctggtctcgcaggcctcaggagtgaagctg tggaccttcccagtgagcgttatagctcataatgacagtgcagacacaaaagtgagcagc agcaagatttactgcaaagagcaaaggaacaaagcttccacagtgtggaatgggaccaga gggggttgccgctgctggcttgggcagcctgcttatattcccttatccgaccccacccac atcctgctgattggcccattttacagagagctgattggtgcattatacagagagctgatt ggtccattttga >gi568815595f:152200194_152559324|GENSCAN_predicted_peptide_10|185_aa MLSESQVLELGNPRACLALYSLVAVLVPKRPKALWPSVVRFAGTQVQTTGLGDSPLTRAG LNVAFVGEHQLSFLATGIKVWDNWVFRPGFHQIKIKVRVRPAVSSEMQGNCLPSSLVVGR IHLLMVVGLRPSAPKGLPLPAEDNCTFLIDSIGNQTQAYLRDTVGLALDPCNKMSHMNIL ASQCT >gi568815595f:152200194_152559324|GENSCAN_predicted_CDS_10|558_bp atgctgtccgagagtcaagtcctggaattggggaacccaagagcctgcttggcgctctac tcccttgtggcagtgttggtacctaagagacccaaggcactgtggccctcggtggtgagg tttgcaggcactcaagttcagaccactggactaggtgattcccctctgactagggctggt ttaaatgttgcctttgtgggcgagcatcagctgagttttcttgctacaggaatcaaggta tgggataactgggtctttcgcccagggtttcaccagattaaaatcaaggttcgtgttagg cctgcagtttcatctgagatgcagggaaattgtcttccaagctcactggttgttggcaga attcatttacttatggttgtaggactgaggccttcagctcctaaaggcctaccattacca gccgaagacaattgtactttcttaattgattctataggcaaccaaacacaggcatatctc agagatactgtgggtttggctctagacccctgcaacaaaatgagtcacatgaatattttg gcttcccagtgtacttaa >gi568815595f:152200194_152559324|GENSCAN_predicted_peptide_11|117_aa MHQDLFLTHNSSRGTSKLNWQGATYSYHGPLKPLQEEIPRPPQILELSGRAAWRSGRSSK QNDVEPRRFGVGASIEGMLKDGHLPRLNLLPKETLALGELSDCSGERSCPSDRGSPT >gi568815595f:152200194_152559324|GENSCAN_predicted_CDS_11|354_bp atgcatcaggacttgtttctgacccacaacagctccaggggaacaagtaagttgaactgg caaggagcaacttactcttaccatgggcctctgaaacccctgcaggaggaaatccctcga ccaccacagatactcgagttgtcaggaagagctgcttggagaagtggtaggagcagcaag caaaatgatgtggagcccagaagatttggtgtgggagcatctatagagggcatgctcaag gatggccatctccctaggctcaacttgctcccaaaggagactttagccctaggggaattg tctgactgctctggagagcggtcttgcccatcagacaggggtagtccaacctga