GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:32:29 Sequence gi568815595f:26609673_26810449 : 200777 bp : 38.70% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 13108 13423 316 0 1 87 80 185 0.608 12.31 1.02 Intr + 13718 13854 137 0 2 86 19 48 0.621 -2.93 1.03 Intr + 14004 14201 198 2 0 85 100 70 0.644 6.53 1.04 Intr + 15464 15639 176 1 2 59 78 142 0.252 8.12 1.05 Intr + 31315 31428 114 1 0 25 72 114 0.621 2.14 1.06 Term + 32499 32802 304 0 1 55 46 219 0.735 8.06 1.07 PlyA + 34409 34414 6 1.05 2.03 PlyA - 34679 34674 6 1.05 2.02 Term - 42842 42721 122 0 2 120 41 51 0.032 1.26 2.01 Init - 64272 64068 205 0 1 49 94 147 0.872 10.46 2.00 Prom - 66286 66247 40 -2.65 3.03 PlyA - 66330 66325 6 1.05 3.02 Term - 73777 73592 186 1 0 90 37 121 0.241 3.71 3.01 Init - 79999 79937 63 1 0 62 84 65 0.280 4.70 3.00 Prom - 85939 85900 40 -5.65 4.00 Prom + 98643 98682 40 -5.95 4.01 Sngl + 100001 100780 780 1 0 76 36 606 0.792 49.94 4.02 PlyA + 101075 101080 6 1.05 5.04 PlyA - 102462 102457 6 1.05 5.03 Term - 119544 118486 1059 0 0 -23 37 486 0.252 23.53 5.02 Intr - 128750 128695 56 0 2 91 98 33 0.081 2.38 5.01 Init - 131868 131862 7 0 1 55 110 0 0.075 -0.01 5.00 Prom - 136180 136141 40 -5.65 6.00 Prom + 137839 137878 40 -1.45 6.01 Init + 139038 139062 25 2 1 87 89 32 0.593 2.95 6.02 Term + 148642 149315 674 2 2 55 49 263 0.065 12.03 6.03 PlyA + 149423 149428 6 1.05 7.04 PlyA - 150719 150714 6 1.05 7.03 Term - 151001 150933 69 2 0 76 54 44 0.005 -3.24 7.02 Intr - 161203 161076 128 2 2 67 85 114 0.561 8.48 7.01 Init - 169092 168981 112 0 1 65 40 77 0.062 1.03 7.00 Prom - 170608 170569 40 -5.15 8.00 Prom + 172782 172821 40 -5.35 8.01 Sngl + 174691 175347 657 0 0 65 43 342 0.665 23.32 8.02 PlyA + 175575 175580 6 1.05 9.00 Prom + 175744 175783 40 -6.15 9.01 Init + 177017 178623 1607 1 2 37 86 401 0.967 26.69 9.02 Term + 181346 181469 124 2 1 61 45 111 0.251 0.98 9.03 PlyA + 182560 182565 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 23285 23161 125 2 2 90 103 128 0.822 14.19 S.002 Sngl - 59723 59502 222 2 0 51 33 187 0.831 4.40 S.003 Term - 135157 134934 224 2 2 87 38 161 0.910 7.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:26609673_26810449|GENSCAN_predicted_peptide_1|414_aa PGFGGDFPLLTEPTASLGKRRFASLLSLSALPARLRRLEQGLQRPPQQREPSRAPHGRPG PHPRSVAAPEDASRATHCLRPAPQPPRTNAAAFAGSQARRAGPVPDPGRDLLLERESRPS SLLFSRLLYNSLQTFVTPSLRGAHPVLCNLQELGRDREGEIREVGHGGWLRPPLKLRPAR WESVHPCTRHPQLGNETFGVACGCAPWELQVQIWVGETFHVQAFLTFSAGALQRTPAALY QLGKSKIQSSSFFWVPRLGHIGGNVDSPGLPGEIPRINRLSTVVGDVWNAKVWQCTKSHL VGTVKEAGEEIYGRRLQKTPLYVCERKNKKGKSHLNHIVQIVSSSGIPCKDLRDFLESLD QTLRATGVVEEMALPASGDTSRPCNSLGEKDEGLHQGVSSGKEKRMNGREAKEK >gi568815595f:26609673_26810449|GENSCAN_predicted_CDS_1|1245_bp cctgggtttggaggcgatttccctctattgactgagccgactgcatctctgggcaagcgg cgctttgcctctctcctgtctttgtctgctcttcctgcaaggctacggcgcctggagcag ggtctgcagcggccgccgcagcaacgcgagccaagtcgtgccccgcacggccgcccgggg ccgcaccctcgctcggtggcggcgcccgaagacgccagccgcgccacgcactgcctgcgt cccgcgccccagccgccgcgcaccaacgccgccgccttcgccgggagccaagcccgccgg gccggcccggtcccagaccctgggagagacctcctcctggagagagagagtcgaccctct tcgctcctgttcagccggctgctatacaatagtctgcaaacttttgtgaccccttccctg cgcggtgctcaccccgtgctgtgcaacttgcaggagctgggtagggacagagaaggtgaa ataagagaggtgggacatggaggatggctgcggccacccctgaagcttcgtccagcgcga tgggagtcagtccatccttgcacacgccacccccaactcggcaatgaaacctttggagtg gcttgcggctgtgcgccgtgggaactgcaagtgcagatatgggtgggtgaaaccttccac gtgcaggcttttcttactttctccgcgggtgccctacagaggactcctgctgccttgtac cagttgggaaaaagcaaaattcagagcagttcctttttctgggtccctcgccttggccac atcggaggaaacgtcgactcgccagggctcccgggagaaatcccgagaataaacagactt tctacagttgtgggagatgtttggaatgcaaaggtctggcagtgcacaaaaagtcactta gtagggaccgtgaaggaagctggtgaagaaatctatggcaggaggcttcagaaaactcca ctgtatgtttgtgagagaaagaataaaaaaggcaaatcacatcttaatcacattgtacaa atagtctcatcttcaggaatcccctgcaaggatcttagggacttcctggagtccctggac caaactttgagagccactggtgtggtggaggagatggctctaccagcaagtggggatacc agcagaccttgtaatagcttaggggagaaagatgaaggcctgcaccaaggtgtatcttca ggaaaggagaagaggatgaatgggagagaagctaaggagaaataa >gi568815595f:26609673_26810449|GENSCAN_predicted_peptide_2|108_aa MKLGLLQSGWLSNVVHKPGTNFKKNLKRVDRVSVIHQLMHYIKKTLLELTDFVQTSCFSG GGCSSAGGGFELTLFYYPSADLHLVTQLFYEIPLASFFRPTSQLCSDI >gi568815595f:26609673_26810449|GENSCAN_predicted_CDS_2|327_bp atgaaattgggcctcctgcagtctggctggctttccaatgtggtccacaaaccagggacc aatttcaagaagaacctcaaaagagtggacagggtgagtgtcattcaccaattaatgcac tacattaagaaaactctcctagagctgactgattttgtgcagacttcctgtttcagtgga ggaggctgttcttcagcaggtggaggatttgagctgactctgttctactacccctctgca gacttgcatttagttacccagctcttctatgagattcctttggcttctttcttccgcccc acttcacagctgtgcagtgacatctaa >gi568815595f:26609673_26810449|GENSCAN_predicted_peptide_3|82_aa MQILPPLYGSPYGNGERAGERTLLELMFGLNWWAPKKYDATTAEIWLCQYLDISAAATIY PTDQMVVVLKCHRVVKWSYNVV >gi568815595f:26609673_26810449|GENSCAN_predicted_CDS_3|249_bp atgcagattttgccacctctttatggaagtccatatggaaatggagaaagagcaggagag aggaccttgctagagcttatgtttggcctaaactggtgggccccaaagaaatatgatgct acaactgctgaaatctggctttgccaatatttggatatttcagctgctgctacaatttac ccaacagatcagatggttgtggtactgaaatgccatcgtgtagtaaagtggtcctataat gtagtttaa >gi568815595f:26609673_26810449|GENSCAN_predicted_peptide_4|259_aa MNLVDLWLTRSLSMCLLLQSFVLMILCFHSASMCPKGCLCSSSGGLNVTCSNANLKEIPR DLPPETVLLYLDSNQITSIPNEIFKDLHQLRVLNLSKNGIEFIDEHAFKGVAETLQTLDL SDNRIQSVHKNAFNNLKARARIANNPWHCDCTLQQVLRSMASNHETAHNVICKTSVLDEH AGRPFLNAANDADLCNLPKKTTDYAMLVTMFGWFTMVISYVVYYVRQNQEDARRHLEYLK SLPSRQKKADEPDDISTVV >gi568815595f:26609673_26810449|GENSCAN_predicted_CDS_4|780_bp atgaatctggtagacctgtggttaacccgttccctctccatgtgtctcctcctacaaagt tttgttcttatgatactgtgctttcattctgccagtatgtgtcccaagggctgtctttgt tcttcctctgggggtttaaatgtcacctgtagcaatgcaaatctcaaggaaatacctaga gatcttcctcctgaaacagtcttactgtatctggactccaatcagatcacatctattccc aatgaaatttttaaggacctccatcaactgagagttctcaacctgtccaaaaatggcatt gagtttatcgatgagcatgccttcaaaggagtagctgaaaccttgcagactctggacttg tccgacaatcggattcaaagtgtgcacaaaaatgccttcaataacctgaaggccagggcc agaattgccaacaacccctggcactgcgactgtactctacagcaagttctgaggagcatg gcgtccaatcatgagacagcccacaacgtgatctgtaaaacgtccgtgttggatgaacat gctggcagaccattcctcaatgctgccaacgacgctgacctttgtaacctccctaaaaaa actaccgattatgccatgctggtcaccatgtttggctggttcactatggtgatctcatat gtggtatattatgtgaggcaaaatcaggaggatgcccggagacacctcgaatacttgaaa tccctgccaagcaggcagaagaaagcagatgaacctgatgatattagcactgtggtatag >gi568815595f:26609673_26810449|GENSCAN_predicted_peptide_5|373_aa MPVSLLLIKCVTKFLDYRSGGKVNKDTQELNSALHEADLIDIYRTLHPKSTEYTFFSAPQ HTYSKIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLN DYWVHNEMKAEIKMFFETNENKDTTYQNIWDTFKAVCRGKFIALNSHKRKQERSKIDTLT SQLKELEKQEQTHSKASRRQEITKIRAELKEIEMQKTLQKINESRSWFFERINKIDKPLA RLIKKKREKNQIDAIKNDKGDITTDPTEIQTTLREYYKHLYANKLENLEEMDKFLDTYTL PGLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQS IEKDKVFLKYEKT >gi568815595f:26609673_26810449|GENSCAN_predicted_CDS_5|1122_bp atgccagtgtccctgctgctaatcaagtgtgtcactaaattccttgattacagaagtgga gggaaagtcaacaaggatacccaggaattgaactcagctctgcacgaagcagacctaata gacatctacagaactctccatcccaaatcaacagaatatacatttttttcagccccacag cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaa agaacagaaattataacaaactatctctcagaccacagtgcaatcaaactagaactcagg attaagaatctcactcaaaaccgctcaactacatggaaactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgag aacaaagacacaacataccagaatatctgggacacattcaaagcagtgtgtagagggaaa tttatagcactaaattcccacaagagaaagcaggaaagatccaaaattgacaccctaaca tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaaaatcagagcagaactgaaggaaatagagatgcaaaaaacccttcaaaaa attaatgaatccaggagctggttttttgaaaggatcaacaaaattgataaaccgctagca agactaataaagaaaaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggg gatatcaccaccgatcccacagaaatacaaactaccctcagagaatactacaaacacctc tacgcaaataaactagaaaatctagaagaaatggataaattccttgacacatacaccctc ccaggactaaaccaggaagaagttgaatctctgaatagaccaataacaggttctgaaatt gtggcaataatcaatagcttaccaaccaaaaagagtccaggaccagatggattcacagcc gaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatca atagaaaaagacaaggtatttctgaaatatgagaaaacttga >gi568815595f:26609673_26810449|GENSCAN_predicted_peptide_6|232_aa MVKGEVGAANIFSRGTLLHMANHDHLYQWQRTESAFQNLHRTQVVMSWSRQGSTVALERD GTWILIPVLMHVVHATVGQFSTYWASTSLNYRTNDKNHMIISIDAEKAFDKVQQPFMLKT LNKLSIDGTYLKILRAIYDKPTASIILNGQKLEAFPLKTGTRQGCPFSPLLFNIVLEVMA RVIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIISAQNLLKLIGNFSKV >gi568815595f:26609673_26810449|GENSCAN_predicted_CDS_6|699_bp atggtgaaaggtgaagtgggagcagccaacatattttccagaggaacattactgcatatg gctaaccatgatcatctctaccagtggcaaagaacagagagtgcatttcagaatttacat agaacccaggtggtgatgtcctggagcagacaaggtagcacagtggctttggagagagac ggaacatggattctgattcctgttctgatgcatgttgtccatgcaactgtgggccaattc tcaacttactgggcttccacttctttgaattacagaaccaacgacaaaaaccacatgatt atctcaatagacgcagaaaaggcctttgacaaagttcaacagcccttcatgctaaaaact ctcaataaattgagtattgatgggacgtatctcaaaatattaagagctatctatgacaaa cccacagccagtatcatactgaatgggcaaaaactggaagcattccctttgaaaactggc acaagacagggatgccctttctcaccactcctattcaacatagtgttggaagttatggcc agggtaatcaggcaggagaaggaaataaagggtatacaattaggaaaagaggaagtcaaa ttgtccctgtttgcagatgacatgattgtatatctagaaaaccccatcatctcagcccaa aatctccttaagctgataggcaacttcagcaaagtctga >gi568815595f:26609673_26810449|GENSCAN_predicted_peptide_7|102_aa MQTKATMRYHLTPIKVAYIQKTNAGEGVKKGEPLYTVERSSSFREWKLSVKVQKDKNRNS RDLKSRKPNAYDLATGTQLEALVSDVPLPVSKCSHCLIPTYE >gi568815595f:26609673_26810449|GENSCAN_predicted_CDS_7|309_bp atgcagaccaaagctacaatgagatatcatctcaccccaattaaagtggcttatatccaa aagacaaatgccggagagggtgtgaagaaaggggaacccttatacactgttgagaggagc tcatcatttagggaatggaagttgtcagtaaaggtgcaaaaggacaagaacagaaatagc agagacctgaaaagtagaaagcctaatgcttatgaccttgctactggaactcagcttgag gccctggtgtctgatgttccccttcctgtgtccaagtgttctcattgtttaattcccacc tatgagtga >gi568815595f:26609673_26810449|GENSCAN_predicted_peptide_8|218_aa MEDGMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGDNGTKLENTLQD MIQENFPNLTRQVNVQIQEIQRMPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK QMLRDFVTTRPALKELLKEALNMERNNRYQPLQNHVNL >gi568815595f:26609673_26810449|GENSCAN_predicted_CDS_8|657_bp atggaagatggaatgaatgaaatgaagcgagaaggaaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatggggacaatggaaccaagttggaaaacactctgcaggat atgatccaggagaatttccccaatctaacaaggcaggtcaacgttcagattcaggaaata cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga tttaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt accctcaaaggaaagcccatcagactaacagcggatctctcagcagaaaccctacaagcc agaagagagtgggggccaatattcaacattctcaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaaa caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctaaaggaagca ctaaacatggaaaggaacaaccggtaccagccactgcaaaatcatgtcaacttgtaa >gi568815595f:26609673_26810449|GENSCAN_predicted_peptide_9|576_aa MNIDAKILNKILANRIQQHIKKLIHHDQLGFIPGMQGWFNICKSINIIQHLNRAKDKNHM LISIDAEKAFDKIQQPFMLKTLNKLGIDGTHFKIIRAIYEKPTANIILNGQKLEAFPLKT GTRQGCPLSPLLFDIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADNMILYLEDPIVSA QNLLKLISNFSKVTGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTR DVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMP FFTQLEKTTLKFTWNQKRARITKSILSQKNKAGGITLLDFKLYYQATVTKTAWYWYKNRD IDQWNRTEPSEIMPHIYNYLVFDKPEKNKQWGKDSLFNQWCWENWLAICRKLKLDPFLTP YTKINSRWIKYLHVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIK LKSFCTAKETTIRFNRQPTEWEKIFATYSSDKGLISRIYNELKQIYKKKTNKPIKRAVGF PLAQDRSRNAVQESVLGIRDPNSPLGALPSCGCGDT >gi568815595f:26609673_26810449|GENSCAN_predicted_CDS_9|1731_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccatgatcaactgggcttcatccctgggatgcaaggctggttcaat atatgcaaatcaataaacataatccagcatctaaacagagccaaagacaaaaaccacatg cttatctcaatagatgcagaaaaggcctttgacaaaattcagcaacctttcatgttaaaa actctcaataaattaggtattgatgggacacatttcaaaataataagagctatctatgag aaacccacagccaatatcatactgaatgggcaaaaattggaagcattccctttgaaaact ggcacaagacagggatgccctctctcgccactcctattcgacatagtgttggaagttctg gccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtc aaattgtccctgtttgcagacaacatgattctatatctagaagaccccattgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtcacaggatacaaaatcaatgtg caaaaatcacaagcgttcctatacaccaacaacagacaaacagagagccaaatcatgagt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggat acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatattgtgaaaatg gccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatgcct ttcttcacacaattggaaaaaactactttaaagttcacatggaaccaaaaaagagcccgc atcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacttgacttc aaactatactaccaggctacagtaaccaaaacagcatggtactggtacaaaaacagagat atagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaactatctg gtctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaatcaatgg tgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacacct tatacaaaaatcaattcaagatggattaaatacttacatgttagacctaaaaccataaaa accctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttcatg tctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaa ctaaagagcttctgcacagcaaaagaaactaccatcagattcaacaggcaacctacagaa tgggagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatctacaat gaactcaaacaaatttacaagaaaaaaacaaacaaacccatcaaaagggcagtgggcttt cctctagcccaggacagatccaggaatgctgtccaagagtcagtgcttggaatcagggac cccaacagcccccttggtgctctgccctcctgtggttgtggggatacctaa