GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:13:38 Sequence gi568815576f:40757182_41026432 : 269251 bp : 43.21% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 1103 1098 6 1.05 1.06 Term - 13800 13653 148 2 1 103 53 142 0.894 9.57 1.05 Intr - 16838 16756 83 2 2 88 105 19 0.987 2.04 1.04 Intr - 19954 19859 96 1 0 106 96 76 0.978 10.41 1.03 Intr - 20192 20047 146 0 2 59 84 86 0.997 5.30 1.02 Intr - 21944 21828 117 0 0 76 111 40 0.949 5.54 1.01 Init - 27930 27861 70 0 1 110 77 52 0.993 7.51 1.00 Prom - 30380 30341 40 -8.36 2.19 PlyA - 30754 30749 6 1.05 2.18 Term - 31982 31729 254 0 2 38 54 162 0.422 3.60 2.17 Intr - 35495 35344 152 1 2 96 92 11 0.397 2.11 2.16 Intr - 37399 37333 67 2 1 76 74 55 0.496 0.86 2.15 Intr - 41902 41842 61 1 1 83 110 12 0.040 1.21 2.14 Intr - 62096 62014 83 0 2 -5 69 122 0.016 0.36 2.13 Intr - 69485 69368 118 2 1 95 38 79 0.215 3.64 2.12 Intr - 70048 69915 134 2 2 87 110 48 0.722 7.26 2.11 Intr - 72493 72445 49 1 1 63 76 15 0.717 -3.95 2.10 Intr - 73775 73659 117 2 0 68 61 154 0.842 11.36 2.09 Intr - 75490 75388 103 0 1 47 75 81 0.996 2.88 2.08 Intr - 78489 78379 111 2 0 63 99 119 0.998 9.99 2.07 Intr - 78706 78622 85 2 1 91 94 80 0.999 7.78 2.06 Intr - 83511 83445 67 0 1 92 101 79 0.992 8.08 2.05 Intr - 87758 87658 101 0 2 44 107 118 0.907 9.13 2.04 Intr - 91188 91113 76 0 1 76 66 101 0.886 5.79 2.03 Intr - 93699 93642 58 2 1 58 115 43 0.083 2.89 2.02 Intr - 99448 99250 199 2 1 44 78 187 0.610 11.71 2.01 Init - 104813 103946 868 2 1 75 83 456 0.556 38.53 2.00 Prom - 106378 106339 40 -2.46 3.00 Prom + 115815 115854 40 -3.16 3.01 Init + 124055 124103 49 1 1 77 58 40 0.789 -0.99 3.02 Intr + 124589 124996 408 0 0 89 87 176 0.189 11.84 3.03 Intr + 129132 129334 203 1 2 53 80 216 0.152 16.30 3.04 Intr + 150406 150468 63 0 0 122 100 61 0.784 9.61 3.05 Intr + 151941 152054 114 2 0 78 102 54 0.986 6.44 3.06 Intr + 157058 157143 86 1 2 113 91 34 0.975 4.82 3.07 Intr + 165152 165332 181 2 1 96 75 116 0.975 10.97 3.08 Intr + 167181 167301 121 2 1 99 107 -10 0.724 2.07 3.09 Term + 169088 169254 167 0 2 95 43 182 0.979 12.48 3.10 PlyA + 170302 170307 6 1.05 4.00 Prom + 187827 187866 40 -2.46 4.01 Init + 194218 194295 78 0 0 93 53 150 0.814 13.06 4.02 Intr + 196374 196452 79 2 1 99 78 45 0.606 3.72 4.03 Intr + 206866 206936 71 2 2 113 83 -23 0.203 -1.40 4.04 Intr + 209156 209259 104 1 2 100 97 26 0.340 3.67 4.05 Term + 209346 209463 118 0 1 92 38 29 0.317 -3.69 4.06 PlyA + 210047 210052 6 1.05 5.07 PlyA - 211370 211365 6 1.05 5.06 Term - 220258 220060 199 0 1 116 47 107 0.674 6.27 5.05 Intr - 239424 239313 112 1 1 73 110 20 0.398 2.14 5.04 Intr - 246199 246162 38 0 2 105 86 49 0.358 4.31 5.03 Intr - 252507 252413 95 0 2 102 49 56 0.227 1.86 5.02 Intr - 254105 254045 61 1 1 81 38 60 0.045 -0.96 5.01 Init - 268842 268835 8 0 2 114 91 0 0.111 3.40 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 93698 93642 57 2 0 75 115 45 0.907 7.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:40757182_41026432|GENSCAN_predicted_peptide_1|219_aa MPGLTEFDQAKNNSQIGQPSEPEGVVNVLLTTPLWVVNTRLKLQGAKFRNEDIVPTNYKG IIDAFHQIIRDEGISALWNGTFPSLLLVFNPAIQFMFYEGLKRQLLKKRMKLSSLDVFII GAVAKAIATTVTYPLQTVQSILRFGRHRLNPENRTLGSLRNILYLLHQRVRRFGIMGLYK GLEAKLLQTVLTAALMFLVYEKLTAATFTVMGLKRAHQH >gi568815576f:40757182_41026432|GENSCAN_predicted_CDS_1|660_bp atgcctggcctaacagagtttgatcaagcaaagaacaattcgcaaattgggcagccctca gaaccagaaggagtggttaatgtgttgctaacaactccactctgggtggtaaacaccaga ctgaagcttcaaggagcaaaatttaggaatgaagacattgtaccaacaaactacaaaggt atcattgatgcttttcatcagatcattcgcgatgaaggaatctcggctttatggaatggc acatttccctcattgctgttggtcttcaatcctgccatccagttcatgttttatgaaggt ttaaaacggcagcttttaaagaaacggatgaagctttcttccttggatgtgttcatcatt ggtgcagtagccaaagcgattgccaccacggtgacctatcccctgcagacggtacagtca attctgaggtttgggcgtcatagactaaacccagaaaacagaacattgggaagtcttcgg aatattctctatcttcttcaccaacgagtaagacgttttggaataatgggactctacaaa ggccttgaagccaaactgctgcagacagtcctcactgctgctctcatgttccttgtttat gagaaactgacagctgccaccttcacagttatggggctgaagcgtgcacaccaacactga >gi568815576f:40757182_41026432|GENSCAN_predicted_peptide_2|900_aa MVDYYEVLGLQRYASPEDIKKAYHKVALKWHPDKNPENKEEAERKFKEVAEAYEVLSNDE KRDIYDKYGTEGLNGGGSHFDDECEYGFTFHKPDDVFKEIFHERDPFSFHFFEDSLEDLL NRPGSSYGNRNRDAGYFFSTASEYPIFEKFSSYDTGYTSQGSLGHEGLTSFSSLAFDNSG MDNYISVTTSDKIVNGRNINTKKIIESDQEREAEDNGELTFFLVNSVANEEGFAKECSWR TQSFNNYSPNSHSSKHVSQYTFVDNDEGGISWVTSNRDPPIFSAGVKEGAAPFCAVTPSQ RLGLEPGRSPPSFAHHLPTMDPRKVNELRAFVKMCKQDPSVLHTEEMRFLREWVESMGGK VPPATQKAKSEENTKEEKPDSKKVEEDLKADEPSSEESDLGLIVIFSTLAEIDKEGVIEP DTDAPQEMGDENAEITEEMMDQANDKKVAAIEALNDGELQKAIDLFTDAIKLNPRLAILY AKRASVFVKLQKPNAAIRDCDRAIEINPDSAQPYKWRGKAHRLLGHWEEAAHDLALACKL DYDEDASAMLKEVQPRAQKIAEHRRKYERKREEREIKERIERVKKAREEHERAQREEEAR RQSGAQYGSFPGGFPGGMPGNFPGGMPGMGGGMPGMAGMPGLNEILSDPEVLAAMQDPEV MVAFQDVAQNPANMSKYQSNPKVMNLISKLSAKFGVWLGLRGAAPMASVLSYESLVHAVA GAVGSVTAMTVFFPLDTARLRLQVDEKRKSKTTHMVLLEIIKEEGLLAPYRGWFPVISSL CCSNFVYFYTFNSLKALWVKGQHSTTGKDLVVGFVAANFPEQVVDNLPADISSGIYYGWA SAGSRDGQKMVVSIGWNPYYENTKKSMETHTMHTFKEHFYGEILSVAVVGYLRPEKNFDS >gi568815576f:40757182_41026432|GENSCAN_predicted_CDS_2|2703_bp atggtggattactatgaagttctaggactgcaaagatatgcttcacctgaggacattaaa aaagcttatcataaagtggcacttaaatggcaccctgataaaaatccagaaaataaagaa gaagcagagagaaaattcaaagaagtagctgaggcatacgaggtattatcaaatgatgag aaacgggacatttatgataaatatggcacagaaggattaaacggaggtggaagtcatttt gatgatgaatgtgagtacggcttcacattccataagccagatgatgtttttaaagaaatt tttcatgaaagggatccattttcttttcacttctttgaagactcgcttgaggacctgtta aatcgtccaggaagctcctatggaaacagaaacagagatgcaggatactttttctccact gccagtgaatatccaatttttgagaaattttcttcatatgatacaggatatacatcacag ggttcattggggcatgaaggccttacttctttctcttccctggcttttgataatagtggg atggacaactacatatctgttacaacttcagacaaaatcgttaatggcagaaatattaat acaaagaaaattattgaaagtgatcaagaaagagaagctgaagataatggagagttgaca ttttttcttgtaaatagtgtggccaatgaagagggctttgcaaaagaatgcagctggaga acacagtcattcaacaactattcaccaaattctcacagctccaaacatgtatctcaatat actttcgtggacaatgatgagggaggtatatcttgggttaccagcaacagagatccccct attttctcagcaggagtcaaagagggtgccgcccccttctgcgcggtcacgccgagccag cgcctgggcctggaaccgggccgtagcccccccagtttcgcccaccacctccctaccatg gacccccgcaaagtgaacgagcttcgggcctttgtgaaaatgtgtaagcaggatccgagc gttctgcacaccgaggaaatgcgcttcctgagggagtgggtggagagcatgggtggtaaa gtaccacctgctactcagaaagctaaatcagaagaaaataccaaggaagaaaaacctgat agtaagaaggtggaggaagacttaaaggcagacgaaccatcaagtgaggaaagtgatcta ggtcttattgtcattttctcgactttggcagaaattgataaagaaggtgtgattgaacca gacactgatgctcctcaagaaatgggagatgaaaatgcggagataacggaggagatgatg gatcaggcaaatgataaaaaagtggctgctattgaagccctaaatgatggtgaactccag aaagccattgacttattcacagatgccatcaagctgaatcctcgcttggccattttgtat gccaagagggccagtgtcttcgtcaaattacagaagccaaatgctgccatccgagactgt gacagagccattgaaataaatcctgattcagctcagccttacaagtggcgggggaaagca cacagacttctaggccactgggaagaagcagcccatgatcttgcccttgcctgtaaattg gattatgatgaagatgctagtgcaatgctgaaagaagttcaacctagggcacagaaaatt gcagaacatcggagaaagtatgagcgaaaacgtgaagagcgagagatcaaagaaagaata gaacgagttaagaaggctcgagaagagcatgagagagcccagagggaggaagaagccaga cgacagtcaggagctcagtatggctcttttccaggtggctttcctgggggaatgcctggt aattttcccggaggaatgcctggaatgggagggggcatgcctggaatggctggaatgcct ggactcaatgaaattcttagtgatccagaggttcttgcagccatgcaggatccagaagtt atggtggctttccaggatgtggctcagaacccagcaaatatgtcaaaataccagagcaac ccaaaggttatgaatctcatcagtaaattgtcagccaaatttggagtgtggctgggtctt cgaggagccgcaccaatggcttccgtgctgtcctacgaaagcctggtccacgccgtggcc ggagccgtgggaagcgtgacagcaatgacagtgttttttcccctggatacagctagactt cgacttcaggttgatgagaaaagaaaatccaaaactacacacatggtgctcctggagatc attaaagaagaaggactcctggcaccatatcgagggtggtttccagtgatttccagtctc tgctgctccaattttgtctatttctacacttttaatagcctcaaagcactctgggtcaaa ggtcaacattctaccactggaaaagatctggtagttgggtttgttgcagctaattttcct gagcaagtggtagataatctgccagccgatatatccagtggcatttattatggttgggcc agtgctggaagtagagatggtcagaagatggtggtgagcataggatggaacccatactat gagaatacgaagaagtccatggaaacacataccatgcataccttcaaagagcacttctat ggggaaatcctgagtgtggccgttgttggctacctcagaccagaaaagaactttgattca tga >gi568815576f:40757182_41026432|GENSCAN_predicted_peptide_3|463_aa MRFYHVGRAGLELLTSGEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLSNPT YYMSNDIPYTFHQDNNFLYLCGFQEPDSILVLQSLPGKQLPSHKAILFVPRRDPSRELWD GPRSGTDGAIALTGVDEAYTLEEFQHLLPKMKAETNMVWYDWMRPSHAQLHSDYMQPLTE AKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQIAGKLTSQAFIETMFTSKAPVEEAFLYA KFEFECRARGADILAYPPVVAGGNRSNTLHYVKNNQLIKDGEMVLLDGGCESSCYVSDIT RTWPVNGRFTAPQAELYEAVLEIQRDCLALCFPGTSLENIYSMMLTLIGQKLKDLGIMKN IKENNAFKAARKYCPHHVGHYLGMDVHDTPDMPRSLPLQPGMVITIEPGIYIPEDDKDAP EKFRGLGVRIEDDVVVTQDSPLILSADCPKEMNDIEQICSQAS >gi568815576f:40757182_41026432|GENSCAN_predicted_CDS_3|1392_bp atgaggttttaccatgttggccgggctggtcttgaactcctgacctcaggggaggtaact ccaggactatctcaggtggaatatgcacttcgcagacacaaactaatgtctctgatccag aaggaagctcaagggcagagtgggacagaccagacagtggttgtgctctccaaccctaca tactacatgagcaacgatattccctatactttccaccaagacaacaatttcctgtaccta tgtggattccaagagcctgatagcattcttgtccttcagagcctccctggcaaacaatta ccatcacacaaagccatactttttgtgcctcggcgagatcccagtcgagaactttgggat ggtccgcgatctggcactgatggagcaatagctctaactggagtagacgaagcctatacg ctagaagaatttcaacatcttctaccaaaaatgaaagctgagacgaacatggtttggtat gactggatgaggccctcacatgcacagcttcactctgactatatgcagcccctgactgag gccaaagccaagagcaagaacaaggttcggggtgttcagcagctgatacagcgcctccgg ctgatcaagtctcctgcagaaattgaacgaatgcagattgctgggaagctgacatcacag gctttcatagaaaccatgttcaccagtaaagcccctgtggaagaagcctttctttatgct aagtttgaatttgaatgccgggctcgtggcgcagacattttagcctatccacctgtggtg gctggtggtaatcggtcaaacactttgcactatgtgaaaaataatcaactcatcaaggat ggggaaatggtgcttctggatggaggttgtgagtcttcctgctatgtgagtgacatcaca cgtacgtggccagtcaatggcaggttcaccgcacctcaggcagaactctatgaagccgtt ctagagatccaaagagattgtttggccctctgcttccctgggacaagcttggagaacatc tacagcatgatgctgaccctgataggacagaagcttaaagacttggggatcatgaagaac attaaggaaaataatgccttcaaggctgctcgaaaatactgtcctcatcatgttggccac tacctcgggatggatgtccatgacactccagacatgccccgttccctccctctgcagcct gggatggtaatcacaattgagcccggcatttatattccagaggatgacaaagatgcccca gagaagtttcggggtcttggtgtacgaattgaggatgatgtagtggtgactcaggactca cctctcatcctttctgcagactgtcccaaagagatgaatgacattgaacagatatgcagc caggcttcttga >gi568815576f:40757182_41026432|GENSCAN_predicted_peptide_4|149_aa MAAAMDVDTPSGTNSGAGKKRFEVKKWNAVALWAWDIVVDNCAICRNHIMDLCIECQANQ ASATSEECTVAWGVCNRVVCSDLVGLYQDTVLLQSAQWVRWEREKLSNTLRELYKSQLKR DLPKGCHSKRIFVLLGIPSVLAVDSTLSY >gi568815576f:40757182_41026432|GENSCAN_predicted_CDS_4|450_bp atggcggcagcgatggatgtggataccccgagcggcaccaacagcggcgcgggcaagaag cgctttgaagtgaaaaagtggaatgcagtagccctctgggcctgggatattgtggttgat aactgtgccatctgcaggaaccacattatggatctttgcatagaatgtcaagctaaccag gcgtccgctacttcagaagagtgtactgtcgcatggggagtctgtaaccgcgtggtatgc tctgacttagttgggctctaccaggacaccgtactgctgcaaagtgctcagtgggtgagg tgggagagggagaagctgtcgaatacccttcgtgagttatacaagagtcagctgaaaagg gatcttccaaaaggctgtcactctaagagaatatttgtcctcttgggtatccctagtgta ctagctgtggattccactcttagttactga >gi568815576f:40757182_41026432|GENSCAN_predicted_peptide_5|170_aa MPRCYEHGDKPDILPPDSLWSKGPLRGQLGPEPVPTPMASLFQACLPGEVQDAAGFSTWY KKSGPEKDLFTWLFESWLLVNKLVLNPALHTTVFPCLGFARGPSRAPTTLEQVIVCPGDL VWDSPPMSDDCLHQVSQLACHIPKEDFPAPLLEVATQFYSMTPQPSSLST >gi568815576f:40757182_41026432|GENSCAN_predicted_CDS_5|513_bp atgcccaggtgctacgaacatggtgataagcctgatatcttgcctccagacagcctgtgg tctaagggacctttgcgaggtcagcttggccctgagcctgtgcccacacccatggcctcc ctctttcaggcctgtttacctggtgaggtgcaggatgctgcagggttctccacctggtac aagaagagtggcccagagaaagacctttttacttggctcttcgaaagctggctcttggtc aacaaacttgtcctcaaccccgcccttcacaccacagtcttcccctgcctggggtttgct cgggggcccagcagagctccaaccacactggagcaagtcatcgtctgccccggggacttg gtctgggactccccacctatgtcagatgactgtcttcatcaagtctcacaacttgcctgt catatccccaaagaggacttccctgcccctctgttagaagtggccacacagttttactct atgacacctcaaccctcttcactgtctacgtga