GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:48:19

Sequence gi568815583f:28689199_28889483 : 200285 bp : 43.02% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.23 PlyA -     49     44    6                               1.05
 1.22 Term -  12839  12655  185  0  2   31   40   116 0.431  -1.19
 1.21 Intr -  13171  13016  156  2  0   68   89   186 0.932  16.68
 1.20 Intr -  13364  13267   98  1  2  101   97    40 0.942   5.85
 1.19 Intr -  13547  13447  101  2  2   26   70    99 0.722   0.81
 1.18 Intr -  14212  14121   92  2  2   92   89    27 0.796   2.91
 1.17 Intr -  14719  14644   76  1  1   85  102   105 0.810  10.69
 1.16 Intr -  15047  14952   96  2  0   69   37   164 0.722   9.61
 1.15 Intr -  16029  15889  141  0  0   85   66   115 0.957   9.55
 1.14 Intr -  16541  16285  257  0  2   85   90   374 0.996  34.46
 1.13 Intr -  17005  16918   88  1  1   88   86    77 0.957   7.04
 1.12 Intr -  17308  17201  108  1  0   78   78    36 0.805   2.08
 1.11 Intr -  17501  17415   87  2  0   59   94    82 0.985   6.07
 1.10 Intr -  18659  18550  110  0  2   82  109   107 0.859  12.20
 1.09 Intr -  18839  18755   85  2  1   86   81    43 0.984   2.79
 1.08 Intr -  18975  18928   48  0  0  116  105    44 0.968   7.78
 1.07 Intr -  19100  19005   96  2  0  105   99   -24 0.578   0.61
 1.06 Intr -  19215  19177   39  0  0  123   21    55 0.621   0.72
 1.05 Intr -  20149  20069   81  1  0   87   91    64 0.967   6.43
 1.04 Intr -  20390  20331   60  2  0   68   94    42 0.755   1.73
 1.03 Intr -  21408  21289  120  0  0   92   76    34 0.903   3.29
 1.02 Intr -  29291  29236   56  0  2   77  110    43 0.254   4.10
 1.01 Init -  36029  36005   25  2  1   89   81    37 0.250   2.83
 1.00 Prom -  42224  42185   40                              -3.16

 2.00 Prom +  42940  42979   40                              -3.36
 2.01 Init +  48385  48717  333  0  0  100   10   450 0.926  35.70
 2.02 Term +  48796  48957  162  0  0  -45   45   310 0.415  11.34
 2.03 PlyA +  50498  50503    6                              -0.45

 3.00 Prom +  51334  51373   40                              -3.16
 3.01 Init +  51942  52068  127  2  1   77   66   122 0.695   9.22
 3.02 Term +  60897  60907   11  1  2  103   39     1 0.067  -4.94
 3.03 PlyA +  61080  61085    6                               1.05

 4.03 PlyA -  61301  61296    6                               1.05
 4.02 Term -  74941  74704  238  0  1   -6   41   267 0.130   8.24
 4.01 Init -  89265  89219   47  0  2   83   82    28 0.186   1.86
 4.00 Prom -  94704  94665   40                              -5.76

 5.00 Prom +  98238  98277   40                              -1.66
 5.01 Init + 100001 100275  275  1  2   48   15   806 0.131  63.74
 5.02 Intr + 100927 100969   43  1  1   97  105    30 0.197   3.84
 5.03 Intr + 102736 102804   69  0  0   80   53    72 0.149   2.28
 5.04 Intr + 108217 108277   61  0  1   62   42    39 0.020  -4.99
 5.05 Intr + 108612 108662   51  1  0   62  102    38 0.040   1.48
 5.06 Intr + 119050 119214  165  2  0   65   45   137 0.542   7.03
 5.07 Intr + 124645 124787  143  2  2   57   11   144 0.633   3.67
 5.08 Intr + 126149 126397  249  1  0   97   67   130 0.975   9.43
 5.09 Intr + 129706 129800   95  0  2   72  106    37 0.466   2.66
 5.10 Intr + 136749 136789   41  0  2   55  103    42 0.174   0.27
 5.11 Term + 139987 140048   62  2  2   77   49    60 0.304  -1.03
 5.12 PlyA + 140955 140960    6                               1.05

 6.11 PlyA - 141423 141418    6                               1.05
 6.10 Term - 154242 153037 1206  0  0   28   42  2211 0.769 202.17
 6.09 Intr - 154677 154536  142  2  1   61   99   244 0.992  23.16
 6.08 Intr - 155065 155007   59  1  2  127   16    46 0.982  -1.02
 6.07 Intr - 156437 156331  107  0  2   80  113    73 0.915   8.93
 6.06 Intr - 156613 156520   94  1  1   92   81    85 0.991   7.74
 6.05 Intr - 157051 157022   30  1  0  128   77    46 0.983   5.83
 6.04 Intr - 157994 157866  129  2  0  102   76    98 0.993  10.89
 6.03 Intr - 164801 164700  102  2  0   23   64   106 0.069   2.07
 6.02 Intr - 169256 169123  134  0  2   97    4   132 0.017   6.06
 6.01 Init - 176430 176376   55  0  1   83   98    -6 0.137  -0.65
 6.00 Prom - 185045 185006   40                              -2.96

 7.04 PlyA - 185299 185294    6                               1.05
 7.03 Term - 195466 195248  219  1  0   81   48   121 0.852   4.44
 7.02 Intr - 196749 196447  303  0  0   19   80   167 0.455   5.79
 7.01 Init - 198628 198497  132  1  0  102   69    49 0.487   3.65

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  23125  23078   48  1  0   70   93    54 0.975   5.05

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:28689199_28889483|GENSCAN_predicted_peptide_1|734_aa
MEMDIIQPGLAVRCCVLERRIVPSLWQLKEYWQKNSPRVPAGANRNRKTNGSIPQTATSG
GCQPPGDSATGFHREGPTSSATLKDLESPCQERAVVLDSRSVEISQLKNTIKSLKQQKKQ
VEHQLEEGPDNLVPWVGLSWGIGGILGACLLLCHLCLPLEKKANNKKQKAKRVLEVQLQT
LNIQKEELNTDLYHMKRSLRYFEEKSKDLAVRLQHSLQRKGELESVLSDVMATQKKKANQ
LSSPSKAGTEWKLEQSMREEALLKVQLTQLKESFQQVQLERDEYSEHLKGERARWQQRMR
KMSQEICTLKKEKQQDMRRVEKLERSLSKLKNQMAEPLPPEPPAVPSEVELQHLRKELER
VAGELQAQVKNNQRISLLNQRQEERIREQEERLRKQEERIQEQHKSLQQLAKPQSVFEEP
NNENKSTLQLEQQVKELQEKLGEVKESETSTPSKKGWEAGSSLWGGEVELKSQEAQSLQQ
QPDHYLGHLQQSVATYQQQEHLEAASQQNQQLTAQLSLMALPGEGHGGEHLDSEGEEAPQ
PMPSVPEDPESREAMSSFMDHLEEKADLSELVKKQELRFIQYWQERCHQKIHHLLSEPGG
RAKDAALGGGHHQAGAQGGDEGEAAGAAADGIAAYSNYNNGHRKFLAAAHNSADEPGPGA
PAPQELGAADKHGGPPGAPRLGQQLLCAILLLGLAAKKKEINITILKELLKKFLNKKPSY
GVNLLHNSFTSFEC

>gi568815583f:28689199_28889483|GENSCAN_predicted_CDS_1|2205_bp
atggagatggacatcatccaacctggcctggctgtgcgctgctgtgttctggaaaggcgc
attgtgccctcgctgtggcagttaaaagaatattggcagaaaaacagccctagagttcca
gcaggagcgaacaggaacaggaaaacaaatggcagtatccctcagacagccacttctggt
ggttgccagccacctggggattcagcaacaggttttcacagggaaggccctacatcatct
gctaccctgaaagatctggagagcccgtgccaagaacgagcagtagtcctggattcaagg
tccgtagaaatcagtcaactgaagaacaccatcaaatctttgaaacaacagaagaaacaa
gtggaacatcagctggaagaaggccccgataacctggtcccatgggtgggcctgtcctgg
ggcattggtggcattctgggggcatgtctcttgctgtgccatctctgcctccccctggaa
aagaaagcaaacaacaagaaacagaaagccaaaagggtgctagaggttcaactccagaca
ttgaacatacagaaagaggaactaaatacggacctgtaccacatgaaacgttctctcaga
tactttgaagaaaagtccaaggatctggctgtccgcctgcaacattcattgcagcgtaaa
ggagagttagagagtgttctctctgatgtcatggccacacagaagaagaaggcaaaccag
ttgtccagccccagtaaagcaggtacggagtggaagttagagcagtccatgcgggaggag
gcactactgaaagtgcagctgacacagttgaaggagtcatttcaacaagtccaattagaa
agagatgagtattctgaacatctaaaaggagagagggcccggtggcagcagaggatgaga
aaaatgtcgcaggagatttgcacattaaagaaagagaagcagcaagatatgcgtcgggta
gagaagctggagaggagcttgtccaaactcaaaaaccagatggctgaacccttgcccccg
gagcccccagcagtgccctctgaggtggagctgcagcacctgaggaaggaactagagaga
gtggcaggagagctccaggcccaggtcaaaaacaatcagcgcataagtctcctgaaccag
cgacaagaagagaggattcgggagcaggaagagaggcttcggaagcaggaggagaggatt
caggagcagcacaagagccttcagcagctggccaagccacagagcgtcttcgaggagccg
aacaatgagaacaagagcacactgcagttggagcagcaagtaaaggagctacaggagaag
cttggcgaggtgaaggagtcggaaacctccaccccatccaagaagggctgggaggcgggc
agcagcctctggggaggggaggtggagctgaagagccaagaggctcagagtctgcagcag
cagccagaccattacctgggtcacctgcagcagtccgtggccacctatcagcagcaggag
cacctggaagctgccagccagcagaaccagcagctaacggcccagctgagcctcatggct
ctccctggggaaggacacggaggagaacatctggacagtgagggggaggaggcacctcag
cccatgccgagtgtcccagaggacccggagagcagggaggccatgagcagctttatggac
cacctggaggagaaggcagacctgagtgagcttgtgaagaaacaagaacttcgcttcatt
caatactggcaagagagatgccatcagaaaatccatcaccttttatcagaaccagggggc
cgtgccaaagatgcggcactgggaggaggacaccatcaggctggagctcagggaggagat
gaaggtgaagctgctggagctgcagcagatggtattgcggcttacagcaactacaacaat
gggcacagaaaattcctggccgctgcccacaactctgctgatgagcccggtccaggagcc
ccagctccccaggagcttggggctgcagacaagcatggtggaccaccaggagcacccagg
cttgggcagcaactgctgtgtgccattcttttgctgggcttggctgccaagaagaaggag
ataaacatcaccatcctcaaagagctgctcaagaaatttttaaataagaaaccaagttat
ggggttaatctcctacacaattcatttacttcctttgaatgttag

>gi568815583f:28689199_28889483|GENSCAN_predicted_peptide_2|164_aa
MEDEQPDSLEGWAPLREGLFADPQRHRLRFLVAWNGAEGKFAVTCHDRTAQQPQRREGAR
LGLEHKPEAAVSPPSWAGRLSAAGFRGARRQPAALWPPLEHCFPRLPPELDAALQELCGQ
LERYLGAAAHGCGGATVRDALFAAKGRAADCESPREFRERALRA

>gi568815583f:28689199_28889483|GENSCAN_predicted_CDS_2|495_bp
atggaggacgagcagcccgacagcctggagggctgggcgccgctccgggagggcctcttc
gccgatccccagaggcaccggttgcgcttcctggtggcttggaacggcgcggagggcaag
tttgctgtgacttgtcacgaccgcaccgcgcagcagccgcagcggcgcgagggggcccgg
ctggggctggagcacaagcccgaggccgccgtgtccccgcccagctgggccggccggctc
tcggccgcggggttccgcggcgcgcgccggcagccagcggcgctgtggccgcctctggaa
cactgcttcccacggctgccgccggagctggacgcggcgctgcaggagctttgcgggcag
ctggagcgctacctgggcgcggcggcccacggctgtggcggcgccaccgtgcgcgacgct
ctcttcgcggctaagggccgcgcggccgactgcgagagcccgcgcgagtttcgggagcgg
gccctgcgcgcctga

>gi568815583f:28689199_28889483|GENSCAN_predicted_peptide_3|45_aa
MVALMKVYQEEDEAYQELVTVATTFFQYLLRPFRAMREVATLYEI

>gi568815583f:28689199_28889483|GENSCAN_predicted_CDS_3|138_bp
atggtagcattaatgaaagtttaccaagaggaagatgaagcctaccaggaattagttacc
gtggcaaccacgttcttccagtacttattgcggccatttagggctatgcgagaagttgca
actttatatgaaatttag

>gi568815583f:28689199_28889483|GENSCAN_predicted_peptide_4|94_aa
MVKYALVQLYTINEYSFFVFGGALKSSSGYLAKSSIVEDGVMVQITAENMDSSRQALLET
RDFSITCGKADAEDPQERMHIRWVDDDKNVSKGV

>gi568815583f:28689199_28889483|GENSCAN_predicted_CDS_4|285_bp
atggtcaaatatgcacttgtacaactttatactatcaatgagtatagtttttttgtgttc
ggtggcgctctgaaatcctcttctggataccttgccaagtccagtattgtggaagatggc
gttatggtccagatcactgcagagaacatggattcctcgaggcaggcactgctagagacg
agggacttcagcatcacctgtgggaaggcagacgcggaggatccccaggagcgcatgcac
atccggtgggtggatgatgacaagaacgttagcaagggtgtctaa

>gi568815583f:28689199_28889483|GENSCAN_predicted_peptide_5|417_aa
MLGARVAAHLDALGPLVPYVPPPLLPSMFYVGLFFVNVLILYYAFLMEYIVLNVGLVFLF
EDMDQALVDLGVLSDPGSGLYDADSELDVFDGLLVPQQVGKTDIMLAIEKLQAGALATDA
VTAALVELEGDRRNFYGLPVLEMPSGHGDVELTTLERQLDAFSSHLQGTSFRTVLDKAVQ
ADGHVKECYPSHRDPIVLLCKPEPELNAAIPSANPAKTMQGRCLEDIDVNEAKKEIRLHL
AETSSLWSSQRDVTLDVPIKELTKTVEERVVNVLKSLLSNLDEVKKEREGLENDLKSVNF
DMTSKFLTALAQDGMINEEALSVTELDRVYGGLTTKVQESLKKQEGLLKNIQFYNELTEI
LVRFQNKCSDIVLAWKTERDELLKSRMDYIICRNAFASNKNLLGSSFGEPDMENGLN

>gi568815583f:28689199_28889483|GENSCAN_predicted_CDS_5|1254_bp
atgctcggcgcccgggtcgcggcccacctggacgcactgggccccctggtcccctacgtg
ccgccgccgctgctgccctctatgttctacgtgggcctgttcttcgtcaatgtgctgatc
ctgtactacgccttcctcatggagtacatcgtcctcaacgtgggcctcgtcttcctgttc
gaggacatggaccaggcgctcgtggacctcggcgtgctctccgaccccggctcgggcctt
tacgatgctgactcggagctcgacgtctttgatgggttactcgttcctcagcaagttggc
aaaacagatatcatgctggcaattgaaaagctgcaggcgggtgctcttgcaactgacgca
gtcactgcagcactggtggaacttgagggggatagaagaaacttctacggcctgccagtc
cttgaaatgcctagtggtcatggagatgtagagctgaccacactggaacgacagcttgat
gcatttagctcacatttacagggaaccagcttcagaacagttttagataaagctgtgcaa
gcagatggacacgtgaaagaatgttacccgtctcatcgtgaccccatcgtgcttttgtgt
aagccagagcctgagctgaatgctgccatcccttctgctaatccagcaaagaccatgcag
ggcagatgtttagaagatatagatgtaaatgaggccaaaaaggaaattcgcctacattta
gcagaaacctcgagtctctggtcttcacaaagggatgttactttggacgttcctatcaag
gagttaactaagactgtggaagaaagagttgtaaatgtcttaaaatccttattgtcaaat
cttgatgaagtaaagaaggaaagagagggtctggagaatgacttgaaatctgtgaatttt
gacatgacaagcaagtttttgacagccctggctcaagatggcatgataaatgaagaagct
ctttctgttactgaactagatcgagtctatggaggtcttacaactaaagtccaagaatct
ctaaagaaacaggagggacttcttaaaaatattcagttttacaatgagttgactgaaatc
ctggtcaggttccagaacaaatgcagcgatatagttttggcatggaagacagaaagagat
gaactcttaaagagtaggatggattatattatttgtcgaaatgcctttgcatctaacaag
aacctcttgggttcatcctttggagaacctgatatggaaaatggactgaactag

>gi568815583f:28689199_28889483|GENSCAN_predicted_peptide_6|685_aa
MGLGTVAHACNPSTLRGQGPANTVVLEIRLAPYCQPTTSSQIRQNDSTEEEAAVSCLVHV
HRRSFVELQFDDTPGKNTSHRNYCDPCFPKRVLGTQYFTDYHQWNSAGVGTGATDTKKKK
INHGANPETTTSGGCHSPEDKQQNRAQLKEAQDHTIRILMCQKTELETALHDSQDAARKF
EEDSKDLAARLHHSWHFAGELQRALSAMSAEHERADKYIKELTKEREAMSLELFRNIITN
KELKEKNAELQEKLRLVETEKSEIQLHIKELKRKLETDKIPLPQVQTNTLQEKMWRQEEE
LRDQEELRDQEKLRKHEEKMWRQEQRLRDQEKELREQEQQMQEQEEQMRKQEEQMRKQEE
QMRKQEEQMRKQEEQMRKQEEQMRKQEEQMGKQEEQMGEQEEQMRKQEKQMLKQKEQMRK
QEEQMWKQEEQIGEQEEQMRKQEEQMWKQEEQIGEQEEQMRKQEEQMWKQEEQMGEQMRK
QEEQMGEQEEQIRKQEEQMGEQEEQMRKQEEQMGEQEEQMRKQEEQMGEQEEQMRKQEEQ
MGEQEEQMGEQEEQMRKQVERLQFKEERLWDEYEKMQEEEEKIRRQVEKRREKKERMGEQ
EKTQEERCSEPCLPPSKYPSDMSHPGSLEPAREAGKGYSHDNRTAQIMQLPPGMKNAQER
PGLGSTSCIPFFYGGDKKKIKIISI

>gi568815583f:28689199_28889483|GENSCAN_predicted_CDS_6|2058_bp
atggggctgggcacagtggctcacgcctgtaatcccagcactttgagaggccaaggacca
gccaacaccgttgtgttagaaataaggttagctccttattgccaaccaaccacaagctcc
cagattcgtcagaatgattccaccgaggaggaagctgcagtcagctgcctggtccatgtg
catcggcggagctttgtggagctacaatttgacgacacccccggaaaaaacacaagccac
agaaactactgtgacccgtgctttccaaagagggttttgggaactcagtattttacagac
tatcatcagtggaacagtgctggtgttggtaccggagcaaccgacaccaaaaagaagaaa
ataaatcatggcgctaaccctgagacaaccacttcggggggctgccactcgcctgaggat
aaacaacagaaccgagctcagctgaaagaggcccaggatcataccatacgaatccttatg
tgtcagaaaactgaactggagacagcgctccatgacagccaggatgctgccaggaaattt
gaagaagattccaaggatctggcagcccgcctgcatcattcctggcactttgcaggagag
ttacagcgggctctctctgctatgtccgcagagcacgagagggcggacaagtacatcaag
gagttaacaaaggagagggaagccatgagtctggagctgttcaggaacatcataaccaat
aaggagctgaaggagaaaaatgccgaactacaagaaaaacttcgactggtagaaactgaa
aagtctgagatccagctccacatcaaggagctaaaaaggaaactggagacggacaaaatc
ccgctgccacaggttcaaaccaacactttgcaggagaagatgtggaggcaggaggaggag
ctacgggatcaggaggagctacgggatcaggagaagctacggaagcacgaggagaagatg
tggagacaggagcagaggctgcgggaccaggagaaggagctgcgggagcaggagcagcag
atgcaggagcaggaggagcagatgcggaagcaggaggagcagatgcggaagcaggaggag
cagatgcggaagcaggaggagcagatgcggaagcaggaggagcagatgcggaagcaggag
gagcagatgcggaagcaggaggagcagatggggaagcaggaggagcagatgggggagcag
gaggagcagatgcggaagcaggagaagcagatgctgaagcagaaggagcagatgcggaag
caggaggagcagatgtggaagcaggaggagcagataggggagcaggaggagcagatgcgg
aagcaggaggagcagatgtggaagcaggaggagcagataggggagcaggaggagcagatg
cgaaagcaggaggagcagatgtggaagcaggaggagcagatgggggagcagatgaggaag
caggaggagcagatgggggagcaggaggagcagatccggaagcaggaggagcagatgggg
gagcaggaggagcagatgcggaagcaggaggagcagatgggggagcaggaggagcagatg
cggaagcaggaggagcagatgggggagcaggaggagcagatgaggaagcaggaggagcag
atgggggagcaggaggagcagatgggggagcaggaggagcagatgcggaagcaggtggag
aggctgcaattcaaggaggagaggctgtgggatgagtatgagaagatgcaggaggaggag
gagaagatccggaggcaggtggagaagaggcgggagaagaaggagaggatgggagagcag
gagaagacgcaggaggagcggtgctcagagccctgcctccctccctccaaatatccttct
gatatgagccaccctggcagcctggagcctgcacgagaggccgggaagggttattcccat
gacaaccgcactgcacagatcatgcagctgccccctggaatgaagaacgcccaggagcgc
ccaggcttaggcagcacctcctgcatcccattcttctacggaggagacaagaaaaagatc
aagatcatcagtatctaa

>gi568815583f:28689199_28889483|GENSCAN_predicted_peptide_7|217_aa
MEPTPRAPLTVLLEPTEGRLPAHSHKLQLYPLTPWTLPGAPAFQIARREEREDKDASVPG
VREVLRPLAPALGQSAGQEGALGTPGPPGPVSGIGAGTQALEGAAPRTPRALGSTQIPGR
APAVLAGELQGSGPAGRASEREAPQMSTPSLSTFHLHRHSAYLANTKFIQPPVAQPFTNA
NSGKQQLRRSDSEKINAVIFLPYCYVVTSHRKPDTCA

>gi568815583f:28689199_28889483|GENSCAN_predicted_CDS_7|654_bp
atggagcccactccaagggcccccctgacagtgctgctagagcccaccgagggaaggctg
ccggcacacagccacaagctccagctttacccactcacgccctggacattgcctggagcc
ccagcgttccagatcgcgcgccgtgaagagagggaagacaaagatgcgtccgtgccgggc
gtgcgcgaggtcctgcgccccctggccccggcgctgggacagtctgcagggcaggagggt
gcactcgggaccccgggtccccccgggccggtctccgggatcggggctgggacccaggcg
ctggaaggcgcggcgccccgaacgccccgggccctggggagcacacagattcccgggcgt
gccccggccgtgctggctggggagctgcaggggtctggtcccgcaggtagagccagtgaa
cgcgaggctccgcagatgagcacaccttcactgtctactttccacctgcatagacattct
gcttacctggctaacacaaagttcatccagcctcctgtggcccagcctttcaccaatgcc
aatagtgggaagcaacaattaagaagaagtgattctgagaagataaatgcagtcatcttc
ttgccttactgttatgtggtcacatcccataggaaaccagacacctgtgcctga