GENSCAN 1.0    Date run:  3-Nov-116    Time: 00:22:26

Sequence gi568815576r:40670837_40919248 : 248412 bp : 43.44% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -     35     30    6                              -0.45
 1.02 Term -   3007   2648  360  1  0   -7   48   516 0.513  32.64
 1.01 Init -   3304   3134  171  1  0   91   34    70 0.480   1.35
 1.00 Prom -   4971   4932   40                             -10.74

 2.00 Prom +   5429   5468   40                              -9.06
 2.01 Init +   8625   8898  274  2  1   98  109   178 0.899  17.55
 2.02 Term +  10113  11092  980  1  2  124   43  1675 0.999 158.83
 2.03 PlyA +  11223  11228    6                               1.05

 3.03 PlyA -  11664  11659    6                               1.05
 3.02 Term -  39058  39001   58  0  1   78   41    62 0.552  -2.34
 3.01 Init -  40200  40160   41  0  2   69  117    74 0.872   6.15
 3.00 Prom -  77335  77296   40                              -1.16

 4.07 PlyA -  78925  78920    6                               1.05
 4.06 Term - 100145  99998  148  1  1  103   53   142 0.883   9.57
 4.05 Intr - 103183 103101   83  1  2   88  105    19 0.987   2.04
 4.04 Intr - 106299 106204   96  0  0  106   96    76 0.978  10.41
 4.03 Intr - 106537 106392  146  2  2   59   84    86 0.997   5.30
 4.02 Intr - 108289 108173  117  2  0   76  111    40 0.949   5.54
 4.01 Init - 114275 114206   70  2  1  110   77    52 0.993   7.51
 4.00 Prom - 116725 116686   40                              -8.36

 5.19 PlyA - 117099 117094    6                               1.05
 5.18 Term - 118327 118074  254  2  2   38   54   162 0.422   3.60
 5.17 Intr - 121840 121689  152  0  2   96   92    11 0.397   2.11
 5.16 Intr - 123744 123678   67  1  1   76   74    55 0.496   0.86
 5.15 Intr - 128247 128187   61  0  1   83  110    12 0.040   1.21
 5.14 Intr - 148441 148359   83  2  2   -5   69   122 0.016   0.36
 5.13 Intr - 155830 155713  118  1  1   95   38    79 0.215   3.64
 5.12 Intr - 156393 156260  134  1  2   87  110    48 0.722   7.26
 5.11 Intr - 158838 158790   49  0  1   63   76    15 0.717  -3.95
 5.10 Intr - 160120 160004  117  1  0   68   61   154 0.842  11.36
 5.09 Intr - 161835 161733  103  2  1   47   75    81 0.996   2.88
 5.08 Intr - 164834 164724  111  1  0   63   99   119 0.998   9.99
 5.07 Intr - 165051 164967   85  1  1   91   94    80 0.999   7.78
 5.06 Intr - 169856 169790   67  2  1   92  101    79 0.992   8.08
 5.05 Intr - 174103 174003  101  2  2   44  107   118 0.907   9.13
 5.04 Intr - 177533 177458   76  2  1   76   66   101 0.886   5.79
 5.03 Intr - 180044 179987   58  1  1   58  115    43 0.083   2.89
 5.02 Intr - 185793 185595  199  1  1   44   78   187 0.610  11.71
 5.01 Init - 191158 190291  868  1  1   75   83   456 0.556  38.53
 5.00 Prom - 192723 192684   40                              -2.46

 6.00 Prom + 202160 202199   40                              -3.16
 6.01 Init + 210400 210448   49  0  1   77   58    40 0.789  -0.99
 6.02 Intr + 210934 211341  408  2  0   89   87   176 0.189  11.84
 6.03 Intr + 215477 215679  203  0  2   53   80   216 0.152  16.30
 6.04 Intr + 236751 236813   63  2  0  122  100    61 0.783   9.61
 6.05 Intr + 238286 238399  114  1  0   78  102    54 0.977   6.44
 6.06 Intr + 243403 243488   86  0  2  113   91    34 0.430   4.82

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 180043 179987   57  1  0   75  115    45 0.907   7.21

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576r:40670837_40919248|GENSCAN_predicted_peptide_1|176_aa
MEKAGTYLDGGAKRVTISAPSDAPMFVMGVNHEKYGNSLKIISNASCTTSCLTPLAKNII
PAKPVGKVIPELNGKLTGVAFHVPTTNMLVMDLTCHLEKPAKYSDIKRMVKQALREILKG
ILGCTEHKVVSSDFDSDTHSSIFNAGAGIALNNDFVKLISWYDRVVNLMVHMASKE
>gi568815576r:40670837_40919248|GENSCAN_predicted_CDS_1|531_bp
atggagaaggctgggacttacttagatgggggagccaaaagagtcaccatctctgcccct
tctgatgcccccatgtttgtgatgggtgtgaatcatgagaagtatggaaacagcctcaag
attatcagcaatgcctcctgcaccaccagttgcttaacccccctggccaagaacatcatt
cctgccaagcctgtgggcaaagtcatccccgagctgaatgggaagctcactggcgtggcc
ttccacgtccccaccaccaacatgttggtcatggacctgacctgccatctggagaaacct
gccaaatacagtgacatcaagaggatggtgaagcaggcattgagggagatcctcaagggc
atcctgggctgtactgagcacaaggtcgtctcctctgactttgacagtgacacccattct
tccatcttcaatgctggggctggcattgccctcaacaatgactttgtcaagctcatttcc
tggtatgacagggtggtgaacctcatggtccacatggcctccaaggagtaa
>gi568815576r:40670837_40919248|GENSCAN_predicted_peptide_2|417_aa
MKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQPAWVEGSSARLWEQAT
GTGWMDLEASLLPTGPNASNTSDGPDNLTSAGSPPRTGSISYINIIMPSVFGTICLLGII
GNSTVIFAVVKKSKLHWCNNVPDIFIINLSVVDLLFLLGMPFMIHQLMGNGVWHFGETMC
TLITAMDANSQFTSTYILTAMAIDRYLATVHPISSTKFRKPSVATLVICLLWALSFISIT
PVWLYARLIPFPGGAVGCGIRLPNPDTDLYWFTLYQFFLAFALPFVVITAAYVRILQRMT
SSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWAPYYVLQLTQLSISRPTLTFVYLYNAA
ISLGYANSCLNPFVYIVLCETFRKRLVLSVKPAAQGQLRAVSNAQTADEERTESKGT
>gi568815576r:40670837_40919248|GENSCAN_predicted_CDS_2|1254_bp
atgaagaagggagtggggagggcagttgggcttggaggcggcagcggctgccaggctacg
gaggaagacccccttcccaactgcggggcttgcgctccgggacaaggtggcaggcgctgg
aggctgccgcagcctgcgtgggtggaggggagctcagctcggttgtgggagcaggcgacc
ggcactggctggatggacctggaagcctcgctgctgcccactggtcccaacgccagcaac
acctctgatggccccgataacctcacttcggcaggatcacctcctcgcacggggagcatc
tcctacatcaacatcatcatgccttcggtgttcggcaccatctgcctcctgggcatcatc
gggaactccacggtcatcttcgcggtcgtgaagaagtccaagctgcactggtgcaacaac
gtccccgacatcttcatcatcaacctctcggtagtagatctcctctttctcctgggcatg
cccttcatgatccaccagctcatgggcaatggggtgtggcactttggggagaccatgtgc
accctcatcacggccatggatgccaatagtcagttcaccagcacctacatcctgaccgcc
atggccattgaccgctacctggccactgtccaccccatctcttccacgaagttccggaag
ccctctgtggccaccctggtgatctgcctcctgtgggccctctccttcatcagcatcacc
cctgtgtggctgtatgccagactcatccccttcccaggaggtgcagtgggctgcggcata
cgcctgcccaacccagacactgacctctactggttcaccctgtaccagtttttcctggcc
tttgccctgccttttgtggtcatcacagccgcatacgtgaggatcctgcagcgcatgacg
tcctcagtggcccccgcctcccagcgcagcatccggctgcggacaaagagggtgacccgc
acagccatcgccatctgtctggtcttctttgtgtgctgggcaccctactatgtgctacag
ctgacccagttgtccatcagccgcccgaccctcacctttgtctacttatacaatgcggcc
atcagcttgggctatgccaacagctgcctcaacccctttgtgtacatcgtgctctgtgag
acgttccgcaaacgcttggtcctgtcggtgaagcctgcagcccaggggcagcttcgcgct
gtcagcaacgctcagacggctgacgaggagaggacagaaagcaaaggcacctga
>gi568815576r:40670837_40919248|GENSCAN_predicted_peptide_3|32_aa
MWAFWSALLTLPARPCAYDLAVFKVPTSPDIL
>gi568815576r:40670837_40919248|GENSCAN_predicted_CDS_3|99_bp
atgtgggccttttggagtgccctgctgacactccctgcccggccctgtgcttatgacttg
gcagtgtttaaggttcctaccagccctgacatcctgtga
>gi568815576r:40670837_40919248|GENSCAN_predicted_peptide_4|219_aa
MPGLTEFDQAKNNSQIGQPSEPEGVVNVLLTTPLWVVNTRLKLQGAKFRNEDIVPTNYKG
IIDAFHQIIRDEGISALWNGTFPSLLLVFNPAIQFMFYEGLKRQLLKKRMKLSSLDVFII
GAVAKAIATTVTYPLQTVQSILRFGRHRLNPENRTLGSLRNILYLLHQRVRRFGIMGLYK
GLEAKLLQTVLTAALMFLVYEKLTAATFTVMGLKRAHQH
>gi568815576r:40670837_40919248|GENSCAN_predicted_CDS_4|660_bp
atgcctggcctaacagagtttgatcaagcaaagaacaattcgcaaattgggcagccctca
gaaccagaaggagtggttaatgtgttgctaacaactccactctgggtggtaaacaccaga
ctgaagcttcaaggagcaaaatttaggaatgaagacattgtaccaacaaactacaaaggt
atcattgatgcttttcatcagatcattcgcgatgaaggaatctcggctttatggaatggc
acatttccctcattgctgttggtcttcaatcctgccatccagttcatgttttatgaaggt
ttaaaacggcagcttttaaagaaacggatgaagctttcttccttggatgtgttcatcatt
ggtgcagtagccaaagcgattgccaccacggtgacctatcccctgcagacggtacagtca
attctgaggtttgggcgtcatagactaaacccagaaaacagaacattgggaagtcttcgg
aatattctctatcttcttcaccaacgagtaagacgttttggaataatgggactctacaaa
ggccttgaagccaaactgctgcagacagtcctcactgctgctctcatgttccttgtttat
gagaaactgacagctgccaccttcacagttatggggctgaagcgtgcacaccaacactga
>gi568815576r:40670837_40919248|GENSCAN_predicted_peptide_5|900_aa
MVDYYEVLGLQRYASPEDIKKAYHKVALKWHPDKNPENKEEAERKFKEVAEAYEVLSNDE
KRDIYDKYGTEGLNGGGSHFDDECEYGFTFHKPDDVFKEIFHERDPFSFHFFEDSLEDLL
NRPGSSYGNRNRDAGYFFSTASEYPIFEKFSSYDTGYTSQGSLGHEGLTSFSSLAFDNSG
MDNYISVTTSDKIVNGRNINTKKIIESDQEREAEDNGELTFFLVNSVANEEGFAKECSWR
TQSFNNYSPNSHSSKHVSQYTFVDNDEGGISWVTSNRDPPIFSAGVKEGAAPFCAVTPSQ
RLGLEPGRSPPSFAHHLPTMDPRKVNELRAFVKMCKQDPSVLHTEEMRFLREWVESMGGK
VPPATQKAKSEENTKEEKPDSKKVEEDLKADEPSSEESDLGLIVIFSTLAEIDKEGVIEP
DTDAPQEMGDENAEITEEMMDQANDKKVAAIEALNDGELQKAIDLFTDAIKLNPRLAILY
AKRASVFVKLQKPNAAIRDCDRAIEINPDSAQPYKWRGKAHRLLGHWEEAAHDLALACKL
DYDEDASAMLKEVQPRAQKIAEHRRKYERKREEREIKERIERVKKAREEHERAQREEEAR
RQSGAQYGSFPGGFPGGMPGNFPGGMPGMGGGMPGMAGMPGLNEILSDPEVLAAMQDPEV
MVAFQDVAQNPANMSKYQSNPKVMNLISKLSAKFGVWLGLRGAAPMASVLSYESLVHAVA
GAVGSVTAMTVFFPLDTARLRLQVDEKRKSKTTHMVLLEIIKEEGLLAPYRGWFPVISSL
CCSNFVYFYTFNSLKALWVKGQHSTTGKDLVVGFVAANFPEQVVDNLPADISSGIYYGWA
SAGSRDGQKMVVSIGWNPYYENTKKSMETHTMHTFKEHFYGEILSVAVVGYLRPEKNFDS
>gi568815576r:40670837_40919248|GENSCAN_predicted_CDS_5|2703_bp
atggtggattactatgaagttctaggactgcaaagatatgcttcacctgaggacattaaa
aaagcttatcataaagtggcacttaaatggcaccctgataaaaatccagaaaataaagaa
gaagcagagagaaaattcaaagaagtagctgaggcatacgaggtattatcaaatgatgag
aaacgggacatttatgataaatatggcacagaaggattaaacggaggtggaagtcatttt
gatgatgaatgtgagtacggcttcacattccataagccagatgatgtttttaaagaaatt
tttcatgaaagggatccattttcttttcacttctttgaagactcgcttgaggacctgtta
aatcgtccaggaagctcctatggaaacagaaacagagatgcaggatactttttctccact
gccagtgaatatccaatttttgagaaattttcttcatatgatacaggatatacatcacag
ggttcattggggcatgaaggccttacttctttctcttccctggcttttgataatagtggg
atggacaactacatatctgttacaacttcagacaaaatcgttaatggcagaaatattaat
acaaagaaaattattgaaagtgatcaagaaagagaagctgaagataatggagagttgaca
ttttttcttgtaaatagtgtggccaatgaagagggctttgcaaaagaatgcagctggaga
acacagtcattcaacaactattcaccaaattctcacagctccaaacatgtatctcaatat
actttcgtggacaatgatgagggaggtatatcttgggttaccagcaacagagatccccct
attttctcagcaggagtcaaagagggtgccgcccccttctgcgcggtcacgccgagccag
cgcctgggcctggaaccgggccgtagcccccccagtttcgcccaccacctccctaccatg
gacccccgcaaagtgaacgagcttcgggcctttgtgaaaatgtgtaagcaggatccgagc
gttctgcacaccgaggaaatgcgcttcctgagggagtgggtggagagcatgggtggtaaa
gtaccacctgctactcagaaagctaaatcagaagaaaataccaaggaagaaaaacctgat
agtaagaaggtggaggaagacttaaaggcagacgaaccatcaagtgaggaaagtgatcta
ggtcttattgtcattttctcgactttggcagaaattgataaagaaggtgtgattgaacca
gacactgatgctcctcaagaaatgggagatgaaaatgcggagataacggaggagatgatg
gatcaggcaaatgataaaaaagtggctgctattgaagccctaaatgatggtgaactccag
aaagccattgacttattcacagatgccatcaagctgaatcctcgcttggccattttgtat
gccaagagggccagtgtcttcgtcaaattacagaagccaaatgctgccatccgagactgt
gacagagccattgaaataaatcctgattcagctcagccttacaagtggcgggggaaagca
cacagacttctaggccactgggaagaagcagcccatgatcttgcccttgcctgtaaattg
gattatgatgaagatgctagtgcaatgctgaaagaagttcaacctagggcacagaaaatt
gcagaacatcggagaaagtatgagcgaaaacgtgaagagcgagagatcaaagaaagaata
gaacgagttaagaaggctcgagaagagcatgagagagcccagagggaggaagaagccaga
cgacagtcaggagctcagtatggctcttttccaggtggctttcctgggggaatgcctggt
aattttcccggaggaatgcctggaatgggagggggcatgcctggaatggctggaatgcct
ggactcaatgaaattcttagtgatccagaggttcttgcagccatgcaggatccagaagtt
atggtggctttccaggatgtggctcagaacccagcaaatatgtcaaaataccagagcaac
ccaaaggttatgaatctcatcagtaaattgtcagccaaatttggagtgtggctgggtctt
cgaggagccgcaccaatggcttccgtgctgtcctacgaaagcctggtccacgccgtggcc
ggagccgtgggaagcgtgacagcaatgacagtgttttttcccctggatacagctagactt
cgacttcaggttgatgagaaaagaaaatccaaaactacacacatggtgctcctggagatc
attaaagaagaaggactcctggcaccatatcgagggtggtttccagtgatttccagtctc
tgctgctccaattttgtctatttctacacttttaatagcctcaaagcactctgggtcaaa
ggtcaacattctaccactggaaaagatctggtagttgggtttgttgcagctaattttcct
gagcaagtggtagataatctgccagccgatatatccagtggcatttattatggttgggcc
agtgctggaagtagagatggtcagaagatggtggtgagcataggatggaacccatactat
gagaatacgaagaagtccatggaaacacataccatgcataccttcaaagagcacttctat
ggggaaatcctgagtgtggccgttgttggctacctcagaccagaaaagaactttgattca
tga
>gi568815576r:40670837_40919248|GENSCAN_predicted_peptide_6|308_aa
MRFYHVGRAGLELLTSGEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLSNPT
YYMSNDIPYTFHQDNNFLYLCGFQEPDSILVLQSLPGKQLPSHKAILFVPRRDPSRELWD
GPRSGTDGAIALTGVDEAYTLEEFQHLLPKMKAETNMVWYDWMRPSHAQLHSDYMQPLTE
AKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQIAGKLTSQAFIETMFTSKAPVEEAFLYA
KFEFECRARGADILAYPPVVAGGNRSNTLHYVKNNQLIKDGEMVLLDGGCESSCYVSDIT
RTWPVNGS
>gi568815576r:40670837_40919248|GENSCAN_predicted_CDS_6|924_bp
atgaggttttaccatgttggccgggctggtcttgaactcctgacctcaggggaggtaact
ccaggactatctcaggtggaatatgcacttcgcagacacaaactaatgtctctgatccag
aaggaagctcaagggcagagtgggacagaccagacagtggttgtgctctccaaccctaca
tactacatgagcaacgatattccctatactttccaccaagacaacaatttcctgtaccta
tgtggattccaagagcctgatagcattcttgtccttcagagcctccctggcaaacaatta
ccatcacacaaagccatactttttgtgcctcggcgagatcccagtcgagaactttgggat
ggtccgcgatctggcactgatggagcaatagctctaactggagtagacgaagcctatacg
ctagaagaatttcaacatcttctaccaaaaatgaaagctgagacgaacatggtttggtat
gactggatgaggccctcacatgcacagcttcactctgactatatgcagcccctgactgag
gccaaagccaagagcaagaacaaggttcggggtgttcagcagctgatacagcgcctccgg
ctgatcaagtctcctgcagaaattgaacgaatgcagattgctgggaagctgacatcacag
gctttcatagaaaccatgttcaccagtaaagcccctgtggaagaagcctttctttatgct
aagtttgaatttgaatgccgggctcgtggcgcagacattttagcctatccacctgtggtg
gctggtggtaatcggtcaaacactttgcactatgtgaaaaataatcaactcatcaaggat
ggggaaatggtgcttctggatggaggttgtgagtcttcctgctatgtgagtgacatcaca
cgtacgtggccagtcaatggcagn