GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:07:15 Sequence gi568815576r:40761068_40961994 : 200927 bp : 41.72% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 9584 9934 351 1 0 91 48 265 0.952 18.60 1.02 PlyA + 10814 10819 6 1.05 2.07 PlyA - 11209 11204 6 1.05 2.06 Term - 12633 12432 202 2 1 7 38 160 0.176 -1.22 2.05 Intr - 12952 12870 83 1 2 88 105 23 0.683 1.52 2.04 Intr - 16068 15973 96 0 0 106 96 67 0.885 8.59 2.03 Intr - 16306 16161 146 2 2 59 84 106 0.999 6.38 2.02 Intr - 18058 17942 117 2 0 76 111 50 0.954 5.62 2.01 Init - 24044 23975 70 2 1 110 77 71 0.985 9.46 2.00 Prom - 26494 26455 40 -9.55 3.08 PlyA - 26868 26863 6 1.05 3.07 Term - 28224 27843 382 2 1 42 54 322 0.722 17.53 3.06 Intr - 28422 28229 194 0 2 16 -3 209 0.465 1.97 3.05 Intr - 37199 37100 100 1 1 72 45 101 0.047 3.39 3.04 Intr - 56297 56136 162 1 0 -24 100 145 0.635 2.67 3.03 Intr - 57992 57787 206 2 2 10 61 111 0.265 -2.32 3.02 Intr - 58260 58128 133 2 1 -47 69 208 0.424 5.13 3.01 Init - 58526 58468 59 2 2 69 49 101 0.713 5.14 3.00 Prom - 61935 61896 40 -6.75 4.12 PlyA - 62085 62080 6 1.05 4.11 Term - 65599 65471 129 1 0 95 35 81 0.927 0.70 4.10 Intr - 66162 66029 134 1 2 87 110 96 0.967 11.14 4.09 Intr - 68607 68559 49 0 1 63 76 34 0.961 -2.97 4.08 Intr - 69889 69773 117 1 0 68 61 199 0.946 14.94 4.07 Intr - 71604 71502 103 2 1 47 75 140 0.999 7.86 4.06 Intr - 74603 74493 111 1 0 63 99 165 0.999 13.68 4.05 Intr - 74820 74736 85 1 1 91 94 80 0.999 6.86 4.04 Intr - 79625 79559 67 2 1 92 101 84 0.972 7.66 4.03 Intr - 83872 83772 101 2 2 44 107 156 0.935 12.01 4.02 Intr - 87302 87227 76 2 1 76 66 138 0.953 8.57 4.01 Init - 89812 89756 57 1 0 75 115 79 0.999 10.66 4.00 Prom - 92482 92443 40 -4.65 5.00 Prom + 94506 94545 40 -7.45 5.01 Init + 94750 94816 67 0 1 86 60 34 0.603 1.69 5.02 Intr + 95384 95474 91 0 1 98 54 94 0.905 5.13 5.03 Term + 95984 96479 496 2 1 26 42 260 0.977 8.25 5.04 PlyA + 96936 96941 6 1.05 6.02 PlyA - 97329 97324 6 1.05 6.01 Sngl - 100927 99998 930 1 0 75 32 685 0.982 57.78 6.00 Prom - 102492 102453 40 -3.65 7.00 Prom + 111929 111968 40 -4.35 7.01 Init + 120169 120217 49 0 1 77 58 40 0.679 -0.91 7.02 Intr + 120703 121110 408 2 0 89 87 279 0.258 21.21 7.03 Intr + 125246 125448 203 0 2 53 80 314 0.252 25.18 7.04 Intr + 146520 146582 63 2 0 122 100 72 0.761 9.80 7.05 Intr + 148055 148168 114 1 0 78 102 63 0.986 6.42 7.06 Intr + 153172 153257 86 0 2 113 91 71 0.987 7.60 7.07 Intr + 161266 161446 181 1 1 96 75 96 0.992 8.05 7.08 Intr + 163295 163415 121 1 1 99 107 42 0.972 6.35 7.09 Term + 165202 165368 167 2 2 95 43 236 0.984 16.90 7.10 PlyA + 166416 166421 6 1.05 8.00 Prom + 183941 183980 40 -3.65 8.01 Init + 190332 190409 78 2 0 93 53 139 0.688 12.01 8.02 Intr + 192488 192566 79 1 1 99 78 64 0.470 4.71 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:40761068_40961994|GENSCAN_predicted_peptide_1|116_aa MTIKVTTGPVGHTPCAPLDAFQANEGNICGGRSQSQGRITSLHSGGNLPLEHLRNFSWEG VSVLVCTLQPHNCEGGSCQFLINKEHESSSEDCLQQFGFKAFVESHYSKTSKGKEV >gi568815576r:40761068_40961994|GENSCAN_predicted_CDS_1|351_bp atgacaattaaggtgacaacaggtccagttggccatactccctgtgcacccttggatgct tttcaagccaatgagggtaacatttgtggtggcaggagccagagtcaagggagaatcact tctcttcactcaggaggaaacctccctcttgagcatcttcggaatttttcatgggaaggc gtctcagtgttggtgtgcacgcttcagccccataactgtgaaggtggcagctgtcagttt ctcataaacaaggaacatgagagcagcagtgaggactgtctgcagcagtttggcttcaag gcctttgtagagtcccattattccaaaacgtctaaagggaaagaggtttga >gi568815576r:40761068_40961994|GENSCAN_predicted_peptide_2|237_aa MPGLTEFDQAKNNSQIGQPSEPEGVVNVLLTTPLWVVNTRLKLQGAKFRNEDIVPTNYKG IIDAFHQIIRDEGISALWNGTFPSLLLVFNPAIQFMFYEGLKRQLLKKRMKLSSLDVFII GAVAKAIATTVTYPLQTVQSILRFGRHRLNPENRTLGSLRNILYLLHQRVRTPSEREPFY TSVTLSVLSSVAYPMMPLSNITSAYRIGVLEKGGGSGKSDNSDDDDDNDDGGVGKSS >gi568815576r:40761068_40961994|GENSCAN_predicted_CDS_2|714_bp atgcctggcctaacagagtttgatcaagcaaagaacaattcgcaaattgggcagccctca gaaccagaaggagtggttaatgtgttgctaacaactccactctgggtggtaaacaccaga ctgaagcttcaaggagcaaaatttaggaatgaagacattgtaccaacaaactacaaaggt atcattgatgcttttcatcagatcattcgcgatgaaggaatctcggctttatggaatggc acatttccctcattgctgttggtcttcaatcctgccatccagttcatgttttatgaaggt ttaaaacggcagcttttaaagaaacggatgaagctttcttccttggatgtgttcatcatt ggtgcagtagccaaagcgattgccaccacggtgacctatcccctgcagacggtacagtca attctgaggtttgggcgtcatagactaaacccagaaaacagaacattgggaagtcttcgg aatattctctatcttcttcaccaacgagtaagaactccttcagaaagagaacccttctat acttcagtcaccctgagtgtgctttcatcagtggcatatcctatgatgcctcttagcaat ataacatctgcctatagaataggagtactggagaagggagggggcagtggaaaaagtgat aatagtgatgatgatgatgataatgatgatggtggtgttggtaagagtagctaa >gi568815576r:40761068_40961994|GENSCAN_predicted_peptide_3|411_aa MEELVRARERQRTSEEPGAGSGAPFLTPLAAASVGKVWLGLRGAAPMASVLSYESLVHAV AGAVRGMGHWRKVVGGIRGKGRDSATSWSPRDCGVESRVHTCPLRVRIFAVGSLRSGYHR TLTHFALLARRRTGRENKIAPAKETNEGWISTAGGKLRTKSESHVKKVFLGKEVDQLGQM LLMGQVRAFPELFAQLLLTTDDKTGKDQDHINSPFEFPEQVGKEGKKENENGWETQLLSR WHQGSSVACHRHRRAPGAPPRQTELTPLPTPWLVRGPGTAWTVSLTQGPGCSPCLEQTAL RSICHTPAGTKWYGASAVATSSWASPTANFPEQVVDNLPADISSGIYYGWASAGSRDGQK MVVSIGWNPYYENTKKSMETHTMHTFKEHFYGEILSVAVVGYLRPEKNFDS >gi568815576r:40761068_40961994|GENSCAN_predicted_CDS_3|1236_bp atggaggaactggtcagggcccgggaacgccagagaacgtcggaggaaccgggagccggc tccggtgctcctttcctaactccactggctgcggcatctgtgggaaaagtgtggctgggt cttcgaggagccgcaccaatggcttccgtgctgtcctacgaaagcctggtccacgccgtg gccggagccgtgagaggaatgggccattggcggaaagttgttggaggaattcggggtaag ggccgagactccgccacctcctggtctcctagagactgcggggtagagagccgtgttcac acgtgcccgctcagggtccgcatctttgccgtgggaagtttaaggtcaggctaccaccgc acactcacacactttgctttactggcccggaggagaacaggccgagaaaataagattgca ccagcaaaggagacaaacgaaggatggatcagtactgcaggagggaagctaagaactaag agtgaaagccacgtgaagaaagtgtttctggggaaggaagtagaccagctgggtcagatg ctgctgatgggtcaagtaagagcttttccagagctttttgcccagttgttgctgacaact gatgacaagacaggcaaagatcaagaccatatcaacagccccttcgaatttccagaacag gtagggaaagaaggaaagaaagaaaatgaaaatggttgggagacccagctcctgagtcgc tggcatcagggatcctccgtggcgtgtcatcggcatcgtcgggcccctggtgcacccccg cggcagacagagctcacgccccttcccaccccgtggctggtccggggtccgggcaccgcg tggacggtgtctctgacgcagggaccgggctgcagcccatgcctcgagcagactgcatta cgaagcatctgtcatactcctgccgggaccaagtggtatggggcttcggccgtggctaca agcagctgggcatcccccacagctaattttcctgagcaagtggtagataatctgccagcc gatatatccagtggcatttattatggttgggccagtgctggaagtagagatggtcagaag atggtggtgagcataggatggaacccatactatgagaatacgaagaagtccatggaaaca cataccatgcataccttcaaagagcacttctatggggaaatcctgagtgtggccgttgtt ggctacctcagaccagaaaagaactttgattcatga >gi568815576r:40761068_40961994|GENSCAN_predicted_peptide_4|342_aa MGGKVPPATQKAKSEENTKEEKPDSKKVEEDLKADEPSSEESDLGLIVIFSTLAEIDKEG VIEPDTDAPQEMGDENAEITEEMMDQANDKKVAAIEALNDGELQKAIDLFTDAIKLNPRL AILYAKRASVFVKLQKPNAAIRDCDRAIEINPDSAQPYKWRGKAHRLLGHWEEAAHDLAL ACKLDYDEDASAMLKEVQPRAQKIAEHRRKYERKREEREIKERIERVKKAREEHERAQRE EEARRQSGAQYGSFPGGFPGGMPGNFPGGMPGMGGGMPGMAGMPGLNEILSDPEVLAAMQ DPEVMVAFQDVAQNPANMSKYQSNPKVMNLISKLSAKFGGQA >gi568815576r:40761068_40961994|GENSCAN_predicted_CDS_4|1029_bp atgggtggtaaagtaccacctgctactcagaaagctaaatcagaagaaaataccaaggaa gaaaaacctgatagtaagaaggtggaggaagacttaaaggcagacgaaccatcaagtgag gaaagtgatctaggtcttattgtcattttctcgactttggcagaaattgataaagaaggt gtgattgaaccagacactgatgctcctcaagaaatgggagatgaaaatgcggagataacg gaggagatgatggatcaggcaaatgataaaaaagtggctgctattgaagccctaaatgat ggtgaactccagaaagccattgacttattcacagatgccatcaagctgaatcctcgcttg gccattttgtatgccaagagggccagtgtcttcgtcaaattacagaagccaaatgctgcc atccgagactgtgacagagccattgaaataaatcctgattcagctcagccttacaagtgg cgggggaaagcacacagacttctaggccactgggaagaagcagcccatgatcttgccctt gcctgtaaattggattatgatgaagatgctagtgcaatgctgaaagaagttcaacctagg gcacagaaaattgcagaacatcggagaaagtatgagcgaaaacgtgaagagcgagagatc aaagaaagaatagaacgagttaagaaggctcgagaagagcatgagagagcccagagggag gaagaagccagacgacagtcaggagctcagtatggctcttttccaggtggctttcctggg ggaatgcctggtaattttcccggaggaatgcctggaatgggagggggcatgcctggaatg gctggaatgcctggactcaatgaaattcttagtgatccagaggttcttgcagccatgcag gatccagaagttatggtggctttccaggatgtggctcagaacccagcaaatatgtcaaaa taccagagcaacccaaaggttatgaatctcatcagtaaattgtcagccaaatttggaggt caagcgtaa >gi568815576r:40761068_40961994|GENSCAN_predicted_peptide_5|217_aa MPGLKKLFFSDVFGVITLPSNGGSAFPRCAERSDPAYTFSQRPEARSLCGGPWKAREPPP PEGEVRLARFPGDAALQAVAFPVVTLFLFPTRELGRNALAALSPQAGSRCSKRPRPLRLD SSPTVSSHGVPWRDPSWSQAIPGSADPSPGGASAPQPSVLRGFLLYRLLLERGGPSVFST VEVFCEAEMTAERKVCHPPEGLSSKLLGLRSCGNYHT >gi568815576r:40761068_40961994|GENSCAN_predicted_CDS_5|654_bp atgccaggccttaagaaattatttttctcagacgtgtttggagtgataacactgccaagt aacggaggaagcgcatttcctcggtgtgcagaacgctcggatcctgcttacacattttca caaaggcccgaagctcgttcactttgcgggggtccatggaaggcccgcgagccgcctccg ccggaaggggaggtgcggctggcccggtttcctggcgacgcggccctgcaggcggttgcg ttccccgtcgttaccctctttctcttcccgacgcgtgagttaggccgtaatgccttggct gctctcagcccccaagctggttcccgctgtagcaaacgtccgcggcctctcaggttagac tcttctcccacggtctcctcccatggtgtcccctggagggacccaagttggtctcaggcg atccctggttcggctgacccttctcctggtggggcctccgctccccaaccctctgtgctc cgcggatttcttctataccggctgctcctggaaaggggtgggcccagtgtgttctctacc gttgaggtattttgtgaagccgaaatgactgcagagagaaaagtgtgccatcccccagag gggctgtcttccaagttactcggactccgaagttgtggcaattatcacacttaa >gi568815576r:40761068_40961994|GENSCAN_predicted_peptide_6|309_aa MVDYYEVLGLQRYASPEDIKKAYHKVALKWHPDKNPENKEEAERKFKEVAEAYEVLSNDE KRDIYDKYGTEGLNGGGSHFDDECEYGFTFHKPDDVFKEIFHERDPFSFHFFEDSLEDLL NRPGSSYGNRNRDAGYFFSTASEYPIFEKFSSYDTGYTSQGSLGHEGLTSFSSLAFDNSG MDNYISVTTSDKIVNGRNINTKKIIESDQEREAEDNGELTFFLVNSVANEEGFAKECSWR TQSFNNYSPNSHSSKHVSQYTFVDNDEGGISWVTSNRDPPIFSAGVKEGGKRKKKKRKEV QKKSTKRNC >gi568815576r:40761068_40961994|GENSCAN_predicted_CDS_6|930_bp atggtggattactatgaagttctaggactgcaaagatatgcttcacctgaggacattaaa aaagcttatcataaagtggcacttaaatggcaccctgataaaaatccagaaaataaagaa gaagcagagagaaaattcaaagaagtagctgaggcatacgaggtattatcaaatgatgag aaacgggacatttatgataaatatggcacagaaggattaaacggaggtggaagtcatttt gatgatgaatgtgagtacggcttcacattccataagccagatgatgtttttaaagaaatt tttcatgaaagggatccattttcttttcacttctttgaagactcgcttgaggacctgtta aatcgtccaggaagctcctatggaaacagaaacagagatgcaggatactttttctccact gccagtgaatatccaatttttgagaaattttcttcatatgatacaggatatacatcacag ggttcattggggcatgaaggccttacttctttctcttccctggcttttgataatagtggg atggacaactacatatctgttacaacttcagacaaaatcgttaatggcagaaatattaat acaaagaaaattattgaaagtgatcaagaaagagaagctgaagataatggagagttgaca ttttttcttgtaaatagtgtggccaatgaagagggctttgcaaaagaatgcagctggaga acacagtcattcaacaactattcaccaaattctcacagctccaaacatgtatctcaatat actttcgtggacaatgatgagggaggtatatcttgggttaccagcaacagagatccccct attttctcagcaggagtcaaagagggtggtaagaggaaaaaaaagaagcgtaaagaggtg caaaagaagtctaccaaaaggaattgttaa >gi568815576r:40761068_40961994|GENSCAN_predicted_peptide_7|463_aa MRFYHVGRAGLELLTSGEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLSNPT YYMSNDIPYTFHQDNNFLYLCGFQEPDSILVLQSLPGKQLPSHKAILFVPRRDPSRELWD GPRSGTDGAIALTGVDEAYTLEEFQHLLPKMKAETNMVWYDWMRPSHAQLHSDYMQPLTE AKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQIAGKLTSQAFIETMFTSKAPVEEAFLYA KFEFECRARGADILAYPPVVAGGNRSNTLHYVKNNQLIKDGEMVLLDGGCESSCYVSDIT RTWPVNGRFTAPQAELYEAVLEIQRDCLALCFPGTSLENIYSMMLTLIGQKLKDLGIMKN IKENNAFKAARKYCPHHVGHYLGMDVHDTPDMPRSLPLQPGMVITIEPGIYIPEDDKDAP EKFRGLGVRIEDDVVVTQDSPLILSADCPKEMNDIEQICSQAS >gi568815576r:40761068_40961994|GENSCAN_predicted_CDS_7|1392_bp atgaggttttaccatgttggccgggctggtcttgaactcctgacctcaggggaggtaact ccaggactatctcaggtggaatatgcacttcgcagacacaaactaatgtctctgatccag aaggaagctcaagggcagagtgggacagaccagacagtggttgtgctctccaaccctaca tactacatgagcaacgatattccctatactttccaccaagacaacaatttcctgtaccta tgtggattccaagagcctgatagcattcttgtccttcagagcctccctggcaaacaatta ccatcacacaaagccatactttttgtgcctcggcgagatcccagtcgagaactttgggat ggtccgcgatctggcactgatggagcaatagctctaactggagtagacgaagcctatacg ctagaagaatttcaacatcttctaccaaaaatgaaagctgagacgaacatggtttggtat gactggatgaggccctcacatgcacagcttcactctgactatatgcagcccctgactgag gccaaagccaagagcaagaacaaggttcggggtgttcagcagctgatacagcgcctccgg ctgatcaagtctcctgcagaaattgaacgaatgcagattgctgggaagctgacatcacag gctttcatagaaaccatgttcaccagtaaagcccctgtggaagaagcctttctttatgct aagtttgaatttgaatgccgggctcgtggcgcagacattttagcctatccacctgtggtg gctggtggtaatcggtcaaacactttgcactatgtgaaaaataatcaactcatcaaggat ggggaaatggtgcttctggatggaggttgtgagtcttcctgctatgtgagtgacatcaca cgtacgtggccagtcaatggcaggttcaccgcacctcaggcagaactctatgaagccgtt ctagagatccaaagagattgtttggccctctgcttccctgggacaagcttggagaacatc tacagcatgatgctgaccctgataggacagaagcttaaagacttggggatcatgaagaac attaaggaaaataatgccttcaaggctgctcgaaaatactgtcctcatcatgttggccac tacctcgggatggatgtccatgacactccagacatgccccgttccctccctctgcagcct gggatggtaatcacaattgagcccggcatttatattccagaggatgacaaagatgcccca gagaagtttcggggtcttggtgtacgaattgaggatgatgtagtggtgactcaggactca cctctcatcctttctgcagactgtcccaaagagatgaatgacattgaacagatatgcagc caggcttcttga >gi568815576r:40761068_40961994|GENSCAN_predicted_peptide_8|53_aa MAAAMDVDTPSGTNSGAGKKRFEVKKWNAVALWAWDIVVDNCAICRNHIMDLX >gi568815576r:40761068_40961994|GENSCAN_predicted_CDS_8|159_bp atggcggcagcgatggatgtggataccccgagcggcaccaacagcggcgcgggcaagaag cgctttgaagtgaaaaagtggaatgcagtagccctctgggcctgggatattgtggttgat aactgtgccatctgcaggaaccacattatggatctttnn