GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:18:11 Sequence gi568815587f:20264238_20483202 : 218965 bp : 38.26% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9565 9767 203 0 2 63 81 78 0.762 3.00 1.02 Term + 11858 12119 262 2 1 115 31 178 0.856 8.81 1.03 PlyA + 12688 12693 6 1.05 2.00 Prom + 28677 28716 40 -5.55 2.01 Init + 29418 29462 45 2 0 47 81 66 0.125 2.55 2.02 Intr + 40438 40479 42 0 0 83 96 22 0.038 0.02 2.03 Intr + 50738 50872 135 1 0 85 98 119 0.682 12.44 2.04 Intr + 62355 62453 99 2 0 108 98 27 0.411 5.09 2.05 Intr + 68100 68235 136 2 1 97 40 71 0.484 2.42 2.06 Intr + 79564 79702 139 2 1 39 72 128 0.004 4.90 2.07 Intr + 80049 80197 149 0 2 16 41 108 0.025 -2.14 2.08 Intr + 92236 92423 188 2 2 46 89 168 0.270 11.19 2.09 Term + 94297 94320 24 0 0 111 34 1 0.148 -5.75 2.10 PlyA + 94662 94667 6 1.05 3.00 Prom + 97146 97185 40 -6.65 3.01 Init + 99575 99675 101 1 2 53 91 297 0.999 24.28 3.02 Intr + 100000 100195 196 1 1 56 52 226 0.998 14.30 3.03 Intr + 102937 103044 108 0 0 61 91 89 0.984 6.06 3.04 Intr + 105751 105925 175 0 1 83 -3 202 0.925 9.19 3.05 Intr + 112078 112151 74 2 2 139 58 6 0.828 0.91 3.06 Intr + 112343 112480 138 1 0 99 76 79 0.985 7.54 3.07 Intr + 117941 118002 62 1 2 83 106 57 0.973 3.61 3.08 Intr + 118743 118881 139 0 1 95 69 40 0.320 2.35 3.09 Intr + 122950 123101 152 0 2 77 -43 182 0.306 2.04 3.10 Intr + 123228 123517 290 0 2 58 -3 198 0.466 3.67 3.11 Intr + 123782 123917 136 0 1 49 79 183 0.846 12.21 3.12 Intr + 124228 124407 180 1 0 19 67 122 0.680 1.26 3.13 Intr + 125507 125589 83 2 2 68 101 51 0.924 2.76 3.14 Intr + 131566 131703 138 2 0 93 66 151 0.677 12.91 3.15 Intr + 133372 133484 113 2 2 34 72 105 0.661 2.68 3.16 Intr + 138682 138747 66 0 0 102 115 62 0.993 8.48 3.17 Intr + 143674 143795 122 0 2 71 83 121 0.979 8.27 3.18 Intr + 162529 162628 100 1 1 57 101 107 0.594 8.09 3.19 Term + 165244 165357 114 0 0 60 39 86 0.429 -1.51 3.20 PlyA + 165936 165941 6 1.05 4.05 PlyA - 166201 166196 6 1.05 4.04 Term - 172076 171928 149 0 2 6 43 145 0.738 -1.12 4.03 Intr - 174004 173846 159 2 0 86 92 137 0.972 12.94 4.02 Intr - 174663 174516 148 0 1 56 99 82 0.858 4.99 4.01 Init - 174970 174830 141 1 0 56 84 67 0.230 3.28 4.00 Prom - 176936 176897 40 -6.05 5.00 Prom + 182520 182559 40 -3.95 5.01 Init + 183120 183128 9 2 0 87 117 13 0.888 3.90 5.02 Intr + 187893 187971 79 2 1 124 100 15 0.980 4.51 5.03 Intr + 197743 197930 188 2 2 60 100 207 0.957 17.59 5.04 Intr + 200223 200309 87 2 0 54 115 36 0.671 2.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 79419 79781 363 2 0 51 54 220 0.833 10.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:20264238_20483202|GENSCAN_predicted_peptide_1|154_aa MPPATKINKPIRRAESRPEPLGVGPHLQKKAQRFFLGTDGPFFLSCFRKRLGCQALIQNR SAFRSILCFGLSDLLWLGSISGAHSWNVRAASKSWTLSLRRVLEDPLPTLTPSHLRGVIV PMAITPSPLECYLTDGGPSLWERRDHDLSEHMVS >gi568815587f:20264238_20483202|GENSCAN_predicted_CDS_1|465_bp atgcctcctgcaaccaaaataaataaacccataagaagagcggaaagcagaccagagcct ctgggagttggccctcacctgcagaagaaagcacagagattcttcctggggactgatggt ccgttttttctttcttgttttcgaaagaggcttggctgtcaggctttgattcagaacagg agtgcattcagaagcatcctctgctttggcctctcagacttgctgtggcttgggagcata tctggggcccacagctggaatgtccgggctgcttccaagagctggaccctcagtctcagg cgggttctggaggacccactcccaaccctaacacccagtcaccttcgtggggtgatcgtc ccaatggccatcactccctcaccactggagtgttacctcacagatggaggacctagcctg tgggaaagaagagaccatgacttatcggaacatatggtttcttaa >gi568815587f:20264238_20483202|GENSCAN_predicted_peptide_2|318_aa MQWYAPVVTATQTAKTWISGVLKATGYPRISGDIKESDVLVLHDEMCQRLKDLHNSVNQC FPIDQCMVAQNQAQTPELKPTERELRKQHSLENKNFPRMSLPQMAIKVIKTQKSAFNLAF QHDVVEQKAIALRLEHLIQFLAPTLAWYGLYAARVQNWTEDEMGELIEVGFRRWVITNFA ELKEHVLTQCEEAKNLDKSKTGQYSNSGNTENTTQILHEKINPKTHNQQILQNRNEGKNV KGSQRERPETEDTDTRREDHVKTEAEMGVLQPQPEERLEPSEVGRGKGAFSPRTCRGNMV LLTLQFQTSERAGPYLPL >gi568815587f:20264238_20483202|GENSCAN_predicted_CDS_2|957_bp atgcagtggtatgcacctgtggtcacagcgactcagacagctaagacctggatctcagga gttctcaaagctacaggctacccaaggattagtggagatattaaagaaagtgatgttttg gtgttgcatgatgaaatgtgtcaacgtttgaaagatctgcataactcagtgaaccagtgt tttccaattgaccagtgcatggtggcacaaaatcaggcacagacaccagaactcaaacca acagaaagagaactgcgaaagcagcatagccttgagaataagaatttccccagaatgtcc ctcccacaaatggccataaaggtcataaaaacccaaaaatctgcctttaatttggctttt caacatgacgtggtagagcagaaagcaattgctcttaggctagaacacctgattcaattt ctggctccaacacttgcctggtatggcctttatgcagcaagggtgcagaactggacagag gatgagatgggtgaattgatagaagtaggcttcagaaggtgggtaataacaaactttgct gagctaaaggagcatgttctaacccaatgcgaagaagctaagaaccttgataaaagcaag acaggccaatattcaaattcaggaaataccgagaacaccactcagatactccatgagaag atcaatcccaagacacataatcagcagattctccaaaatcgaaatgaaggaaaaaatgtt aagggcagccagagagaaaggccagagacagaagacacagacaccagaagagaagaccac gtgaagacagaggcagagatgggagttttacagccacaacccgaggaacgcctggagcca tcagaagtgggaagagggaagggagcattctcccctagaacctgcagagggaacatggtc ctgctgacacttcaatttcagacttctgagagagctggtccctacctgcctctctag >gi568815587f:20264238_20483202|GENSCAN_predicted_peptide_3|828_aa MAGPAALSAAAAAALAAALLLLRREDPGPGAGPSMAETEALSKLREDFRMQNKSVFILGA SGETGRVLLKEILEQGLFSKVTLIGRRKLTFDEEAYKNVNQEVVDFEKLDDYASAFQGHD VGFCCLGTTRGKAGANTVVQSMHWLPDLALIQIYCISISEFGSLESVFLAGSLGDPDAVN PVLLVEPTFKRHWDLFGVCVSMSLSLFPYCSPVLLLKLEGFVRVDRDYVLKSAELAKAGG CKHFNLLSSKGADKSSNFLYLQVKGEVEAKVEELKFDRYSVFRPGVLLCDRQESRPGEWL VRKFFGSLPDSWASGHSVPVVTVVRAMLNNVDLGAQKPAEPPAFGLLRHQRASWADRPSA NRSSQEKGAHRGQDANPKTAATAHCANPAAGGGGAPALAPPPPHVTRTAPRGRKRLRASA RASAFPAVCEVSGARVPPRVRACARARSWSRRPLRALVAQPGPRPQTRRALGAPQPCARG RGAVENEEDLPELSDSGDEAAWEDEDDADLPHGKQQTPCLFCNSQGWGLRIFCFYEDKIG SNSEVTNELLLTSNIFGSHPELDNPYGAFRLYHANSLTSTELLRLFTSAEETFSHCKSEH QFNIDSMVHKHDVEDLYEPVSVPFSYPNGLSENTSVVEKLKHMEARALSAEAALARAHVR TCSSSTSVIADLQEDEDGVYFSSYGHYGIHEEMLKDKIRTESYRDFIYQNPHIFKDKVVL DVGCGTGILSMFAAKAGAKKVLGVDQSEILYQAMDIIRLNKLEDTITLIKGKIEEVHLPV EKVDVIISEWMSVGQVRPLSSSGRQSLESLEPEMKGASCLEAKVLRNQ >gi568815587f:20264238_20483202|GENSCAN_predicted_CDS_3|2487_bp atggccgggcctgcggcgctgagcgcggcggcggcggctgctctggcggccgccctgctc ctgctgcgtcgtgaggacccggggccgggggctggccccagcatggccgaaacagaagcc ctgtcgaagcttcgggaagacttcaggatgcagaataaatccgtctttattttgggcgcc agcggagaaaccggcagagtgctcttaaaggaaatcctggagcagggcctgttttccaaa gtcacgctcattggccggaggaagctcaccttcgacgaggaagcttataaaaatgtgaat caagaagtggtggactttgaaaagttggatgactacgcctctgcctttcaaggtcatgat gttggattctgttgcctgggtaccaccagagggaaagctggggcgaacacagttgttcag tcaatgcactggctccctgacttggccctgatccagatctactgcatcagcatctctgaa tttggaagcttggaatctgtatttttagcaggctccttgggagatcctgatgcagtcaac cctgtccttcttgttgaacctactttcaagaggcattgggatttgtttggtgtctgtgtt tcaatgtcattgtccttgtttccttattgcagtccagtcttgctgctaaagctggaggga tttgttcgtgttgaccgagattatgtgctgaagtctgcagagctggcaaaagctggaggg tgcaaacatttcaacttgctatcctctaaaggagctgataaatcaagcaattttttatat ctacaagttaagggagaagtagaagccaaggttgaagaattaaaatttgatcgttactct gtatttaggcctggagttctgttatgtgataggcaagaatctcgcccaggtgaatggctg gttagaaagttctttggctccttaccagactcttgggccagtgggcattctgtgcctgtg gtgaccgtggttagagcaatgctgaacaatgtggacctgggagcccagaaaccagccgag cctccagcgtttggcctcctccgccaccagagggcgtcatgggccgatcgcccctccgct aaccgctcatcccaggaaaaaggggcgcacagaggacaggatgccaaccccaaaaccgca gcgaccgcgcactgcgccaaccctgcggcaggaggaggcggagcccccgctctcgccccg cccccacctcacgtgacccgcacggcgcctcgaggccggaagcgattgcgagccagcgcg cgcgcttcggcgttcccggcggtctgcgaagtttccggagcccgggtcccgccgcgggtt cgcgcttgtgctcgcgctcgttcctggagtcggcggccgctgcgcgcgctcgttgcccaa cccggtccccgcccccagacacgccgggctctcggggcaccacagccatgtgctcgcggc cggggcgctgtggagaatgaggaggacctgccagaactgtcggacagcggggacgaggcc gcctgggaggatgaggacgatgcagatctcccccacggcaagcagcagaccccctgcctg ttctgtaacagtcagggatggggtctaagaatattctgtttttatgaggacaagatcgga agtaacagtgaagttactaacgagttactactaacgagtaacatatttggcagccaccct gaactagataatccctacggagccttccggctttaccatgctaattctttgaccagtaca gagcttctcaggttattcacatctgctgaagaaacattttcacactgtaagtctgagcat cagtttaatattgacagcatggttcataaacatgatgtagaagatctttatgaaccggtg tcagtacccttctcataccccaatggactcagtgaaaatacatctgttgttgaaaaattg aaacatatggaagccagggcactgtctgctgaagccgcattggccagagcacatgtcaga acctgctcgtcatctactagtgtcattgcggacctccaggaggatgaggatggtgtttat ttcagctcatacgggcattatgggatacatgaagaaatgctaaaggacaaaatacgaaca gaaagctaccgagatttcatataccaaaatccacatatcttcaaagacaaggtagttttg gatgttgggtgtggaactggaattctctctatgtttgctgctaaagctggggcgaagaag gttcttggagttgatcaatctgaaatactttaccaggcaatggatattataagactaaat aaacttgaagatactattacactaattaaaggaaagattgaagaagttcatcttcctgta gaaaaagtagatgttatcatatctgagtggatgagcgttggtcaggtaagacctctctct tcatcaggaaggcagtcactggagtcactggagcctgagatgaaaggagccagttgtttg gaagccaaggttcttcgaaaccagtaa >gi568815587f:20264238_20483202|GENSCAN_predicted_peptide_4|198_aa MPAEGAAWGPNSCALPRENDTLSCPQCLCGYCHLGIETMAEVYLASGLELQQYAASWEKS TGATQSSNTPLPSGLKQCPASQEMVPWLPRETTTPFGSGQCCGLSKTTVTTEPSGSELLE GASIVTDPSSCGKLTSNPATEGESAPKDPARAIRQEKEIKDIQIGKKGDKLFLFEDDMIL YRKSKNLLELTNTFNKAE >gi568815587f:20264238_20483202|GENSCAN_predicted_CDS_4|597_bp atgccagctgaaggagctgcctggggtccaaatagctgtgctctcccaagggaaaatgac actctgtcctgcccccagtgcctgtgtggctactgccatcttggaattgaaactatggct gaagtgtaccttgcttcagggctggagctgcagcagtatgctgcctcctgggaaaagagt actggggccactcagagcagtaacacccctctgccatcagggttgaagcagtgccctgca tcccaggaaatggtgccttggctgcccagagaaaccacgaccccatttgggtcaggccag tgctgtggcttgagtaaaaccacagttacaactgagccctctggttccgagctgctggag ggtgcctccattgtcacagatcctagttcttgtgggaaacttacatccaaccctgccaca gagggtgaatctgcacccaaagacccagccagagcaatcaggcaagagaaagaaataaag gacatccaaattggaaagaagggagacaaattgttcctgtttgaagacgacatgatctta tacagaaaatctaaaaacctcttagagctgacaaacacattcaataaagctgaatga >gi568815587f:20264238_20483202|GENSCAN_predicted_peptide_5|121_aa MVPGYFLLFESMLDSVLYAKNKYLAKGGSVYPDICTISLVAVSDVNKHADRIAFWDDVYG FKMSCMKKAVIPEAVVEVLDPKTLISEPCGIKHIDCHTTSISDLEFSSDFTLKITRTSMC T >gi568815587f:20264238_20483202|GENSCAN_predicted_CDS_5|363_bp atggtgccgggctattttcttctgtttgagtctatgttagattctgtcctttatgcaaag aacaaatacttggcaaaaggaggctcggtctaccctgacatttgcactatcagccttgta gcagtgagtgatgtgaataaacatgctgatagaattgctttttgggatgatgtctatggc ttcaagatgtcctgcatgaagaaagcagttattccagaagctgttgtggaagttttagat ccgaagactcttatttcagaaccttgtggtattaagcatatagattgccatacgacgtct atctcagatttggaattttcatcagattttaccctgaaaatcacaaggacatccatgtgc acg