GENSCAN 1.0 Date run: 3-Nov-116 Time: 04:34:11 Sequence gi568815595r:134257708_134471433 : 213726 bp : 45.16% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.19 PlyA - 371 366 6 1.05 1.18 Term - 23702 23553 150 2 0 73 40 125 0.063 4.11 1.17 Intr - 43653 43575 79 2 1 46 99 96 0.124 6.05 1.16 Intr - 57138 56898 241 1 1 87 48 159 0.424 8.41 1.15 Intr - 58472 58448 25 2 1 91 107 19 0.610 1.70 1.14 Intr - 77409 77308 102 0 0 17 81 102 0.123 2.77 1.13 Intr - 87737 87657 81 2 0 93 99 20 0.093 3.43 1.12 Intr - 92758 92625 134 2 2 97 39 26 0.095 -1.04 1.11 Intr - 94366 94234 133 1 1 61 48 101 0.287 3.62 1.10 Intr - 96898 96771 128 2 2 66 65 33 0.081 -0.80 1.09 Intr - 100801 100694 108 2 0 84 83 15 0.567 0.86 1.08 Intr - 101009 100833 177 0 0 53 106 81 0.764 6.29 1.07 Intr - 101796 101576 221 2 2 88 110 202 0.990 20.35 1.06 Intr - 102706 102399 308 1 2 117 52 401 0.924 34.65 1.05 Intr - 104100 103805 296 1 2 121 99 684 0.999 69.53 1.04 Intr - 108202 108110 93 2 0 74 70 70 0.925 3.74 1.03 Intr - 108720 108576 145 0 1 82 78 178 0.998 16.06 1.02 Intr - 110096 109790 307 1 1 125 102 347 0.967 36.25 1.01 Init - 113726 112993 734 2 2 74 93 808 0.997 74.15 1.00 Prom - 114698 114659 40 -9.65 2.00 Prom + 114893 114932 40 -9.65 2.01 Init + 115628 116018 391 1 1 55 93 175 0.771 10.13 2.02 Intr + 116420 116808 389 0 2 88 45 216 0.446 11.81 2.03 Intr + 149566 149643 78 0 0 79 95 7 0.047 0.25 2.04 Intr + 170163 170213 51 2 0 135 84 29 0.088 6.30 2.05 Term + 173579 173659 81 1 0 30 48 101 0.034 -2.01 2.06 PlyA + 173899 173904 6 1.05 3.00 Prom + 174699 174738 40 -2.96 3.01 Init + 179899 180102 204 0 0 87 27 241 0.576 16.75 3.02 Term + 180180 180395 216 2 0 34 42 104 0.707 -2.46 3.03 PlyA + 181335 181340 6 1.05 4.05 PlyA - 181639 181634 6 1.05 4.04 Term - 184813 184690 124 0 1 48 44 93 0.212 -1.24 4.03 Intr - 190335 190152 184 1 1 94 19 97 0.577 2.25 4.02 Intr - 192127 191956 172 1 1 46 53 169 0.844 8.72 4.01 Init - 198482 198453 30 2 0 39 86 50 0.551 -0.37 4.00 Prom - 211627 211588 40 -1.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:134257708_134471433|GENSCAN_predicted_peptide_1|1153_aa MRTLEDSSGTVLHRLIQEQLRYGNLTETRTLLAIQQQALRGGAGTGGTGSPQASLEILAP EDSQVLQQATRQEPQGQEHQGGENHLAENTLYRLCPQPSKGEELPTYEEAKAHSQYYAAQ QAGTRPHAGDRDPRGAPGGSRRQDEALRELRHGHVRSLSERLLQLSLERNGARAPSHMSS SHSFPQLARNQQGPPLRGPPAEGPESRGPPPQYPHVVLAHETTTAVTDPRYRARGSPHFQ HAEVRILQAQVPPVFLQQQQQYQYLQQSQEHPPPPHPAALGHGPLSSLSPPAVEGPVSAQ ASSATSGSAHLAQMEAVLRENARLQRDNERLQRELESSAEKAGRIEKLESEIQRLSEAHE SLTRASSKREALEKTMRNKMDSEMRRLQDFNRDLRERLESANRRLASKTQEAQAGSQDMV AKLLAQSYEQQQEQEKLEREMALLRGAIEDQRRRAELLEQALGNAQGRAARAEEELRKKQ AYVEKVERLQQALGQLQAACEKREQLELRLRTRLEQELKALRAQQRQAGAPGGSSGSGGS PELSALRLSEQLREKEEQILALEADMTKWEQKYLEERAMRQFAMDAAATAAAQRDTTLIR HSPQPSPSSSFNEGLLTGGHRHQEMESRLKVLHAQILEKDAVIKVLQQRSRRDPGKAIQG SLRPAKSVPSVFAAAAAGTQGWQGLSSSERQTADAPARLTTDRAPTEEPVVTAPPAAHAK HGSRDGSTQTEGPPDSTSTCLPPEPDSLLGCSSSQRAASLGRAEGALGCELEHKAFSSQT PSGSLPALHGEPCIARVPLARGFQSICTFCEIQSVESHFNVSLAQTRRNKKSVPVYPAHT FKIKQFLAKKQKQNRPIPQWIRMKTGNKIRYNSKRRHWKRTKLAPKLDRQGVVLGTVWSA QGPVVELLQPSHHLSIERYSPLRPPLAHETEQQSQPGQGFGLKCPHDQPAVLSAQKGHSQ MKELLWVTNGASMFQEGFQEVVVFYCEGKNPVQCLVPALGALDVFQDLYDALFHASLMSC MLPPPNQGRTKSRKFHDQTHMVFPLGRPLRRDEELDSCLFGTRNQYYEGEEESVNQASVA FVHMVQAGCWGAAIMSTTSSPQEENNIGIFSLTQGGGKASSPWVSDLQMKKCKTVTYFDL GPIITPDPHAYVV >gi568815595r:134257708_134471433|GENSCAN_predicted_CDS_1|3462_bp atgaggacactggaagactcctcggggacagtcctgcaccgcctcatccaggagcagctg cgctacggcaacctgactgagacgcgcacgctgctagccatccagcagcaggccctgagg ggtggggctggaactgggggtacagggagcccccaggcctccctggagatcctggcccca gaggacagtcaggtgctgcagcaggccaccaggcaggagccccagggccaggagcaccag ggcggtgagaaccacctggcagagaacaccctctaccggctatgcccacagcccagcaag ggagaggagctgcccacctatgaggaggccaaagcccactcgcagtactatgcggcccag caggcagggacccggccacatgcgggggaccgagatccccgtggggccccgggaggcagt cggaggcaggacgaggccctgcgggagctgaggcatgggcacgtgcgctcgttgagtgaa cggctccttcagttgtccctggagaggaacggcgcccgggcccccagccacatgagctcc tcccacagcttcccacagctggcccgcaaccagcagggccccccactgaggggcccccct gctgagggcccagagtcccgaggacccccacctcagtaccctcatgttgtactagctcat gagaccaccactgctgtcactgacccacggtaccgtgcccgcggcagcccgcacttccag catgctgaagtcaggatcctgcaggcccaggtgcctcctgtgttcctccaacagcagcag cagtaccagtacctgcagcaatctcaggagcacccccctcccccacatccagctgctctc ggccatggccccctgagctccctcagtccacctgctgtggaggggccagtgagtgcccag gcctcctcagccacctcgggcagtgcccacctggcccagatggaggccgtgctgagggag aatgccaggctgcagagagacaatgagcggctgcagagggagctggagagctctgcggag aaggctggccgcattgagaagctggaaagcgaaatccagcggctctctgaggcccatgag agcctgaccagagcctcctccaagcgtgaggccctggagaagaccatgcggaacaagatg gacagtgaaatgaggaggctgcaagacttcaaccgggatcttagagagagattggaatct gcaaatcgccgcctggcaagcaagacacaggaggcccaggccggcagtcaggacatggtg gccaagctgcttgctcagagctacgaacagcagcaggagcaagagaagctggagcgagag atggcactgctgcgcggcgccatcgaggaccagcggcggcgtgccgagctgctggagcag gctctgggcaatgcgcagggccgggcagctcgagccgaagaggagctgcgcaagaagcag gcctatgtggagaaagtggagcggctgcagcaggcgctcgggcagctgcaggcagcctgt gagaagcgggagcagctggagctgcgtctgcggactcgcctggagcaggaactcaaggcc ctgcgtgcacagcagagacaggcaggtgccccaggtggtagcagtggcagtggtgggtct ccagagctcagcgccctgcgactgtcagaacaactgcgagagaaggaggagcagatcctg gcgctggaggccgacatgaccaagtgggagcagaagtatttggaggaacgtgccatgagg cagtttgccatggatgcggctgccacggctgctgctcagcgtgacaccactctcatccga cattccccccagccctcacccagcagcagcttcaatgagggtctgctcactggtggccac aggcatcaggagatggaaagcaggttaaaggtgctccatgcccagatcctggagaaggat gcagtgatcaaggtccttcagcagcgctccaggagagaccctggcaaggccatccagggc tccctgcggcctgccaagtcggtgccatctgttttcgcggctgcggcagcaggaacccag ggctggcaagggctctcttctagtgagcgacaaacagcagacgcccctgctcggctgact acagacagagcacccacagaggagccagtggtcacagctccccctgctgcccatgccaaa cacgggagcagagatgggagcacccagactgagggccccccagacagcacctccacctgc ctgccaccggagcctgacagccttctggggtgcagcagtagccagagagcagcctctctg ggtagggccgagggggcactggggtgtgaactggaacacaaggccttttcttcccagacc ccctccgggtcattgcccgcactccatggggagccgtgcatagccagagtacctttggcc aggggcttccaaagcatctgcaccttctgtgagatccagtctgtggaaagtcactttaat gtttccttggcacagaccaggaggaataagaaatcggtcccagtgtacccagcacatact ttcaagattaagcaattcctggccaagaaacaaaagcaaaatcgtcccattccccagtgg attcggatgaaaactggtaataaaatcaggtacaactccaagaggagacattggaaaaga accaagctagcccccaagctggaccgtcaaggcgtggtcctgggaacagtgtggtcagct caggggcctgtggttgaacttttgcagccatcccaccacttgtccattgaaagatattca cctctcagacctccccttgcacatgaaacagagcagcagagtcagccaggccagggcttt ggcttgaaatgcccccatgaccaaccagctgtgctctcagcccagaagggacactcccag atgaaagagctgctctgggtcaccaacggtgccagcatgtttcaggaaggcttccaggaa gtggtagtgttttactgtgaaggaaagaatccagtacaatgcctggtgcctgcattgggt gccctggatgtattccaggatctttatgatgccctcttccatgccagtctcatgtcctgc atgctaccacccccaaatcagggaaggacaaaaagcaggaagttccatgaccagacccac atggtcttccccttgggaaggcccctccgcagggatgaagagctggacagctgccttttc ggaaccagaaatcagtactacgaaggggaagaggagtcagtcaatcaggcaagtgtggcc ttcgtccacatggtccaagctggctgctggggtgctgccatcatgtccaccacatccagc ccccaggaagagaataacattggcatcttctcactgacacagggaggaggaaaagcatca tctccctgggtctctgatttgcagatgaagaaatgcaagacggtcacctacttcgatttg ggaccaatcataactccagatccccatgcttatgtggtatag >gi568815595r:134257708_134471433|GENSCAN_predicted_peptide_2|329_aa MRSLPDRFQALHSVPGLGARNISKASHREFVPAKPLRFTSFQTSNAWAFTKTCFRAPGRE PLLHSPAALSEGRLSKPARVSSLGLGSCSPKEPKTVSKLGGSGTPARAPAPPRARADLEH RWPLVRLGRAEPRKAQRAALPGQSGPCQALRLPRKGALRPEDARSSRLRLPEKAGLRRDS SFGWGTFLWPRALSRAPSGLPYRCGPRVPSAVRHHNLRLGPAQLGGEDVFSAVAPTLWLF APQRAAPGSARAGGGCYARNAFTTTVMGCIPSLCLMFYFKRNESELHVDIHLSNIYRAAT PCQGRLKEGSDAFEEKVAIWKVQVQGGGG >gi568815595r:134257708_134471433|GENSCAN_predicted_CDS_2|990_bp atgcggagccttcctgacagattccaggcgctccatagtgtgcccggactgggagcaagg aacatctccaaggcctctcatcgcgaatttgtcccggccaagcccctcagattcacgtcc tttcagacatcaaacgcctgggcctttacaaaaacctgcttccgcgcccccggccgcgaa cctctgctgcactcgcccgctgcgctctctgaaggccgcctttctaagccagcgcgcgtc tccagccttggcctgggctcctgctcgcctaaggaaccaaagacagtctccaagctagga ggttctggcaccccagccagggctcccgccccaccacgagcccgggcggacctggagcac cggtggccgctggtacgcctggggcgagccgagccccgcaaagcccagcgcgctgcgctc cctgggcagagcggcccttgccaggcactgcggctgcctcgaaagggcgcactgcgcccc gaggacgctcgatcctcgaggctccgactcccggaaaaggccgggttgcgccgagactcc agctttggctggggcacgtttctgtggcctcgcgcgctctcccgagcgccgagcggcctt ccttaccgctgcggcccgagggtgcccagcgcagtcagacaccacaacctccggctcggc ccagctcagctcggcggcgaagatgtgttctcggccgtggcgccgacgctctggctgttc gcgccccagcgcgcagccccagggtcggcccgcgccggaggcggctgctatgccaggaat gcttttacaactactgtgatgggctgtatcccctctttgtgcttgatgttttattttaaa cgaaatgagtcagaactgcatgtcgatattcatctgtcaaatatttatcgagcagccact ccgtgccaggggagattaaaggagggatcagatgcttttgaagagaaggtggccatctgg aaggtgcaggtgcaaggtggtggaggctga >gi568815595r:134257708_134471433|GENSCAN_predicted_peptide_3|139_aa MAKGDPKKPKGKMSAYAFFVQMCREEHKKKTPEVPVNFAEFSRKCPEKRKIMSGKEKSKF DEMAKVDKLVPNPSFGSHCLDSSCSVPNSALISNPQTLVSLLEMSQKGWVRCRLTCNSEK PPYITKATKLKKYEKDVAD >gi568815595r:134257708_134471433|GENSCAN_predicted_CDS_3|420_bp atggctaaaggtgaccccaagaaaccaaagggcaagatgtctgcttatgccttctttgtg cagatgtgcagagaagaacataagaagaaaaccccagaggtccctgtcaattttgcagaa ttttccaggaagtgccctgagaagcggaagataatgtctgggaaagagaagtctaaattt gatgaaatggcaaaggtggataaattggtccctaatccttcgtttggatcccactgtctg gattcttcctgctctgttccaaattctgccctaatatcaaatccacaaaccctggtatct ctattggaaatgtcgcaaaaaggctgggtgagatgtagattaacatgtaacagtgaaaag ccaccttacatcaccaaggcgacaaagctgaagaagtatgagaaagatgttgctgactaa >gi568815595r:134257708_134471433|GENSCAN_predicted_peptide_4|169_aa MKENTNHEKQSLAIAAQESVASAQTQQQTLKVPQLVAISSLHSHHKVPSEGELSGTFHWL PTWQAYADVSGGSQVPSHMGSEARPQTCFQSPCSPLAPALLPLLPNVLVRASLLPAVAEP HQSIAACPRKHVNLSVEQYYLDLLNGNKYDCQGPSRYPCVLPGGDHLGP >gi568815595r:134257708_134471433|GENSCAN_predicted_CDS_4|510_bp atgaaagagaacaccaaccatgaaaagcagtcattggccattgctgcccaggagagtgtg gcctctgctcaaacacagcaacagaccctgaaggtgccgcagctggtggccatcagctca ctacactctcatcacaaggttccctcggagggagaactgagtggcaccttccactggctg ccaacatggcaggcttatgcggatgtttctgggggttcccaggtgccatcccacatgggc tctgaagcaaggccgcagacatgcttccagtctccctgctcaccgctggccccagcactg ctgccccttctgcccaatgtccttgtgagggcatctctgctgccggctgtggccgagccc caccagagcattgctgcatgtcctagaaagcacgtaaacctgtccgtcgagcaatattat ttggacctgctaaatggaaacaagtacgattgccagggcccaagcagatatccctgtgtc cttcctggaggtgaccacctgggtccatga