GENSCAN 1.0 Date run: 3-Nov-116 Time: 11:04:48 Sequence gi568815594r:185062801_185291196 : 228396 bp : 41.69% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 448 534 87 2 0 115 75 110 0.854 11.42 1.02 Term + 8768 8883 116 0 2 114 33 111 0.628 5.95 1.03 PlyA + 9396 9401 6 1.05 2.08 PlyA - 10416 10411 6 1.05 2.07 Term - 18381 18248 134 1 2 85 41 50 0.175 -2.73 2.06 Intr - 23392 23302 91 1 1 72 55 109 0.428 4.65 2.05 Intr - 31293 31183 111 0 0 76 26 88 0.042 1.06 2.04 Intr - 48779 48610 170 0 2 61 23 151 0.040 4.64 2.03 Intr - 57992 57884 109 2 1 53 82 53 0.032 0.14 2.02 Intr - 64338 64193 146 1 2 -1 92 128 0.285 3.38 2.01 Init - 64454 64379 76 2 1 96 89 45 0.303 6.72 2.00 Prom - 64496 64457 40 -6.95 3.03 PlyA - 64512 64507 6 -0.45 3.02 Term - 65190 64803 388 2 1 63 35 199 0.611 5.53 3.01 Init - 65926 65868 59 1 2 66 80 120 0.967 7.83 3.00 Prom - 73587 73548 40 -7.25 4.00 Prom + 78237 78276 40 -6.95 4.01 Init + 80573 80683 111 1 0 103 94 103 0.995 10.79 4.02 Intr + 81964 82450 487 0 1 114 76 485 0.920 41.56 4.03 Intr + 82959 83099 141 1 0 83 86 162 0.997 14.90 4.04 Term + 84014 84171 158 0 2 66 35 122 0.813 1.81 4.05 PlyA + 84452 84457 6 1.05 5.05 PlyA - 84487 84482 6 1.05 5.04 Term - 100125 99998 128 1 2 82 41 120 0.681 4.16 5.03 Intr - 101379 101229 151 0 1 31 71 99 0.429 1.31 5.02 Intr - 113251 112986 266 2 2 94 111 156 0.869 14.81 5.01 Init - 128396 127343 1054 2 1 103 86 733 0.985 69.00 5.00 Prom - 129642 129603 40 -11.04 6.00 Prom + 132875 132914 40 -4.15 6.01 Init + 147027 147455 429 2 0 78 98 307 0.265 25.00 6.02 Intr + 147741 147861 121 2 1 16 52 167 0.227 5.05 6.03 Intr + 155728 155770 43 2 1 71 84 24 0.024 -3.32 6.04 Intr + 164221 164397 177 1 0 85 49 149 0.298 8.81 6.05 Intr + 171452 171522 71 2 2 75 25 104 0.093 0.71 6.06 Intr + 177018 177058 41 1 2 81 82 35 0.394 -0.78 6.07 Intr + 179505 179595 91 2 1 54 84 122 0.923 7.05 6.08 Intr + 179670 179772 103 1 1 44 89 133 0.346 7.31 6.09 Intr + 196061 196264 204 2 0 27 93 146 0.047 6.39 6.10 Intr + 197013 197104 92 0 2 73 18 118 0.788 2.02 6.11 Intr + 197937 198033 97 1 1 49 74 114 0.845 4.25 6.12 Intr + 201638 201810 173 2 2 60 69 71 0.873 1.06 6.13 Intr + 204169 204355 187 2 1 88 63 204 0.896 15.73 6.14 Intr + 206707 206750 44 1 2 24 89 36 0.345 -5.73 6.15 Term + 209325 209515 191 1 2 52 37 182 0.331 6.13 6.16 PlyA + 210443 210448 6 1.05 7.00 Prom + 224459 224498 40 -3.45 7.01 Init + 224609 224756 148 1 1 71 86 90 0.065 7.40 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:185062801_185291196|GENSCAN_predicted_peptide_1|67_aa XLPQEYEGQSSSSCWSHDVTLKAGNSALEELIREDREVSVGGSKEPPSLQLRLRGLRRFI HKATLHK >gi568815594r:185062801_185291196|GENSCAN_predicted_CDS_1|204_bp ngcctgccacaagaatacgaaggccagagctccagcagctgttggagccacgatgtgacc ctgaaggcaggaaactcagctttggaagaactgatccgagaggacagggaagtcagtgtg ggcggctctaaggagcctccctctctacagctgcggctcagaggattacggcgcttcatc cacaaggcaacattgcacaagtag >gi568815594r:185062801_185291196|GENSCAN_predicted_peptide_2|278_aa MDAARGHVLSQLAPGTENQILHVLTYGNNEHLGAKSGKRGRGPTAEKRPIGYHVHYLVNG FNKSPNLSIMRYTQANHLHGAPWLLQAIYAPLSQGCEAGRSLKKGSAGRTGSLVTLLLMF RLIDMGSRVAEGSGTHVKEPTRRKMFRLTDMGSRALCYPGPGTASAYVWGTGPKTALEFS YSLEGLTELTESCDTHSYGLLQGKLPADLHAQTGAVHRLQQEKGLLACGNHLLQHSRDTM VNRQMMKSSPGDDWCLVPSCTKPVFMEFTREVETKDRQ >gi568815594r:185062801_185291196|GENSCAN_predicted_CDS_2|837_bp atggatgcagctagaggccatgtcctaagccaattagcaccaggaacagaaaaccagatc ctgcatgttctcacttatgggaacaatgagcacttgggcgccaaaagtggcaagcggggg agggggccaacagctgaaaaacgacctatcgggtaccatgttcactatttggttaatggg ttcaataaaagcccaaacctcagcatcatgcgatatacccaggccaatcatctccatggg gccccgtggctgctacaagccatttacgccccgctgagtcagggctgtgaagctgggaga tctctgaagaagggttcagctggcaggacaggctctctagtgactctcctactgatgttc agactcattgacatgggttccagggtagcagagggctctggaacccatgtcaaggagcct acccgcaggaagatgttcaggctcactgacatgggttccagagccctctgctatcctggt cctggcaccgcctcagcatatgtttgggggaccggtcccaaaaccgcccttgagttcagt tattcactagagggactcacagagctcactgagagctgtgatactcacagttatggttta ttacagggaaagctccctgcagacttgcatgctcagacgggagctgtccaccgcctgcaa caggaaaagggcttgcttgcgtgtggtaaccaccttcttcaacattctagggatacaatg gtgaacaggcaaatgatgaagtcaagtcctggagatgactggtgtttagtccctagttgc actaaacctgttttcatggagtttacacgagaagtggagacaaaagacagacaataa >gi568815594r:185062801_185291196|GENSCAN_predicted_peptide_3|148_aa MRGVRRGPGAEGQAGLATPSTHPGHRTPVLSRGSSKPWFSPDREFRSVRAPFRPPAPSVP PQGPEAALGAQRIPRRLEPQPRRFVAAVQETAGEGARGAETACRGLCSPGASFGLAEPDS QPALTTAQRLARWVSEPPTLKDTDLART >gi568815594r:185062801_185291196|GENSCAN_predicted_CDS_3|447_bp atgcgcggggtgcgtcggggtcccggcgccgagggccaggcggggctggcgacacccagc acacatcccggacacagaacgcccgtactttcccgagggtcttccaagccctggttcagc ccggaccgtgagttccggagcgtccgcgcccccttccgccctcccgcaccctcagtcccg cctcagggccccgaagccgccctgggcgcgcagcgcatccccaggcgactggagccccag ccccgacgcttcgtcgccgcagtccaggagaccgcaggagaaggggctcgtggggcggag acagcctgccggggcctctgcagcccgggagcctcctttggactcgccgaacccgactcc caacccgcccttaccactgcccagaggctcgcgcgctgggtttctgagcctcccaccctg aaggacaccgatctcgcccggacataa >gi568815594r:185062801_185291196|GENSCAN_predicted_peptide_4|298_aa MGDHAWSFLKDFLAGGVAAAVSKTAVAPIERVKLLLQVQHASKQISAEKQYKGIIDCVVR IPKEQGFLSFWRGNLANVIRYFPTQALNFAFKDKYKQLFLGGVDRHKQFWRYFAGNLASG GAAGATSLCFVYPLDFARTRLAADVGKGAAQREFHGLGDCIIKIFKSDGLRGLYQGFNVS VQGIIIYRAAYFGVYDTAKGMLPDPKNVHIFVSWMIAQSVTAVAGLVSYPFDTVRRRMMM QSGRKGADIMYTGTVDCWRKIAKDEGAKAFFKGAWSNVLRGMGGAFVLVLYDEIKKYV >gi568815594r:185062801_185291196|GENSCAN_predicted_CDS_4|897_bp atgggtgatcacgcttggagcttcctaaaggacttcctggccgggggcgtcgccgctgcc gtctccaagaccgcggtcgcccccatcgagagggtcaaactgctgctgcaggtccagcat gccagcaaacagatcagtgctgagaagcagtacaaagggatcattgattgtgtggtgaga atccctaaggagcagggcttcctctccttctggaggggtaacctggccaacgtgatccgt tacttccccacccaagctctcaacttcgccttcaaggacaagtacaagcagctcttctta gggggtgtggatcggcataagcagttctggcgctactttgctggtaacctggcgtccggt ggggccgctggggccacctccctttgctttgtctacccgctggactttgctaggaccagg ttggctgctgatgtgggcaagggcgccgcccagcgtgagttccatggtctgggcgactgt atcatcaagatcttcaagtctgatggcctgagggggctctaccagggtttcaacgtctct gtccaaggcatcattatctatagagctgcctacttcggagtctatgatactgccaagggg atgctgcctgaccccaagaacgtgcacatttttgtgagctggatgattgcccagagtgtg acggcagtcgcagggctggtgtcctacccctttgacactgttcgtcgtagaatgatgatg cagtccggccggaaaggggccgatattatgtacacggggacagttgactgctggaggaag attgcaaaagacgaaggagccaaggccttcttcaaaggtgcctggtccaatgtgctgaga ggcatgggcggtgcttttgtattggtgttgtatgatgagatcaaaaaatatgtctaa >gi568815594r:185062801_185291196|GENSCAN_predicted_peptide_5|532_aa MDQFGDILEGEVDHSFFDSDFEEGKKCETNSVFDKQNDDPKERIDKDTKNVNSNTGMQTT ENYLTEKGNERNVKFPPEHPVENDVTQTVSSFSLPASSRSKKLCDVTTGLKIHVSIPNRI PKIVKEGEDDYYTDGEESSDDGKKYHVKSKSAKPSTNVKKSIRKKYCKVSSSSSSSLSSS SSGSGTDCLDAGSDSHLSDSSPSSKSSKKHVSGITLLSPKHKYKSGIKSTETQPSSTTPK CGHYPEESEDTVTDVSPLSTPDISPLQSFELGIANDQKVKIKKQENVSQEIYEDVEDLKN NSKYLKAAKKGKEKHEPDVSSKSSSVLDSSLDHRHKQKVLHDTMDLNHLLKAFLQLDKKG PQKHHFDQPSVAPGKNYSFTREEVRQIDRENQRLLKELSRQAEKPGSKSTIPRSADHPPK LYHSALNRQKEQQRIERENLALLKRLEAVKPTVGMKRSEQLMDYHRNMGYLNSSPLSRRA RSTLGQYSPLRASRTSSATSGLSCRSERSAVDPSSGHPRRRPKPPNVRTAWL >gi568815594r:185062801_185291196|GENSCAN_predicted_CDS_5|1599_bp atggatcagtttggagatatattagaaggtgaagtggaccattctttctttgacagtgac tttgaagaaggaaagaaatgtgaaactaactcagtttttgacaagcaaaatgatgaccca aaggaaagaatagataaagatacaaaaaatgtaaattcgaacactggaatgcaaacaaca gaaaattatcttactgagaagggaaatgaaagaaacgtgaaatttcccccagaacacccc gtagagaatgatgttacacaaactgtaagttctttctcattgccagcctcttcaagatca aaaaaattgtgtgatgttacaacaggacttaaaatacacgtgtccattccaaatagaatt cccaaaattgtaaaagaaggtgaagatgattactacacagatggagaggaaagcagtgat gatgggaagaaataccatgtgaagtccaagtccgctaaaccatctactaacgttaaaaaa agcataaggaaaaagtattgcaaagttagctcctcttcctcctcctctttatcttcctca tcttcaggttcaggtacagattgtttagatgcagggtctgatagccatctatctgattcg tctccgtcatctaagtcatctaagaaacatgtatctggtataaccctcctgtcaccaaaa cacaagtataaatcaggaataaaatcgacagaaacacagccttcaagtactacaccaaaa tgtggccactaccctgaggagtctgaagatactgtgactgacgtaagtcccttatcaact ccagacattagccctcttcagtcttttgaactgggcatagcaaatgatcaaaaagtgaaa attaaaaagcaagaaaatgtgagccaagaaatatatgaagatgttgaggatttgaaaaat aattcaaaatatttgaaagcagccaaaaaagggaaagaaaaacatgagcctgatgtctcc tcaaagtcgtcttcagtgttagactccagtttagaccacagacataaacagaaagtctta catgacacaatggatctgaatcatctcttgaaagcttttctgcaattagataaaaaagga ccacaaaaacatcactttgatcagccttcagtagcacccgggaaaaactactctttcaca agagaagaggtgagacagatcgatcgggaaaatcagaggcttttgaaagaactgtcaaga caggcggaaaagccgggaagcaaaagtacaattcctagatcggctgatcatcccccaaag ttatatcacagtgctctcaacagacagaaggaacaacaaaggattgagagagaaaacttg gctttattgaaaaggcttgaggccgtgaaaccaacagttggtatgaaacgttcagaacaa ctgatggactatcatcgcaatatgggctatctcaactcatcaccattgtcaagacgggcc agatccactcttggccaatatagcccattaagagcttccaggacatccagtgctacgagt ggtctcagttgtaggagtgagcgatcagcggttgacccctccagtggccaccctcgaaga agacctaaaccccctaatgtccgtacagcttggttataa >gi568815594r:185062801_185291196|GENSCAN_predicted_peptide_6|687_aa MHPDATDSGGAGPSPARAAGAGGRPVSGFRGERRPESPGDAEAAAAAAPGAPGGRSWWKP VAVAALAAVALSFLGPGSGEAAGAAGLSSVLFRLSLYLSCAAAAFLLGILFALVCRSPRA QPPDFAAAWSRLAATSAARRPPGGVPRPSVSLAVGGEEGNHSVRYVQALRTVLAIRLVLL AVLASQHSAATLGTCQSCPVSRKSLRAVDQVTVSVLGTTATQDDTQKKKVPFWSKENVLR NASGSVPLTSYWPGLGRKQVIHQETLRKEPAMKVKYDGMREGDLAGSQDNSGGKHRLPGD SVRAHRLSEAQSPRALNPPPPTSVFNLLEQLTELRETLTYIYWFIVEDIAKDADEEMLRY RDYILSWYGNLSRDEGQLYHLLLEDFWEIARQLHHRLSHVDVVKVVCNDVVRTLLTHFCD LKAANAREEIKYLDGVPLLPCDAVRVGDLAAECGCTAVLTSQMPYKDLGDTQESLDHPLK TAAYRESNFQHEEQPRPFVLHACLRNSDDEVRFLQTCSRVLVFCLLPSKDVQSLSLRIML AEILTTKVLKPVVELLSNPDYINQMLLAQLAYREQMNEHHKRAYTYAPSYEDFIKLINSN SDVEFLKQLSRSTVKAGVDYLVTPRAVTDSANSTRAKNVNATLVPDIKGQLDGFPTVGVG GEGCHNQGRQTRAQNEGASVSETHNCH >gi568815594r:185062801_185291196|GENSCAN_predicted_CDS_6|2064_bp atgcaccccgatgcgaccgacagtggcggcgccggccccagccccgcgcgggccgcaggc gccggcggccgtcctgtctcgggcttcaggggcgagcggcggccggagtccccgggggac gcggaggcagcagcagcggcggcgccgggggccccgggcggccggagctggtggaagccc gtggcggtggccgcactcgccgccgtggccctctccttcctggggcccggcagcggggag gcggcgggggccgcggggctgagctccgtcctgttcaggctcagcctgtacctgagctgc gcggcggccgccttcctgctggggatcctgtttgccctcgtctgccggagcccgcgcgcc cagccgcccgacttcgccgccgcctggagccggctggccgcgacctcagccgcccgccgc ccgccggggggtgtccccagaccttcggtgtcgcttgccgtgggaggggaagaagggaat cacagcgttcgctacgtgcaggctctgcgcacggtcctggcgatccgcttggttcttctg gccgttctcgctagccagcactcagcagccactctaggcacttgccagtcctgccccgtg agtcggaagagcttgcgggctgttgaccaggtgactgtatcagttttaggcaccacagcc acgcaggatgatacccagaagaagaaagtccctttttggagcaaggaaaatgtgctcaga aatgcttctggcagtgttcccctcacatcttactggccaggtttgggcagaaagcaggtt attcaccaggaaacattacgaaaggaacctgctatgaaagtaaaatatgacggcatgaga gagggggatttggcagggtcacaggacaatagtggagggaagcatcgtctacctggagac agtgtcagagcccatagattaagtgaagctcagtccccaagagccctcaaccccccaccc ccaacgtcagtctttaatttgctggagcagctcacagaactcagggaaacgcttacttac atttactggtttattgtagaagatattgcaaaggatgcagatgaagaaatgctacgttat agagattacattctgtcctggtatggaaacctcagcagagatgagggacaactttaccat ctgctcttggaagacttttgggaaattgccagacagctgcaccacagactgagtcacgtg gatgtggttaaagttgtctgcaatgatgttgtgaggactttactcactcatttctgtgac ctgaaagctgccaatgccagggaagagataaagtatttagatggtgtcccccttctgccc tgtgatgctgttcgtgtaggggaccttgctgctgaatgcggctgcacagcagttttaaca tcccagatgccctacaaggatcttggggacacccaggagtctctggaccaccctttgaaa actgctgcctatagagaaagtaacttccaacatgaagaacagccaagaccttttgtgttg cacgcatgcttgaggaactcagatgatgaagtaagatttctacaaacgtgttctcgggtt ctggtgttttgtctcctcccctcaaaggatgtgcagtctctcagcttacgtataatgctt gcagaaattctcacaacaaaagtcttgaagccggtagtggagttactgagtaatccagat tacattaaccaaatgctgcttgcccagctggcgtacagagagcaaatgaatgagcatcac aagagagcctacacctatgccccctcttacgaggacttcatcaagctcattaacagcaac tctgatgtggagttcttgaagcaactaagtcgttccacagtaaaggcaggtgtggactat ttggtaaccccaagagcagtgactgactctgctaatagcacaagagccaaaaatgtgaat gccaccttggttccggatattaaagggcagctggatgggtttcccacagttggtgtaggt ggtgagggatgccacaatcaaggaaggcaaaccagagctcagaatgaaggtgcatctgtt agtgaaactcacaattgccattaa >gi568815594r:185062801_185291196|GENSCAN_predicted_peptide_7|50_aa MRRCTQAGQWGCRGEGSSENYLRRHHPVELSTVKKMFYLAPPGVVHKPTX >gi568815594r:185062801_185291196|GENSCAN_predicted_CDS_7|150_bp atgaggcgctgcactcaggcagggcagtggggctgcaggggagagggcagttctgagaac tatttaagacggcaccatccagtagaactttctacagtgaagaaaatgttctacttggca ccgcccggtgtggtccacaagcccacagnn