GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:19:53 Sequence gi568815581f:43299733_43500335 : 200603 bp : 46.84% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 410 405 6 -1.95 1.03 Term - 923 670 254 0 2 89 44 156 0.040 7.10 1.02 Intr - 16202 16126 77 1 2 17 101 115 0.275 4.86 1.01 Init - 17965 17916 50 1 2 51 106 48 0.445 3.32 1.00 Prom - 18546 18507 40 -4.26 2.04 PlyA - 23574 23569 6 1.05 2.03 Term - 23791 23579 213 1 0 69 42 138 0.518 4.53 2.02 Intr - 31207 31014 194 2 2 49 97 103 0.358 6.51 2.01 Init - 33367 33025 343 1 1 83 55 192 0.356 12.80 2.00 Prom - 37540 37501 40 -1.66 3.00 Prom + 37599 37638 40 -2.46 3.01 Init + 38011 38231 221 0 2 40 76 137 0.633 5.80 3.02 Term + 38331 38448 118 0 1 33 46 85 0.276 -3.19 3.03 PlyA + 39517 39522 6 1.05 4.10 PlyA - 39617 39612 6 1.05 4.09 Term - 39772 39717 56 2 2 49 45 125 0.054 2.02 4.08 Intr - 43840 43756 85 1 1 88 91 16 0.178 1.29 4.07 Intr - 47507 47340 168 2 0 67 26 165 0.059 8.34 4.06 Intr - 47850 47761 90 0 0 63 89 37 0.205 1.49 4.05 Intr - 60974 60782 193 1 1 22 87 144 0.267 7.19 4.04 Intr - 71694 71515 180 2 0 112 53 64 0.860 4.28 4.03 Intr - 74407 74355 53 1 2 108 89 -6 0.834 -0.79 4.02 Intr - 78081 77939 143 1 2 76 67 113 0.953 8.07 4.01 Init - 81599 81551 49 2 1 62 86 76 0.689 5.91 4.00 Prom - 83510 83471 40 -7.06 5.00 Prom + 96811 96850 40 -6.76 5.01 Sngl + 100001 100606 606 1 0 74 46 750 0.772 65.60 5.02 PlyA + 101983 101988 6 1.05 6.06 PlyA - 104752 104747 6 1.05 6.05 Term - 112181 112098 84 2 0 134 47 4 0.412 -1.45 6.04 Intr - 114811 114716 96 1 0 52 72 50 0.312 0.01 6.03 Intr - 115802 115659 144 2 0 84 51 62 0.549 2.58 6.02 Intr - 118053 117884 170 1 2 119 37 64 0.522 4.07 6.01 Init - 125807 125759 49 2 1 86 89 23 0.141 1.32 6.00 Prom - 139731 139692 40 -4.06 7.00 Prom + 144115 144154 40 -3.86 7.01 Sngl + 176466 176933 468 2 0 110 43 440 0.989 37.83 7.02 PlyA + 177016 177021 6 1.05 8.00 Prom + 180246 180285 40 -6.96 8.01 Init + 184306 184453 148 0 1 89 85 223 0.890 22.35 8.02 Intr + 189717 189802 86 1 2 103 92 73 0.991 8.74 8.03 Intr + 192451 192596 146 0 2 92 46 184 0.956 13.68 8.04 Intr + 192949 193308 360 1 0 28 90 278 0.790 16.34 8.05 Intr + 193713 193857 145 0 1 70 100 110 0.930 10.68 8.06 Intr + 193951 194154 204 0 0 78 47 259 0.974 20.20 8.07 Intr + 196449 196536 88 2 1 79 88 156 0.569 14.24 8.08 Intr + 199130 199227 98 0 2 24 100 34 0.035 -2.07 8.09 Intr + 200224 200371 148 0 1 64 58 80 0.024 2.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 45662 45608 55 2 1 58 93 56 0.813 4.55 S.002 Init + 84927 84993 67 2 1 71 71 74 0.840 5.24 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:43299733_43500335|GENSCAN_predicted_peptide_1|126_aa MLEKVSAPGAKNLQGIKDLEKFVESSIFAFAITTLISPYYTHGKRGPLSADRSLPQTPVA FAPGLFRFCLFAGSIRTRKSPPAEPRLPQAEAAPGAESTAEATPSVKGRGMQIRNESPGT PEEARV >gi568815581f:43299733_43500335|GENSCAN_predicted_CDS_1|381_bp atgctggagaaagtcagtgctccaggagctaaaaaccttcaaggaataaaagatctggaa aaatttgtcgaatccagcatctttgccttcgccattaccaccctcatatcaccatattac acccacggaaaacgtggaccgctctccgccgacaggtctcttccacagacccctgtcgcc ttcgcccccggtctcttccggttctgtcttttcgctggctcgatacgaacaaggaagtcg cccccagcggagccccggctcccccaggcagaggcggccccgggggcggagtcaacggcg gaggccacgccctctgtgaaagggcggggcatgcaaattcgaaatgaaagcccgggaacg ccggaagaagcacgggtgtaa >gi568815581f:43299733_43500335|GENSCAN_predicted_peptide_2|249_aa MPRLSLYDELLTDLNEFISPNQQNPAERYYQQPSQLYPPTFPQTLNSPAMQIENEMMQPE KEALNYVFMQPGTEAPISTQPGEEALNSLPGNEALNPIPVQPCCRALNFLPIQPANRDIL KEIVLGIQTAVLFNTNNQNFFNNKPHFLLYAHDAKKEITGWLNAIQNLILMAIPYGHFQF LEDSSSRTSRSTTVFDLSDKCIEETLTFAVLPLHLGYHKGKGPLSSGHVTRVTYRSLEMT GTPYPAPLP >gi568815581f:43299733_43500335|GENSCAN_predicted_CDS_2|750_bp atgccccggttatcgctttatgatgaacttttaactgatctgaatgaatttatttccccc aaccagcaaaatccagctgaaagatattatcaacagccatcacagctgtaccctcctacc tttcctcaaactctcaactctccagctatgcagatagaaaatgagatgatgcagcctgaa aaagaggctctaaactatgttttcatgcagcctggtacagaggctccaatttctacacag cccggtgaagaggctctcaattccctgcctggtaatgaggccctgaaccctatccctgtt cagccctgctgtagagctctcaacttcttgcctattcagcctgccaatcgggacattcta aaagaaattgtccttggcatacagaccgctgttcttttcaacaccaacaaccaaaatttt ttcaacaacaagccccactttctactatatgcccacgatgccaaaaaggaaatcactggg tggctcaatgccattcaaaatttgatattgatggcaatcccttacggtcacttccagttc ttagaagacagcagtagcagaactagtaggagtaccacagtcttcgatctttctgataag tgcatagaagaaacgctgacgtttgctgtcctccctctccacctcggctaccacaaaggg aaaggccccctgtccagtggacacgtgactcgcgtgacctatcgatcattggagatgact ggcactccttaccctgcccccttgccttga >gi568815581f:43299733_43500335|GENSCAN_predicted_peptide_3|112_aa MKRIWLDYCLDCRYTWSWLRYSSKHLWYRLRSLIICLRSLIFRLRTLGSRVISHLLSRIS GLWLILCHSNRSGRNVCNLLLMSSHGGGNSSGNRTRSQKEIRVLNRGRIKNL >gi568815581f:43299733_43500335|GENSCAN_predicted_CDS_3|339_bp atgaaaaggatctggttggactactgcctggactgcaggtatacctggagctggctgcgc tacagcagcaagcatttatggtataggttgaggagcctgattatttgcctgaggagcctg attttcaggctgcggaccttggggagccgtgtgatcagccacctgctgagcaggatcagc gggctgtggctgatcctgtgccacagcaacaggagcggcaggaatgtatgtaacctgttg cttatgagcagccatggtggtggcaacagcagtggtaaccggaccagaagccaaaaagag attcgagttttgaatagaggaagaatcaagaacctgtaa >gi568815581f:43299733_43500335|GENSCAN_predicted_peptide_4|338_aa MEHMLLANHTDKATFTGRGWKATFDGCILHDDLLTKVHGFQSLIGLQQHLACGDSGSLMK NCHDFPVACKTFSLKANSCHDRGIETIRGNFREFPGHIHPPPPRVTTTSTFPSGAVDEPS VLLAKAHLPLGHRNPSIPPYLWQYKHLSIQVYRMCFTIESRTAFGFTWLCGMRYEDRCIQ VYRLYFTAESDCRQICMTDCCIRFLRAPLKPQALLKRGPQLPWQPGFQAHERPAQQGHKI THSPFILLSRRLILQLIKINSLIIKPLSRTIRLLREIIRLHNCLPLGHSQYLDFEKFAFT VPTLNNIPPAAHYHWKVLPQVCRSRKESEDKSKVNDKQ >gi568815581f:43299733_43500335|GENSCAN_predicted_CDS_4|1017_bp atggaacacatgctccttgctaaccacacggacaaggccacgttcacaggacgggggtgg aaggccaccttcgatgggtgcatcctccacgatgacctgctaacaaaggtgcatggattt cagagtctgattggcctacaacagcatttggcttgtggagacagtggttccctgatgaaa aactgccatgatttccccgtggcatgcaaaacgttctccctcaaagctaactcatgtcat gacaggggaatagaaacaatcagaggaaacttccgtgagttcccaggacacatccaccca cctcctccacgtgtaaccaccacctctaccttcccctctggtgctgtggatgagccatcc gtgctcctggcaaaggcccacctgccacttgggcacaggaacccatccatccctccttac ctctggcagtataaacatctttccattcaagtttaccgcatgtgtttcactatcgagagt cggactgcttttggatttacgtggctttgtgggatgcggtatgaagatcgttgcattcaa gtttaccgcctctacttcactgcagaatctgactgccgtcagatctgtatgacagattgt tgcattcgttttctgcgggctcccttgaagcctcaggctctcttgaagcgtggtccacag ttgccgtggcagcctggatttcaggcccatgaaagacctgctcagcagggtcacaaaatc acccacagcccattcatcctgctcagcaggcggctgatcctgcagctcatcaagatcaac agcctgataatcaagcccctcagccggacaatcaggctgctcagggaaataatcaggctc cacaactgccttccgctggggcacagccagtacctggactttgaaaaatttgctttcaca gttccaacccttaacaacatccctcctgcagcacattaccattggaaagtcctacctcaa gtctgcagatcccgcaaggaatccgaagacaagtccaaagtcaacgacaagcaatga >gi568815581f:43299733_43500335|GENSCAN_predicted_peptide_5|201_aa MGNHLTEMAPTASSFLPHFQALHVVVIGLDSAGKTSLLYRLKFKEFVQSVPTKGFNTEKI RVPLGGSRGITFQVWDVGGQEKLRPLWRSYTRRTDGLVFVVDAAEAERLEEAKVELHRIS RASDNQGVPVLVLANKQDQPGALSAAEVEKRLAVRELAAATLTHVQGCSAVDGLGLQQGL ERLYEMILKRKKAARGGKKRR >gi568815581f:43299733_43500335|GENSCAN_predicted_CDS_5|606_bp atggggaaccacttgactgagatggcgcccactgcctcctccttcttgccccacttccaa gccctgcatgtcgtggtcattgggctggactctgctggaaagacctccctcctttaccgc ctcaagttcaaggagtttgtccagagtgtccccaccaaaggcttcaacaccgagaagatc cgggtgcccctcgggggatcgcgtggcatcaccttccaagtgtgggacgtcggggggcag gagaagctgcgaccactgtggcgctcttatacccgccggacagacggtctagtgtttgtg gtggacgctgcggaggctgagcggctggaggaagccaaggtggagttgcaccgaatcagc cgggcctcggacaaccagggcgtgccagtgctggtgctggccaacaagcaggaccagccc ggggcactgagcgctgctgaggtggagaagaggctggcagtccgagagctagcagccgcc actctcactcatgtgcaaggctgcagcgctgtggacggtctgggcctgcagcagggcctt gagcgcctctatgagatgatcctcaagaggaagaaggcagctcggggtggcaagaagaga cggtga >gi568815581f:43299733_43500335|GENSCAN_predicted_peptide_6|180_aa MGFRHVDQAGLKLLTSDSPLCHTPTQPSAIRALNTQKGKTICAEREDRAREEGKGCSLPA DVLQTELPVSVPRSSSPDRLARYLHQWLTLCPCLLHSPLSDIFKLFYFQTVFVIAISCLF EGNKLSWFAQVPDISQDAGLSVPKLDKLSQLGLDSGTSTNRAVPEEKEVSALSPSTPPTP >gi568815581f:43299733_43500335|GENSCAN_predicted_CDS_6|543_bp atggggtttcgccatgtagaccaggctggtctcaaactcctgacctcagacagcccgctc tgccacactcccacccagccctctgctatcagagctctgaacacccaaaaaggaaaaacg atttgtgctgagcgcgaagacagagccagggaggaagggaagggctgctcgctgccagct gacgttctgcagactgagctgcctgtgtctgtgccaaggtcctcaagtcctgatcgcctt gctcgctatttgcaccagtggcttactctctgtccctgcctcctacactctccactgtca gatattttcaagctgttttattttcagacagtcttcgtcattgccatcagctgcctattt gagggcaacaaactgtcctggtttgcccaggttccagatatttctcaggatgcaggactt tcagtgccaaaactagacaaattgagccagctgggactggactctggcacttctacaaat agagctgtgccagaagaaaaggaagtttcagcactcagtccatccacaccccccaccccc tga >gi568815581f:43299733_43500335|GENSCAN_predicted_peptide_7|155_aa MAKSKNHSTNNQSRKRHRNGIKKPRSRRYESLKGMDPKFPRNMCFAKKQNKKVLKKMQAN SDKAMSARAEVIKALVKPKEVKLKIPKGVSCKLDRLAYIAHPKLGKRARARIAKGLRLCW PKAKAKDQTKAQAAAPASVPAQAPKGAQAPTKASE >gi568815581f:43299733_43500335|GENSCAN_predicted_CDS_7|468_bp atggccaagtccaagaaccacagcacaaacaaccagtcccgaaaaaggcacagaaatggt atcaagaaaccccgatcacgaagatatgaatctcttaaggggatggaccccaagttcccg aggaacatgtgctttgccaagaagcaaaacaagaaggtcctaaagaagatgcaggccaac agtgacaaggccatgagtgcacgtgctgaggttatcaaggccctcgtaaagcccaaggag gttaagctcaagatcccaaagggtgtcagctgcaagctcgatcgacttgcctacattgcc caccccaagcttgggaagcgggctcgtgcccgcattgccaaggggctcaggctgtgctgg ccaaaggccaaggccaaggatcaaaccaaggcccaggctgcagctccagcttcagttcca gctcaggctcccaaaggtgcccaggcccctacaaaggcttcagagtag >gi568815581f:43299733_43500335|GENSCAN_predicted_peptide_8|475_aa MAVAVAMAGALIGSEPGPAEELAKLEYLSLVSKVCTELDNHLGINDKDLAEFVISLAEKN TTFDTFKASLVKNGAEFTTMLDEDDVKVAVDVLKELEALMPSAAGQEKQRDAEHRFVLSV LSFGSLGDRTKKKKRSRSRDRNRDRDRDRERNRDRDHKRRHRSRSRSRSRTRERNKVKSR YRSRSRSQSPPKDRKDRDKYGERNLDRWRDKHVDRPPPEEPTIGDIYNGKVTSIMQFGCF VQLEGLRKRWEGLVHISELRREGRVANVADVVSKGQRVKVKVLSFTGTKTSLSMKDVDQE TGEDLNPNRRRNLVGETNEETSMRNPDRPTHLSLVSAPEVEDDSLERKRLTRISDPEKWE IKQMIAANVLSKEEFPDFDEETGILPKVDDEEDEDLEIELVEEEPPFLRGHTKQSMDMSP IKIVKNPDGSLSQAAMMQSALAKERRELKQAQREAEMDSIPMGLNKHWVDPLPDX >gi568815581f:43299733_43500335|GENSCAN_predicted_CDS_8|1425_bp atggctgtggctgtagccatggcgggagccttaatcgggtcggagccaggccccgcggaa gaacttgccaaactcgagtacctgtctttggtgtcaaaggtttgcactgagctggacaat cacttggggatcaacgacaaggaccttgctgaatttgtgatcagtcttgctgagaaaaat accacctttgatacttttaaggcttctctcgtcaaaaatggtgcagaatttacgaccatg ttggatgaagatgatgtgaaagttgctgtggatgtcctgaaagaactggaagctttaatg cccagcgcagcaggccaggagaagcaaagagatgctgaacaccggtttgtccttagtgtc ctgtcctttggaagtttaggggacaggacaaagaagaagaagcggagtcgaagccgagat cgaaaccgagatcgagacagagatagggaacgaaaccgagatagagaccacaagcggaga caccgatcccgctctcgatcacgttccaggacccgggagaggaataaagtgaagtctaga tatcggtccaggagcaggagtcagagtccccccaaagaccggaaggaccgggacaaatat ggagagcggaatctggatagatggcgggataagcatgtggaccgccctcctccagaagag cccaccattggtgacatttataatggcaaagttaccagcatcatgcagtttggttgcttt gtgcagctggaaggactaaggaagcggtgggaaggcctggtgcacatctctgagctccgg cgggagggtcgtgtggccaatgtagctgatgtcgtgagcaaaggccagagggtcaaagtc aaagtgctgtccttcactgggaccaagaccagcctgagcatgaaggatgtggatcaagag actggagaagatctaaacccaaatagacggcgaaatcttgtcggggagaccaatgaggag acctcaatgcggaatcctgatagacccactcacttgtcccttgtcagtgctcctgaagta gaggacgactcactggaacgcaagcgcctcacccgaatctctgacccagagaagtgggag atcaaacagatgattgctgccaatgtcctttccaaagaagaatttccagactttgatgaa gagactggcattctccctaaggtggatgatgaagaagatgaggaccttgagattgaattg gttgaggaagagcctccattcctgagagggcacactaagcaaagcatggacatgagcccc attaaaattgtcaagaacccagacggctccctctcccaagcagcaatgatgcagagtgcc ttggccaaagaaaggcgggaactcaaacaggcccagcgggaagctgagatggattctatt cccatgggactcaacaaacactgggttgaccctctgcctgatgnn