GENSCAN 1.0	Date run: 5-Nov-116	Time: 05:52:45

Sequence gi568815597r:182546738_182771731 : 224994 bp : 43.45% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1580   1718  139  2  1  111   90   140 0.984  16.97
 1.02 Intr +   1963   2087  125  0  2   70   88   177 0.999  15.28
 1.03 Intr +   4363   4472  110  1  2  122   99    14 0.977   5.83
 1.04 Intr +   6718   6804   87  2  0  117   68    16 0.677   2.44
 1.05 Intr +   7890   7956   67  1  1   94   49    34 0.549  -1.94
 1.06 Intr +   9002   9128  127  2  1   93  100     8 0.454   3.18
 1.07 Term +  10276  10329   54  0  0   69   53    83 0.526   0.46
 1.08 PlyA +  10620  10625    6                               1.05

 2.08 PlyA -  12567  12562    6                               1.05
 2.07 Term -  17561  17545   17  0  2  115   43    11 0.167  -2.30
 2.06 Intr -  28841  28690  152  1  2   73   44    40 0.269  -2.09
 2.05 Intr -  29652  29519  134  0  2   91  106   115 0.838  13.04
 2.04 Intr -  34620  34488  133  2  1   41   86   140 0.796   9.75
 2.03 Intr -  35521  35316  206  1  2   49  107   108 0.819   6.80
 2.02 Intr -  37429  37344   86  2  2   58   84    78 0.997   3.94
 2.01 Init -  40069  38590 1480  1  1   97   95  1187 0.999 112.29
 2.00 Prom -  51361  51322   40                              -5.86

 3.17 PlyA -  51909  51904    6                               1.05
 3.16 Term -  53776  53555  222  1  0  152   47   255 0.999  24.72
 3.15 Intr -  55395  55229  167  1  2   91   50   173 0.978  13.48
 3.14 Intr -  55747  55683   65  0  2   95   87    55 0.736   4.46
 3.13 Intr -  56602  56492  111  0  0   98  100    59 0.994   7.59
 3.12 Intr -  57624  57479  146  0  2   14   94   120 0.820   4.28
 3.11 Intr -  58047  57918  130  2  1   81   80    57 0.693   4.90
 3.10 Intr -  68597  68373  225  1  0   87   51   109 0.000   4.20
 3.09 Intr -  88240  88042  199  2  1   78   77    66 0.183   2.91
 3.08 Intr -  90569  90446  124  2  1   74   59    59 0.388   1.76
 3.07 Intr - 100180 100094   87  1  0  129   20   100 0.369   7.47
 3.06 Intr - 101566 101400  167  2  2  104   85   137 0.999  14.68
 3.05 Intr - 122753 122655   99  0  0  102   85    24 0.214   3.58
 3.04 Intr - 123788 123726   63  0  0   71   93    37 0.140   1.19
 3.03 Intr - 133401 133332   70  0  1   51   75    49 0.001  -1.35
 3.02 Intr - 137723 137615  109  1  1   71  110    75 0.051   8.29
 3.01 Init - 158161 158100   62  1  2   66   94    35 0.212   2.62
 3.00 Prom - 158811 158772   40                              -4.46

 4.00 Prom + 158942 158981   40                              -5.16
 4.01 Init + 161595 161649   55  2  1   83  100    47 0.264   6.87
 4.02 Intr + 169880 169933   54  0  0   52   94    69 0.184   2.85
 4.03 Term + 171514 171824  311  2  2   37   43   155 0.120   1.12
 4.04 PlyA + 172058 172063    6                               1.05

 5.00 Prom + 175937 175976   40                              -4.66
 5.01 Init + 181744 181817   74  0  2   68   89   112 0.975   9.84
 5.02 Term + 223599 223758  160  0  1   36   48   135 0.043   1.71
 5.03 PlyA + 224268 224273    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 135144 135263  120  2  0  125   48   149 0.978  13.07

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:182546738_182771731|GENSCAN_predicted_peptide_1|236_aa
XFNDLVSSAHMLQVNRAYNENDVILMRSKMNIIQKLFLNSDIPPKLRVNVPEFQKDAILA
AITEGYLDRSVFHGAIMSVFPVVMYFWKRFCFWKATRSYLQYRGKKFKDRKSPPKSTDKY
PFSSGGDNAILRFTLLRGIEWLQPQREAISSVQNSSSSKLTQPRLVVSAMQLHPVQGDLK
LPRNYLPVPVPRRQTRYFEALYTGKAMHGFRVYKALGKKTFHKDEKGDVGAELMNT

>gi568815597r:182546738_182771731|GENSCAN_predicted_CDS_1|711_bp
nnatttaatgatctggtcagttcagcccacatgctgcaggtcaaccgggcatataatgag
aatgatgtgatcctaatgcggtccaaaatgaacattatccaaaaactcttcctgaattct
gacatccctccaaagctgagggtgaatgtccctgagttccagaaggatgccatccttgct
gccatcacagagggctacctagatcggagcgtcttccatggggctatcatgtctgtcttc
cccgttgttatgtacttctggaaaaggttttgtttctggaaggcaacccgctcttactta
cagtatagggggaagaagttcaaggacagaaaaagccctcctaaatctacggacaagtat
cctttctcgagtggaggagacaatgccatcttaaggttcaccttgctcagaggtattgag
tggttgcagcctcaacgggaagcaataagttcagttcaaaattcttcatcaagcaaactt
actcagccaagactcgtggtatctgccatgcagctgcatcccgtccagggggacttgaag
ctgcccaggaattatttgcccgtcccagtgcccagaagacaaaccagatactttgaagct
ctttacactggaaaagcaatgcatggcttcagagtttataaggccttggggaagaagact
tttcacaaggatgaaaaaggtgatgttggagcagaactcatgaacacctga

>gi568815597r:182546738_182771731|GENSCAN_predicted_peptide_2|735_aa
MESRDHNNPQEGPTSSSGRRAAVEDNHLLIKAVQNEDVDLVQQLLEGGANVNFQEEEGGW
TPLHNAVQMSREDIVELLLRHGADPVLRKKNGATPFILAAIAGSVKLLKLFLSKGADVNE
CDFYGFTAFMEAAVYGKVKALKFLYKRGANVNLRRKTKEDQERLRKGGATALMDAAEKGH
VEVLKILLDEMGADVNACDNMGRNALIHALLSSDDSDVEAITHLLLDHGADVNVRGERGK
TPLILAVEKKHLGLVQRLLEQEHIEINDTDSDGKTALLLAVELKLKKIAELLCKRGASTD
CGDLVMTARRNYDHSLVKVLLSHGAKEDFHPPAEDWKPQSSHWGAALKDLHRIYRPMIGK
LKFFIDEKYKIADTSEGGIYLGFYEKQEVAVKTFCEGSPRAQREVSCLQSSRENSHLVTF
YGSESHRGHLFVCVTLCEQTLEACLDVHRGEDVENEEDEFARNVLSSIFKAVQELHLSCG
YTHQDLQPQNILIDSKKAAHLADFDKSIKWAGDPQEVKRDLEDLGRLVLYVVKKGSISFE
DLKAQSNEEVVQLSPDEETKDLIHRLFHPGEHVRDCLSDLLGHPFFWTWESRYRTLRNVG
NESDIKTRKSESEILRLLQPGPSEHSKSFDKWTTKINECVMKKMNKFYEKRGNFYQNTVG
DLLKFIRNLGEHIDEEKHKKMKLKIGDPSLYFQKTFPDLVIYVYTKLQNTEYRKHFPQTH
SPNKPQCDGAAIKRM

>gi568815597r:182546738_182771731|GENSCAN_predicted_CDS_2|2208_bp
atggagagcagggatcataacaacccccaggagggacccacgtcctccagcggtagaagg
gctgcagtggaagacaatcacttgctgattaaagctgttcaaaacgaagatgttgacctg
gtccagcaattgctggaaggtggagccaatgttaatttccaggaagaggaagggggctgg
acacctctgcataacgcagtacaaatgagcagggaggacattgtggaacttctgcttcgt
catggtgctgaccctgttctgaggaagaagaatggggccacgccttttatcctcgcagcg
attgcggggagcgtgaagctgctgaaacttttcctttctaaaggagcagatgtcaatgag
tgtgatttttatggcttcacagccttcatggaagccgctgtgtatggtaaggtcaaagcc
ctaaaattcctttataagagaggagcaaatgtgaatttgaggcgaaagacaaaggaggat
caagagcggctgaggaaaggaggggccacagctctcatggacgctgctgaaaaaggacac
gtagaggtcttgaagattctccttgatgagatgggggcagatgtaaacgcctgtgacaat
atgggcagaaatgccttgatccatgctctcctgagctctgacgatagtgatgtggaggct
attacgcatctgctgctggaccatggggctgatgtcaatgtgaggggagaaagagggaag
actcccctgatcctggcagtggagaagaagcacttgggtttggtgcagaggcttctggag
caagagcacatagagattaatgacacagacagtgatggcaaaacagcactgctgcttgct
gttgaactcaaactgaagaaaatcgccgagttgctgtgcaaacgtggagccagtacagat
tgtggggatcttgttatgacagcgaggcggaattatgaccattcccttgtgaaggttctt
ctctctcatggagccaaagaagattttcaccctcctgctgaagactggaagcctcagagc
tcacactggggggcagccctgaaggatctccacagaatataccgccctatgattggcaaa
ctcaagttctttattgatgaaaaatacaaaattgctgatacttcagaaggaggcatctac
ctggggttctatgagaagcaagaagtagctgtgaagacgttctgtgagggcagcccacgt
gcacagcgggaagtctcttgtctgcaaagcagccgagagaacagtcacttggtgacattc
tatgggagtgagagccacaggggccacttgtttgtgtgtgtcaccctctgtgagcagact
ctggaagcgtgtttggatgtgcacagaggggaagatgtggaaaatgaggaagatgaattt
gcccgaaatgtcctgtcatctatatttaaggctgttcaagaactacacttgtcctgtgga
tacacccaccaggatctgcaaccacaaaacatcttaatagattctaagaaagctgctcac
ctggcagattttgataagagcatcaagtgggctggagatccacaggaagtcaagagagat
ctagaggaccttggacggctggtcctctatgtggtaaagaagggaagcatctcatttgag
gatctgaaagctcaaagtaatgaagaggtggttcaactttctccagatgaggaaactaag
gacctcattcatcgtctcttccatcctggggaacatgtgagggactgtctgagtgacctg
ctgggtcatcccttcttttggacttgggagagccgctataggacgcttcggaatgtggga
aatgaatccgacatcaaaacacgaaaatctgaaagtgagatcctcagactactgcaacct
gggccttctgaacattccaaaagttttgacaagtggacgactaagattaatgaatgtgtt
atgaaaaaaatgaataagttttatgaaaaaagaggcaatttctaccagaacactgtgggt
gatctgctaaagttcatccggaatttgggagaacacattgatgaagaaaagcataaaaag
atgaaattaaaaattggagacccttccctgtattttcagaagacatttccagatctggtg
atctatgtctacacaaaactacagaacacagaatatagaaagcatttcccccaaacccac
agtccaaacaagcctcagtgtgatggagctgcaataaagcgtatgtga

>gi568815597r:182546738_182771731|GENSCAN_predicted_peptide_3|681_aa
MGTCGVSASDDGADSDNVGPGCQEEVADSITQGLTEPCPAGARTSHTFPAALPTARQIGA
EFKKIGCLMGTRDCEDVNQDQVFCGNKGFEALRMGPSGEEDCLLCELHGSCLYASCKVVA
LQAGPALAEALPLTDGVAAFRAFLKTEFSEENLEFWLACEEFKKTRSTAKLVSKAHRIFE
EFVDVQAPREVNIDFQTREATRKNLQEPSLTCFDQAQGKKSELLEKLEAWWEMVAKRHAA
KPAGNWTSQLSAQAVRFQAAGDSVERLVSAKRTHPHAEGQHLGLLCSHGDKLALSREQLL
ATCSPVQAKPFSEQECRSLIPRIQGWSLRGPVGPPLLPGRASPSGAAGRGAHKSFGAGHL
ARTGLAMGSRPKRRVPGDCFSLAGTRSLLRGRGSSGAEGCGATQQAQTCLRSKKALCASQ
THHPRANSLNTTPLRDFPAVTRGRQLTSTCYRAFASWRTRSLLEPATILPTTCCPAPAAM
CRTLAAFPTTCLERAKEFKTRLGIFLHKSELGCDTGSTGKFEWGSKHSKENRNFSEDVLG
WRESFDLLLSSKNGVAAFHAFLKTEFSEENLEFWLACEEFKKIRSATKLASRAHQIFEEF
ICSEAPKEVNIDHETHELTRMNLQTATATCFDAAQGKTRTLMEKDSYPRFLKSPAYRDLA
AQASAASATLSSCSLDEPSHT

>gi568815597r:182546738_182771731|GENSCAN_predicted_CDS_3|2046_bp
atgggcacgtgtggtgtgagtgcctctgatgatggtgctgatagtgacaatgttggccct
gggtgccaggaggaggtggctgactctatcactcagggcctcaccgaaccatgcccagct
ggggcccgcaccagccacaccttcccagctgcgctgcccaccgccaggcagattggagct
gaatttaagaaaataggctgtctcatggggaccagagactgtgaggacgtgaaccaggac
caggtcttctgtgggaacaaagggtttgaagcactgagaatgggaccctcaggagaggaa
gactgcctcctgtgtgaactgcacggttcttgtctgtatgcctcttgcaaagttgtggcc
ttgcaagcaggacctgccttagctgaagccctcccattgactgatggggtggctgcattc
cgtgccttcttgaagacggagttcagtgaggagaacctggaattctggttggcctgtgag
gagttcaagaagaccaggtcaactgcaaaactggtctctaaggcccataggatctttgag
gagtttgtggatgtgcaggctccacgggaggtaaacattgacttccagacccgagaagcc
acgaggaagaacctgcaggagccatccctgacttgctttgaccaagcccaaggaaaaaaa
tctgagctcctggagaagctggaggcctggtgggaaatggtggctaaaagacatgctgca
aagcccgctggaaattggactagccaattatccgcccaagctgttcgtttccaggcagcg
ggtgactctgtggagagacttgtttcagccaagaggacacatcctcatgcagaggggcag
catctgggcctgctctgcagccatggagacaaacttgctctgtcaagggagcagctgctg
gccacctgctcaccagtccaggctaagcccttttctgagcaggagtgtaggagcctgatc
cctagaatacagggctggagcctgagagggccagtggggccgccgctgcttcctggaaga
gcttcgccttccggagccgcaggccgcggagctcacaagagcttcggagcagggcatctc
gcgcggacagggctggcgatgggcagccgaccaaaacgccgcgttcctggtgactgcttt
tccctggcaggcacgcgttcgctgctccgcggccgcggctcttccggggccgagggctgc
ggcgcgacccagcaggctcagacttgcctgcgatcaaagaaagccctgtgtgcctcccag
acccaccacccccgtgcaaactctctcaacacgacaccgctgcgtgactttcctgctgtt
actagaggtcggcagttgactagcacctgctaccgcgcctttgcttcctggcgcacgcgg
agcctcctggagcctgccaccatcctgcctactacgtgctgccctgcgcccgcagccatg
tgccgcaccctggccgccttccccaccacctgcctggagagagccaaagagttcaagaca
cgtctggggatctttcttcacaaatcagagctgggctgcgatactgggagtactggcaag
ttcgagtggggcagtaaacacagcaaagagaatagaaacttctcagaagatgtgctgggg
tggagagagtcgttcgacctgctgctgagcagtaaaaatggagtggctgccttccacgct
ttcctgaagacagagttcagtgaggagaacctggagttctggctggcctgtgaggagttc
aagaagatccgatcagctaccaagctggcctccagggcacaccagatctttgaggagttc
atttgcagtgaggcccctaaagaggtcaacattgaccatgagacccacgagctgacgagg
atgaacctgcagactgccacagccacatgctttgatgcggctcaggggaagacacgtacc
ctgatggagaaggactcctacccacgcttcctgaagtcgcctgcttaccgggacctggct
gcccaagcctcagccgcctctgccactctgtccagctgcagcctggacgagccctcacac
acctga

>gi568815597r:182546738_182771731|GENSCAN_predicted_peptide_4|139_aa
MERKPPLAPAVSLKEMDEDQKIEKQKATEIAHSHTASPATTDLHLSPRETELKQNPKRNR
IDSAVGAVDNQHLQLEGEIPKMLSNLDGTMVFSEALETLHKAVAARPPPGIALPEAARQM
PAVEPQVDISGMVGAMILQ

>gi568815597r:182546738_182771731|GENSCAN_predicted_CDS_4|420_bp
atggaaaggaagcccccgctggcccctgcagtgtccttgaaggagatggatgaagatcag
aaaattgagaaacagaaagctacagaaattgcccatagtcatactgctagtcctgcaacc
acagacttacatctgagtcctagagagacagaactaaagcagaacccaaaaagaaacaga
attgattcagcagttggagcagtagacaaccaacatctgcaactggaaggggaaatccca
aagatgttgtcaaatttggacggcaccatggtattttcagaggcattagaaacactccac
aaggcagtggcagcaagacctccacctggcatcgcacttcctgaagcagccaggcagatg
ccagcagtggagcctcaagtggacatctctggcatggtgggagccatgatactgcagtga

>gi568815597r:182546738_182771731|GENSCAN_predicted_peptide_5|77_aa
MDNVNEIQEQKLHGAEKEEEKVEEDIAGIWDMSQLPEEISQWLVTVNLQSEKGFGKKVIQ
GDTENSKFIEGQGNIPL

>gi568815597r:182546738_182771731|GENSCAN_predicted_CDS_5|234_bp
atggataatgtcaatgaaatccaagagcagaaacttcatggagctgaaaaggaggaagaa
aaagtagaggaagacattgctgggatatgggacatgtcgcagctcccagaagagatctcc
cagtggctagtcacggtcaatctgcaatcagagaagggttttggcaagaaagtgatccaa
ggagacactgaaaatagcaagttcattgaaggccaagggaacatccccttgtga