GENSCAN 1.0    Date run: 8-Nov-116    Time: 13:32:17

Sequence gi568815591r:140235415_140482526 : 247112 bp : 46.24% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    753    882  130  1  1  108   79    60 0.596   6.85
 1.02 Intr +  29604  29746  143  0  2   34   34    98 0.033  -1.00
 1.03 Term +  30021  30133  113  1  2   80   45    86 0.564   2.22
 1.04 PlyA +  31001  31006    6                               1.05

 2.02 PlyA -  31468  31463    6                               1.05
 2.01 Sngl -  36048  35854  195  0  0   97   38   189 0.612   8.09
 2.00 Prom -  39891  39852   40                              -4.06

 3.00 Prom +  40312  40351   40                               1.54
 3.01 Init +  43694  43763   70  1  1   93   48    55 0.186   3.29
 3.02 Intr +  46352  46536  185  0  2   59   28    71 0.058  -2.29
 3.03 Intr +  57287  57541  255  1  0  124   46   235 0.889  20.54
 3.04 Intr +  59799  59891   93  2  0  -28   75   153 0.175   2.56
 3.05 Intr +  74781  74804   24  2  0  114   89     9 0.150   1.82
 3.06 Intr +  79783  79919  137  0  2   65  119    47 0.254   4.87
 3.07 Term +  81096  81129   34  0  1   67   48    49 0.178  -4.14
 3.08 PlyA +  82071  82076    6                               1.05

 4.16 PlyA -  87410  87405    6                               1.05
 4.15 Term -  88057  88020   38  2  2   85   32    59 0.095  -2.50
 4.14 Intr -  88778  88634  145  2  1   69   57    90 0.129   3.86
 4.13 Intr -  95001  94859  143  1  2   87   53   118 0.915   8.27
 4.12 Intr - 101935 101866   70  1  1   93   25     9 0.009  -6.15
 4.11 Intr - 108149 107998  152  0  2   70   94   124 0.990  11.08
 4.10 Intr - 109849 109802   48  2  0  117   96    61 0.997   8.45
 4.09 Intr - 110556 110455  102  1  0   19   72    89 0.575   0.55
 4.08 Intr - 113353 113212  142  1  1   61   57   152 0.995   9.33
 4.07 Intr - 116037 115859  179  1  2   86  107    73 0.993   8.64
 4.06 Intr - 116732 116648   85  2  1   79   73    93 0.990   6.29
 4.05 Intr - 120350 120254   97  1  1   91   92   -31 0.853  -2.39
 4.04 Intr - 123371 123226  146  2  2   97   67   180 0.959  15.88
 4.03 Intr - 134268 134176   93  0  0   93  103    38 0.972   5.96
 4.02 Intr - 144976 144868  109  0  1   94   99    54 0.987   7.39
 4.01 Init - 147112 147024   89  1  2   87  115   -45 0.644  -1.67
 4.00 Prom - 147756 147717   40                              -5.06

 5.00 Prom + 159991 160030   40                              -2.96
 5.01 Init + 161388 161413   26  2  2   63   80    13 0.282  -3.79
 5.02 Intr + 162891 163101  211  0  1   45   54   137 0.103   4.92
 5.03 Intr + 168654 168803  150  2  0   67  115    50 0.105   5.96
 5.04 Intr + 172221 172433  213  2  0   48   77   159 0.246   9.71
 5.05 Intr + 176460 176643  184  2  1  115   52   306 0.697  29.06
 5.06 Term + 190468 190736  269  2  2   86   44   373 0.553  28.36
 5.07 PlyA + 195285 195290    6                               1.05

 6.02 PlyA - 195745 195740    6                              -0.45
 6.01 Sngl - 200241 199900  342  0  0   62   54   211 0.817  11.30
 6.00 Prom - 204500 204461   40                              -4.66

 7.11 PlyA - 205404 205399    6                               1.05
 7.10 Term - 211419 211307  113  1  2   51   49   167 0.411   7.82
 7.09 Intr - 219315 219189  127  0  1   97   75   183 0.203  18.15
 7.08 Intr - 219819 219681  139  2  1   68   60    70 0.989   2.67
 7.07 Intr - 220486 220376  111  0  0   54   64   104 0.894   4.09
 7.06 Intr - 221452 221238  215  1  2   82   50   315 0.996  24.61
 7.05 Intr - 223819 223593  227  2  2   64  106   218 0.805  18.90
 7.04 Intr - 224522 224293  230  1  2   80   86   103 0.930   6.81
 7.03 Intr - 236597 236469  129  1  0  106  110    61 0.994   9.91
 7.02 Intr - 243572 243513   60  1  0   70   62    78 0.614   1.25
 7.01 Init - 243930 243746  185  0  2   79   91   162 0.962  12.29

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:140235415_140482526|GENSCAN_predicted_peptide_1|128_aa
XHSCPHSDRRRNGQGCCWSTQYLCANQKKRLLLLAAVAQHLGAQCSEGGAPAATLLIKAG
HQPSPDSQKRVTPRTCGFLESTKVPSPTHDLAMRKLKHRAIERFAQVVASDFAPLTTKLY
GLVLSHWA

>gi568815591r:140235415_140482526|GENSCAN_predicted_CDS_1|387_bp
ngacacagctgcccccatagtgaccgacgcaggaacggacagggctgctgctggtccaca
cagtacctttgtgcaaatcagaaaaaaaggttactcctgctggctgctgtagcccagcac
ctaggagcacagtgttctgaaggaggagctcctgctgccacactgctgattaaagccggt
catcagcccagtccagattcacagaagcgtgtgactcccaggacatgtggcttcttggag
agcaccaaggttcccagtcctacccatgatctagcgatgaggaagctgaagcatagagct
attgagcgatttgcccaggtagtggcttcagactttgcgcccttaaccactaagctctac
ggtctggtgctgtcacactgggcatga

>gi568815591r:140235415_140482526|GENSCAN_predicted_peptide_2|64_aa
MPEPQRPPAMGSCAAPASPTSTAPCSTAPSSIDHPRAVECGCTAQDWQAAPPATRVRDPL
DEAS

>gi568815591r:140235415_140482526|GENSCAN_predicted_CDS_2|195_bp
atgcctgagcctcagcgccctccagccatgggctcctgcgcggccccagcctccccgacg
agcaccgccccctgctccacggcgcccagttccatcgaccacccaagggctgtggagtgc
gggtgcactgcacaggactggcaagcagctccacctgcgacccgggtgcgggatccactg
gatgaagccagctag

>gi568815591r:140235415_140482526|GENSCAN_predicted_peptide_3|265_aa
MTGRSLRPPQKQMLLCFLYSLQNSVSTSEYHTPVDSPLRAHRACHADVLQARSLSILPQL
HDCRFPIMPLRKKFWHLSYELTSTPSPCQAATLSRQPERRAKAASTASHRPIKGILKNRT
STTLYGCVGRRRAEKKSRKWDEMNFLATYHPADKDCGFMNTDEPNTPLTMGLDVLKKPVP
IVMELPHLAQELEKLKEDQGKKCRFTETQHACSAATGKGKQERSEAVHSCTPTQSHGFLD
KSPNLDVCSVLARCCSWKSLIGNRN

>gi568815591r:140235415_140482526|GENSCAN_predicted_CDS_3|798_bp
atgactggaagatccctgaggcctccccagaagcagatgctgctatgcttcctgtacagc
ctgcagaactccgtgagcacttctgagtaccacacgcctgtagacagccctctacgtgca
caccgagcttgccatgcagatgtgcttcaagccaggtctcttagcatcctcccacaactg
cacgactgccgctttcccatcatgcctcttagaaagaaattctggcacctgtcctatgag
ttaaccagtactccaagcccatgccaagcagcgaccctgagccgacagccggagcgccgg
gcaaaggccgcctcaacagcctcgcaccggcccatcaaggggatcctgaagaacaggact
tctacgactctctatggttgcgtcggcagaagaagagctgagaaaaaatcccggaagtgg
gatgaaatgaacttcctggcaacatatcatccagcagacaaagactgtggcttcatgaac
actgatgaaccaaacactccccttaccatgggtctggatgtcctgaagaagccagtgccc
attgtcatggagctgccccatctggcccaggagttggagaagctgaaggaggatcagggg
aagaaatgcaggttcacagaaacacagcatgcgtgctctgcagcaacagggaagggaaaa
caggaacgctctgaagcagtacattcctgcacacccacacagagccatggcttcctagac
aagtccccgaacctagatgtctgcagtgtcctggcacgctgctgttcttggaagtcactg
attggaaaccggaactga

>gi568815591r:140235415_140482526|GENSCAN_predicted_peptide_4|545_aa
MAWPNVFQRGSLLSQFSHHHVVVFLLTFFSYSLLHASRKTFSNVKVSISEQWTPSAFNTS
VELPVEIWSSNHLFPSAEKATLFLGTLDTIFLFSYAVVFVFGALTEWLRFYNKWLYCCLW
IVNGLLQSTGWPCVVAVMGNWFGKAGRGVVFGLWSACASVGNILGACLASSVLQYGYEYA
FLVTASVQFAGGIVIFFGLLVSPEEIGLSGIEAEENFEEDSHRPLINGGENEDEYEPNYS
IQDDSSVAQVKAISFYQACCLPGVIPYSLAYACLKLVNYSFFFWLPFYLSNNFGWKEAEA
DKLSIWYDVGGIIGGTLQGFISDVLQKRAPVLALSLLLAVGSLIGYSRSPNDKSINALLM
TVTGFFIGGPSNMISSAISADLGRQELIQRSSEALATVTGIVDGSGSIGAAVGQYLVSLI
RDKLGWMWVFYFFILMVRRCTPAINGRRLSALPRFSDYVGFECILLSSGSWQPRQLSLRA
QRQRTNSLVNGTFAVLFRCLDTESDSLTCWIKCEILHESSSLAIKSETDQRPVIYSQSYV
TITTI

>gi568815591r:140235415_140482526|GENSCAN_predicted_CDS_4|1638_bp
atggcctggccaaatgtttttcaaagagggtctctgctgtcccagttcagccatcatcat
gttgtagtgttcctgctcactttcttcagttattcgttgctccatgcttcacgaaaaaca
tttagcaatgtcaaagtcagtatctctgagcagtggaccccaagtgcttttaacacgtca
gttgagctgcctgtggagatctggagcagcaaccatttgttccccagtgcagagaaagcg
actcttttcctcggcacactggataccattttcctcttctcctatgctgtggtgtttgtc
tttggtgcgctcacagaatggctgcgtttctacaacaaatggctgtactgctgcctgtgg
attgtgaacggcctgctgcagtccactggttggccctgtgtggttgctgttatgggcaac
tggtttgggaaagccggacgaggagttgtttttggtctctggagtgcctgtgcttcggtg
ggcaacattttgggagcgtgcctagcttcttctgttcttcagtatggttatgagtatgcc
tttctggtgacggcgtctgtgcagtttgctggtgggatcgttatcttctttggactcctg
gtgtcaccagaagaaattggtctctcgggtattgaggcagaagaaaactttgaagaagac
tcacacaggccattaattaatggtggtgaaaatgaagacgaatatgagccgaattattca
atccaagatgatagttctgttgcccaagtcaaggcgataagcttctaccaggcatgttgc
cttcctggagtcataccgtactcactggcctacgcctgcttgaagttagtgaattactcc
ttcttcttctggctccccttttatctgagtaacaacttcggctggaaggaggcggaagcc
gacaagctgtccatttggtacgacgttggagggatcataggtggaactttgcaaggcttc
atctctgatgtactacagaagagagcgccggttcttgccctgagtctgcttctggcagtt
gggtccctcatcgggtatagtcgttctccaaatgataagtccatcaatgcccttctgatg
actgttacaggattttttattggtggaccttctaatatgattagttctgctatttctgcg
gacttgggtcgccaggagctcatccaaaggagcagtgaagctttggccactgtcacagga
attgtggatggttcggggagcattggagctgcagtgggccagtatttagtgtctctgatc
cgggacaagctaggatggatgtgggttttctactttttcattctcatggtaagacgctgc
acgcctgcaattaatggccgcagactgtcggcgctgcccaggttctcggactacgtgggt
tttgagtgcattctcctctcctctggctcttggcagcctcgccagctctccctgagggct
cagcggcagcgaacaaatagtcttgttaacgggacttttgcggtgttattccgctgctta
gacaccgagagtgactctcttacttgctggattaagtgtgagatcctgcacgaatcctca
tctctagctataaaatctgagacagaccagagacctgttatttattcgcagagttatgtg
accatcaccactatctaa

>gi568815591r:140235415_140482526|GENSCAN_predicted_peptide_5|350_aa
MVSFKRQARGAATYISPPPSRPGKPLGLDPPPPQQVFGPTHHRLRTRDPQWSPLSSPGSA
RRVSLAPLPASPAASSPLADAKPDTWGQNYTSHQAARQEEEGGTGGLKSKPRRKRKHYLL
TSRGERARQVARTMHFSSSARAADENFDYLFKIILIGDSNVGKTCVVQHFKSGVYTETQQ
NTIGVDFTVRSLDIDGKKVKMQVWDTAGQERFRTITQSYYRSAHAAIIAYDLTRRSTFES
IPHWIHEIEKYGAANVVIMLIGNKCDLWEKRHVLFEDACTLAEKYGLLAVLETSAKESKN
IEEVFVLMAKELIARNSLHLYGESALNGLPLDSSPVLMAQGPSEKTHCTC

>gi568815591r:140235415_140482526|GENSCAN_predicted_CDS_5|1053_bp
atggtgtcattcaagagacaagccaggggggccgccacctacatctccccaccgccctcc
cgacccgggaagcccctcggattggacccgcccccgccccagcaggtcttcggccccacg
catcaccgcttgcgcacccgagatccgcagtggtcgccgctctccagccccggctccgct
cgccgggtcagcttggctccgctgcccgcctcccccgccgccagctcacccctggcggac
gccaaacccgacacctggggccagaactacacttcccaccaagcagcgaggcaggaggaa
gaagggggcactggggggctgaaaagcaaacccagaagaaagcgaaaacattacctgctg
accagcagaggtgagagagcgagacaggtggcaagaaccatgcacttctccagctcagcc
agggcagcagatgagaactttgactatttgttcaagattatcctcattggggattccaat
gtggggaagacgtgtgtggtgcagcatttcaagtctggagtctacactgagacacagcag
aacacgattggagtggactttaccgtgcgttcccttgatattgacggcaaaaaagtgaag
atgcaggtgtgggacacagctggccaggagcgcttccgcaccatcacccaaagctactac
cgcagtgcccacgcagccatcatcgcctatgacctcacccggcggtccacgttcgagtcc
atccctcactggattcatgagatagagaaatatggagctgcaaatgtggtcattatgctg
attggaaataaatgtgacctctgggaaaagcggcacgtcctgttcgaggatgcctgcaca
ctggctgagaagtacggcctcctggccgttttggagacatctgccaaggagtcaaagaac
atagaagaagtcttcgtgctcatggccaaggagctgatcgcgcgcaacagcctgcaccta
tatggggagagtgccctgaacggcctccccctggactccagccccgttcttatggcccag
ggtccaagtgaaaagacccactgcacttgctaa

>gi568815591r:140235415_140482526|GENSCAN_predicted_peptide_6|113_aa
MGHSGGSWGTGSSALRGGFTLVGKDAGAQRVLAGHREGPRGPAGEALLPKACEGSSVVPR
PGTIQGDFSAHISRNFFQASNSVEGARRWIPLWLPSRDLVSWARRQHSRSHPA

>gi568815591r:140235415_140482526|GENSCAN_predicted_CDS_6|342_bp
atggggcacagcggcggctcgtgggggacgggatccagcgctttgagaggtggcttcacg
ctggtggggaaggacgcaggcgcacagcgcgtccttgccgggcaccgcgagggcccgcga
gggcccgcgggggaagccctgctacccaaggcctgcgaagggtccagtgtggtcccgcgc
cccgggaccatacagggagatttcagcgcccacatcagcaggaacttcttccaggcaagc
aactccgtggaaggggctcggagatggatcccgctgtggctcccgagcagggacctggtg
agctgggcacggcgccagcacagccgcagccacccagcctga

>gi568815591r:140235415_140482526|GENSCAN_predicted_peptide_7|511_aa
MAEAATPGTTATTSGAGAAAATAAAASPTPIPTVTAPSLGAGGGGGGSDGSGGGWTKQVT
CSHCVNPRWAAEPSALSAEAWRYFMHGVCKEGDNCRYSHDLSDSPYSVVCKYFQRGYCIY
GDRCRYEHSKPLKQEEATATELTTKSSLAASSSLSSIVGPLVEMNTGEAESRNSNFATVG
AGSEDWVNAIEFVPGQPYCGRTAPSCTEAPLQGSVTKEESEKEQTAVETKKQLCPYAAVG
ECRYGENCVYLHGDSCDMCGLQVLHPMDAAQRSQHIKSCIEAHEKDMELSFAVQRSKDMV
CGICMEVVYEKANPSERRFGILSNCNHTYCLKCIRKWRSAKQFESKIIKSCPECRITSNF
VIPSEYWVEEKEEKQKLILKYKEAMSNKACRYFDEGRGSCPFGGNCFYKHAYPDGRREEP
QRQKVGTSSRYRAQRRNHFWELIEERENSNPFDNDEEEVVTFELGEMLLMLLAAAFISTS
FLSMDGQCFIAGVLNPRTVYKYQYEYQSVAC

>gi568815591r:140235415_140482526|GENSCAN_predicted_CDS_7|1536_bp
atggcggaggctgcaactcccggaacaacagccacaacatcaggagcaggagcggcagcg
gcgacggcggcagcagcctcccccaccccgatccccacagtcaccgccccgtccctgggg
gcgggcggagggggcggcggcagcgacggcagcggcggcggctggactaaacaggtcacc
tgcagtcactgcgtcaacccccgctgggccgcggagccctcagcgctgtccgcggaggcc
tggcggtattttatgcatggggtttgtaaggaaggagacaactgtcgctactcgcatgac
ctctctgacagtccgtatagtgtagtgtgcaagtattttcagcgagggtactgtatttat
ggagaccgctgcagatatgaacatagcaaaccattgaaacaggaagaagcaactgctaca
gagctaactacaaagtcatcccttgctgcttcctcaagtctctcatcgatagttggacca
cttgttgaaatgaatacaggcgaagctgagtcaagaaattcaaactttgcaactgtagga
gcaggttcagaggactgggtgaatgctattgagtttgttcctgggcaaccctactgtggc
cgtactgcgccttcctgcactgaagcacccctgcagggctcagtgaccaaggaagaatca
gagaaagagcaaaccgccgtggagacaaagaagcagctgtgcccctatgctgcagtggga
gagtgccgatacggggagaactgtgtgtatctccacggagattcttgtgacatgtgtggg
ctgcaggtcctgcatccaatggatgctgcccagagatcgcagcatatcaaatcgtgcatt
gaggcccatgagaaggacatggagctctcatttgccgtgcagcgcagcaaggacatggtg
tgtgggatctgcatggaggtggtctatgagaaagccaaccccagtgagcgccgcttcggg
atcctctccaactgcaaccacacctactgtctcaagtgcattcgcaagtggaggagtgct
aagcaatttgagagcaagatcataaagtcctgcccagaatgccggatcacatctaacttt
gtcattccaagtgagtactgggtggaggagaaagaagagaagcagaaactcattctgaaa
tacaaggaggcaatgagcaacaaggcgtgcaggtattttgatgaaggacgtgggagctgc
ccatttggagggaactgtttttacaagcatgcgtaccctgatggccgtagagaggagcca
cagagacagaaagtgggaacatcaagcagataccgggcccaacgaaggaaccacttctgg
gaactcattgaggaaagagagaacagcaacccctttgacaacgatgaagaagaggttgtc
acctttgagctgggcgagatgttgcttatgcttttggctgcagcatttatcagtacttca
tttctttcaatggatggacagtgttttattgcaggggtcctcaaccctcggaccgtgtac
aagtaccagtacgagtaccaatccgtggcctgttag