GENSCAN 1.0	Date run: 6-Nov-116	Time: 08:56:51

Sequence gi568815597r:38739556_38959646 : 220091 bp : 44.86% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7466   7535   70  2  1   59  100    89 0.254   6.38
 1.02 Intr +   7631   7895  265  1  1   72   47   154 0.621   6.79
 1.03 Term +  10829  10884   56  0  2  100   38     7 0.242  -5.38
 1.04 PlyA +  11469  11474    6                               1.05

 2.05 PlyA -  11962  11957    6                               1.05
 2.04 Term -  13774  13657  118  0  1   63   37    87 0.571  -0.89
 2.03 Intr -  15068  14993   76  0  1   93   37    88 0.647   2.77
 2.02 Intr -  17050  16869  182  0  2   66   98    70 0.969   5.31
 2.01 Init -  17209  17148   62  1  2   98   59    89 0.984   7.72
 2.00 Prom -  24004  23965   40                              -7.46

 3.00 Prom +  25876  25915   40                              -6.26
 3.01 Sngl +  29038  29997  960  0  0   70   41   344 0.969  24.56
 3.02 PlyA +  30048  30053    6                               1.05

 4.00 Prom +  31314  31353   40                              -2.46
 4.01 Init +  32195  32235   41  1  2   77  109    57 0.781   6.37
 4.02 Intr +  32806  32867   62  1  2  128   50    15 0.087   0.08
 4.03 Term +  44420  44583  164  0  2   46   49   142 0.739   4.20
 4.04 PlyA +  45284  45289    6                               1.05

 5.00 Prom +  58644  58683   40                              -2.76
 5.01 Init +  64481  64587  107  1  2   58   78   100 0.155   3.69
 5.02 Intr +  64638  64793  156  0  0   49   23   170 0.123   5.73
 5.03 Intr +  68237  68371  135  2  0   57   37   138 0.242   5.28
 5.04 Intr +  70542  70725  184  0  1   85   39    24 0.309  -3.01
 5.05 Intr +  72341  72370   30  1  0  111   99    35 0.916   5.23
 5.06 Intr +  76759  77076  318  0  0  117   58   144 0.748  10.45
 5.07 Term +  80293  80457  165  0  0   15   48   106 0.106  -2.78
 5.08 PlyA +  81258  81263    6                               1.05

 6.10 PlyA -  81919  81914    6                               1.05
 6.09 Term -  85923  85823  101  1  2  105   44    67 0.870   2.29
 6.08 Intr -  86134  86080   55  1  1   70  105    44 0.059   2.85
 6.07 Intr - 100149 100094   56  1  2  121   90    55 0.169   7.70
 6.06 Intr - 106532 106384  149  1  2   66   85    94 0.999   6.78
 6.05 Intr - 112202 112060  143  2  2   59  106   117 0.995   9.75
 6.04 Intr - 112933 112819  115  0  1   83   89    45 0.978   4.55
 6.03 Intr - 116352 116153  200  0  2   72   60    93 0.988   3.05
 6.02 Intr - 117527 117324  204  2  0  103   66   160 0.972  14.70
 6.01 Init - 120091 119855  237  1  0   86  105   281 0.789  27.61
 6.00 Prom - 122501 122462   40                              -2.46

 7.06 PlyA - 123443 123438    6                               1.05
 7.05 Term - 125159 125115   45  2  0  118   45    60 0.952   1.91
 7.04 Intr - 127434 127325  110  1  2   -9   94   155 0.497   6.40
 7.03 Intr - 133535 133463   73  2  1  108   84   140 0.959  14.58
 7.02 Intr - 134354 134206  149  0  2   91   48    38 0.843   0.05
 7.01 Init - 136543 135223 1321  1  1   88   86   390 0.353  30.44
 7.00 Prom - 146847 146808   40                              -4.96

 8.09 PlyA - 146902 146897    6                               1.05
 8.08 Term - 147128 146949  180  2  0  107   43    89 0.957   3.91
 8.07 Intr - 148469 148408   62  0  2   85  117    40 0.879   5.05
 8.06 Intr - 153669 153609   61  0  1   63   97    59 0.235   2.61
 8.05 Intr - 156514 156414  101  2  2   94   83     4 0.149   0.33
 8.04 Intr - 171879 171767  113  2  2  111   82    70 0.632   8.72
 8.03 Intr - 179249 179204   46  0  1  131   11    46 0.215  -1.33
 8.02 Intr - 179782 179384  399  2  0   91   98   231 0.185  18.88
 8.01 Intr - 184355 184262   94  2  1   68   75    63 0.113   2.54

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
S.001 Init -  86107  86080   28  1  1   79  105    41 0.912   4.66

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:38739556_38959646|GENSCAN_predicted_peptide_1|130_aa
XRLPCTFKGAIPRKWRALQSPELKLEGAGRGREAFDVIHWKLDAGAEAFCSRLSPFISAK
SDNSLISPAHLEDRSFTTMILDSAVEEKRMVLLLLILIKLARKLSGFSEVERDSRFCSSI
SRSQAHTTVF
>gi568815597r:38739556_38959646|GENSCAN_predicted_CDS_1|393_bp
nngaggctgccgtgcaccttcaaaggagccattccccgcaagtggagggctctccagagc
ccggagctcaagctggaaggggctggccgtggcagagaggcctttgatgtgattcactgg
aagcttgatgctggcgcggaggcattctgtagcaggctgagccctttcatctcagcaaag
agcgacaactccctaatatcaccggctcacctggaagacaggtcttttacaacaatgatt
ctggactcagctgtcgaagaaaaacggatggttttgcttttactcatcttgatcaaatta
gcccggaagctgtctggcttctcggaggtggagagggattcccgattttgttccagtatc
agcaggtcacaagctcatacaactgtattttga
>gi568815597r:38739556_38959646|GENSCAN_predicted_peptide_2|145_aa
MPDIGVEIRKAEDCEGKMDGNMGCSAEAEALGHRAKHICLKWKSEKLKRGPRESRAVLDL
SRKVCKLQNDCCLPSALHISHSRFIDTPFKAVPPNEKSMNYVVVQLGALWVPGMADSKAE
CVFPVLADADEEERERRGCVQQFFS
>gi568815597r:38739556_38959646|GENSCAN_predicted_CDS_2|438_bp
atgcctgatattggagttgagatccggaaggcagaagattgtgaaggaaagatggatgga
aatatggggtgttctgcagaagccgaggcccttggccacagagctaaacacatatgcttg
aagtggaagtcagagaaactgaagagaggtccacgagaaagtagagcagtcttggacctc
agccgcaaggtgtgcaagctacaaaatgactgctgcctcccttctgccctccacatctcc
cacagcaggttcatagatacacccttcaaggctgtccctccaaatgagaagagcatgaac
tatgtggttgtacagcttggagcactgtgggtccctggtatggcagacagcaaagcagaa
tgtgtgttccctgtgctggcagatgctgatgaggaagagagggagagacggggatgcgtg
cagcagttcttctcctaa
>gi568815597r:38739556_38959646|GENSCAN_predicted_peptide_3|319_aa
MDKFLNTYTLPRLNQEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEEL
VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKTLA
NRIQQHIKKLIHHDQVGFVPGMQGWFNIGKSINVIQHINRTNNKNHTIISIDAEKAFDKI
QQPFMLKTLNKLGINGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTREGCPLSPLLF
NIVLEVLARAIRQEKEIKGIQLGKEEVNLSLFADDMIVYLENPIFSAQNLLKLIGNFSKV
SGYKINVQKSQAFLYNNNR
>gi568815597r:38739556_38959646|GENSCAN_predicted_CDS_3|960_bp
atggataaattcctcaacacatacaccctcccaagactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaa
aaaagtccaggaccagatggattcacagctgaattctaccagaggtacaaggaggagctg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca
ttttatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagag
aattttagaccaatatccctgatgaacatcgatgcaaaaatcctcaataaaacactggca
aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcgtccct
gggatgcaaggctggttcaacataggaaaatcaataaatgtaatccagcatataaacaga
accaacaacaaaaaccatacgattatctcaatagatgcagaaaaggcctttgacaaaatt
caacaacctttcatgctaaaaaccctcaataaattaggtattaatgggacgtatctcaaa
ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg
gaagcattccctttgaaaactggcacaagagagggatgccctctctcaccactcctattc
aacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtatt
caattaggaaaagaggaagtcaacttgtccctgtttgcagatgacatgatcgtatatcta
gaaaaccccatcttctcagcccaaaatctccttaagctgataggcaacttcagcaaagtc
tcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaacaataatagataa
>gi568815597r:38739556_38959646|GENSCAN_predicted_peptide_4|88_aa
MKAEEWLLDVAISRSLGGHFVFLEANLAYVYKLPPLCWVSRRDSVFAPPTGLAHVSVHTR
ASFRKRSHRKGSPSGDPGQERRILLINY
>gi568815597r:38739556_38959646|GENSCAN_predicted_CDS_4|267_bp
atgaaagcagaagagtggctgttggatgtggccattagcagaagtctagggggtcacttt
gtgtttctagaggcaaatctagcttatgtgtacaaacttccaccactttgctgggtcagc
cgccgtgactcagtgttcgcgcccccgaccgggctggcacacgtgagtgtgcacacgcgc
gcaagcttccggaagcgcagccaccgcaaggggtctccttcgggggatccaggccaggag
cgcaggatcctgctcattaattattga
>gi568815597r:38739556_38959646|GENSCAN_predicted_peptide_5|364_aa
MRSQGQASAASGLGARRPRPTLPLWPLAPARARAARAAISRAACCPAPRCSRPRATGGNN
EAQRHDRGPRLRRRQQRGRRISISEAFLCFGYSPGLRPDLPALRNNAIHIPPGSLEHISG
LHFSSEHAYSPVRTALWVRGLEYRIQTAPSLCCNIFRAFSSSTQPDPQPLPTCYGASPGL
ESGELSLVFLGPRQGFEQPLVTKQALRSRGGGSSARNRTRCKLLGYPFTRLTPPTQVFPF
PQTASPAANKDGCSRPLRAVKGVLLQRLQRRLVRPELELQAPRESHLPPWAVGGLPATLA
NCMAVWNNYRVFLNVRHKCIKDDQGSYLYQQADFLHLVGLHQTTKETVIDCTVEEIQASR
GGLS
>gi568815597r:38739556_38959646|GENSCAN_predicted_CDS_5|1095_bp
atgcggagtcaggggcaggcctcggcggcgtccgggctgggggcgaggcgcccacgcccc
accctcccgctgtggccgctggcgccggcccgagcgcgcgcagcccgcgcagccatcagc
cgcgccgcgtgctgtcccgcgccccgctgctcgcggccgcgggccaccggcggtaataat
gaggcacagcgccatgaccgcggccctcggctgcgccgccggcagcagcgcgggcgccgc
attagcatctccgaagccttcctgtgtttcggctacagccctggcctgcgtcctgacctg
ccagcccttcgcaacaatgccatccacattcctcctgggtctctggaacacatctctggc
ttgcacttctcctctgagcacgcttattcacctgtcaggactgccctgtgggtccgaggc
ttagaatacaggatccaaactgcaccatccctctgctgcaacatcttcagggctttctca
tcatccacacaaccagatcctcagcctcttccgacctgctatggagcaagccctggactg
gagtctggagagctgagccttgttttcctgggacctcggcaggggtttgagcagccactg
gtgaccaaacaggctctgaggagccgaggcggtggcagcagcgcgaggaacagaacacgc
tgcaaactgcttggctacccctttacaagactgacgccgccgacacaagtttttccattt
ccccaaacagcctctccggctgccaataaagacggctgctctaggcccttgcgtgctgtc
aaaggggtacttctccagcggcttcagaggcggcttgttcgcccagagctggagctgcag
gcccctagagagagccaccttcccccttgggctgttggtgggctccctgcgaccctggcc
aattgcatggctgtgtggaataattaccgggtctttttgaatgtaagacacaaatgtatc
aaggatgatcaaggttcctacctctaccagcaagctgattttttgcatttggtgggcctc
catcagaccaccaaggaaacagtcatagattgtactgtggaggaaattcaggcgtcccgc
ggaggcctaagctga
>gi568815597r:38739556_38959646|GENSCAN_predicted_peptide_6|419_aa
MSLQYGAEETPLAGSYGAADSFPKDFGYGVEEEEEEAAAAGGGVGAGAGGGCGPGGADSS
KPRILLMGLRRSGKSSIQKVVFHKMSPNETLFLESTNKIYKDDISNSSFVNFQIWDFPGQ
MDFFDPTFDYEMIFRGTGALIYVIDAQDDYMEALTRLHITVSKAYKVNPDMNFEVFIHKV
DGLSDDHKIETQRDIHQRANDDLADAGLEKLHLSFYLTSIYDHSIFEAFSKVVQKLIPQL
PTLENLLNIFISNSGIEKAFLFDVVSKIYIATDSSPVDMQSYELCCDMIDVVIDVSCIYG
LKEDGSGSAYDKESMAIIKLNNTTVLYLKEVTKFLALVCILREESFERKGLIDYNFHCFR
KAIHEVFESFWNRAEEAMSHVLIGRKGERTLALSWPPNLPGSSPSTSIYRISPNGHAGS
>gi568815597r:38739556_38959646|GENSCAN_predicted_CDS_6|1260_bp
atgtccctgcagtacggggcggaggagacgcccctcgccggcagttacggcgcggccgat
tcgtttccaaaggacttcggctacggcgtggaggaggaggaagaggaggcggcggcggcg
ggcggaggggttggggcaggggcaggcggtggctgtggtccggggggcgctgacagctcc
aagccgaggattctgctcatgggactccggcgcagcggcaagtcctccatccagaaggtg
gtgtttcataagatgtcacccaacgagaccctctttttggaaagtaccaacaagatttat
aaggatgacatttccaatagctcctttgtgaatttccagatatgggattttcctgggcaa
atggacttttttgacccaacctttgactatgagatgatcttcaggggaacaggagcattg
atatacgtcattgacgcacaggatgactacatggaggctttaacaagacttcacattact
gtttctaaagcctacaaagttaacccagacatgaattttgaggtttttattcacaaagtt
gatggtctgtctgatgatcacaaaatagaaacacagagggacattcatcaaagggccaat
gatgaccttgcagatgctgggctagaaaaactccatcttagcttttatctgactagtatc
tatgaccattcaatatttgaagcctttagtaaggtggtgcagaaactcattccacaactg
ccgaccttggaaaacctattaaatatctttatatcaaattcaggtattgaaaaagctttt
ctctttgatgttgtcagcaaaatctacattgcaacagacagttcccctgtggatatgcaa
tcttatgaactttgctgtgacatgatcgatgttgtaattgatgtgtcttgtatatatggg
ttaaaggaagatggaagtggaagtgcttatgacaaagaatctatggcaattatcaagctg
aataatacaactgtcctttatttaaaggaggtgactaaatttttggcactggtctgcatt
ctaagggaagaaagctttgaaagaaaaggtttaatagactacaacttccactgtttccga
aaagctattcatgaggtttttgagagcttttggaacagggctgaggaagccatgagtcac
gtcctcattggccgtaaaggtgaacgaaccctagctctgtcttggcctccaaacctaccc
ggttctagcccctccacaagtatctaccgcatcagccccaacgggcatgcagggagctag
>gi568815597r:38739556_38959646|GENSCAN_predicted_peptide_7|565_aa
MGDWNLLGDTLEEVHIHSTMIGKIWLTILFIFRMLVLGVAAEDVWNDEQSGFICNTEQPG
CRNVCYDQAFPISLIRYWVLQVIFVSSPSLVYMGHALYRLRVLEEERQRMKAQLRVELEE
VEFEMPRDRRRLEQELCQLEKRKLNKAPLRGTLLCTYVIHIFTRSVVEVGFMIGQYLLYG
FHLEPLFKCHGHPCPNIIDCFVSRPTEKTIFLLFMQSIATISLFLNILEIFHLGFKKIKR
GLWGKYKLKKEHNEFHANKAKQNVAKYQSTSANSLKRLPSAPDYNLLVEKQTHTAVYPSL
NSSSVFQPNPDNHSVNDEKCILDEQETVLSNEISTLSTSCSHFQHISSNNNKDTHKIFGK
ELNGNQLMEKRETEGKDSKRNYYSRGHRSIPGVAIDGENNMRQSPQTVFSLPANCDWKPR
WLRATWGSSTEHENRGSPPKVPGSKATASSLLLILQRPTSSQPRLKETPKIKAEAKIYDS
KHPPRLLQSTAADSKREQFRRYLEKSGVLDTLTKGAATPENPEIELLRLELAEMKEKYEA
IVEENKKLKAKLAQYEPPQEEKRAE
>gi568815597r:38739556_38959646|GENSCAN_predicted_CDS_7|1698_bp
atgggggactggaatctccttggagatactctggaggaagttcacatccactccaccatg
attggaaagatctggctcaccatcctgttcatatttcgaatgcttgttctgggtgtagca
gctgaagatgtctggaatgatgagcagtctggcttcatctgcaatacagaacaaccaggc
tgcagaaatgtatgctacgaccaggcctttcctatctccctcattagatactgggttctg
caggtgatatttgtgtcttcaccatccctggtctacatgggccatgcattgtaccgactg
agagttcttgaggaagagaggcaaaggatgaaagctcagttaagagtagaactggaggag
gtagagtttgaaatgcctagggatcggaggagattggagcaagagctttgtcagctggag
aaaaggaaactaaataaagctccactcagaggaaccttgctttgcacttatgtgatacac
attttcactcgctctgtggttgaagttggattcatgattggacagtaccttttatatgga
tttcacttagagccgctatttaagtgccatggccacccgtgtccaaatataatcgactgt
tttgtctcaagaccaacagaaaagacaatattcctattatttatgcaatctatagccact
atttcacttttcttaaacattcttgaaattttccacctaggttttaaaaagattaaaaga
gggctttggggaaaatacaagttgaagaaggaacataatgaattccatgcaaacaaggca
aaacaaaatgtagccaaataccagagcacatctgcaaattcactgaagcgactcccttct
gcccctgattataatctgttagtggaaaagcaaacacacactgcagtgtaccctagttta
aattcatcttctgtattccagccaaatcctgacaatcatagtgtaaatgatgagaaatgc
attttggatgaacaggaaactgtactttctaatgagatttccacacttagtactagttgt
agtcattttcaacacatcagttcaaacaataacaaagacactcataaaatatttggaaaa
gaacttaatggtaaccagttaatggaaaaaagagaaactgaaggcaaagacagcaaaagg
aactactactctagaggtcaccgttctattccaggtgttgctatagatggagagaacaac
atgaggcagtcaccccaaacagttttctccttgccagctaactgcgattggaaaccgcgg
tggcttagagctacatggggttcctctacagaacatgaaaaccgggggtcacctcctaaa
gtgcctggctcaaaagctactgcaagctcgttactgctcatcctccagaggcccacatca
agtcagccacgactcaaggagactccaaagataaaagctgaagccaaaatatatgattct
aaacaccctcctcggctactgcaaagcactgccgccgactcgaagcgtgagcagttccgg
aggtacttggagaagtcgggggtgctggacacgctgaccaagggagctgctactccagaa
aatccagaaatagagctgcttcgcctagaactggccgaaatgaaagagaagtatgaagct
attgtagaagaaaataaaaaactgaaagcaaagcttgctcagtatgaaccacctcaggag
gagaagcgtgctgaatag
>gi568815597r:38739556_38959646|GENSCAN_predicted_peptide_8|351_aa
DENFSQQCHPVLICVCEAFQKRVGSNLLQIRGKALESSRCARQPREERQPEDLGPPAVPW
DSCPSGEEGGPRTMAAVHDLEMESMNLNMGREMKEELEEEEKMREDGGGKDRAKSKKVHR
IVSKWMLPEKSRGTYLERANCFPPPVFIISISLAEVNGERGGHRAAFEAFPILTVTANPG
VQHILGNLCMQLVLGIPLEMVHKGLRVGLVYLAGVIAGSLASSIFDPLRYLVGASGGVYA
LMGGYFMNVLVNFQEMIPAFGIFRLLIIILIIVLDMGFALYRRFFVPEDGSPVSFAAHIA
GGFAGMSIGYTVFSCFDKALLKDPRFWIAIAAYLACVLFAVFFNIFLSPAN
>gi568815597r:38739556_38959646|GENSCAN_predicted_CDS_8|1056_bp
gatgagaacttctcgcagcagtgccacccagttttgatctgtgtatgcgaggcctttcaa
aaaagagtgggcagcaacttgctgcagataagaggaaaggccttggaaagcagtcgttgc
gccagacagcccagggaagagcggcagcctgaggacctagggccacctgctgttccctgg
gattcatgtccttctggggaggagggaggacccaggacaatggctgctgttcatgatctg
gagatggagagcatgaatctgaatatggggagagagatgaaagaagagctggaggaagag
gagaaaatgagagaggatgggggaggtaaagatcgggccaagagtaaaaaggtccacagg
attgtctcaaaatggatgctgcccgaaaagtcccgaggaacatacttggagagagctaac
tgcttcccgcctcccgtgttcatcatctccatcagcctggccgaggtgaatggggagcgg
ggtgggcaccgggctgcctttgaggctttccctatactcacggtcactgccaacccagga
gttcagcacatcttggggaatctttgtatgcagcttgttttgggtattcccttggaaatg
gtccacaaaggcctccgtgtggggctggtgtacctggcaggagtgattgcagggtccctt
gccagctccatctttgacccactcagatatcttgtgggagcttcaggaggagtctatgct
ctgatgggaggctattttatgaatgttctggtgaattttcaagaaatgattcctgccttt
ggaattttcagactgctgatcatcatcctgataattgtgttggacatgggatttgctctc
tatagaaggttctttgttcctgaagatgggtctccggtgtcttttgcagctcacattgca
ggtggatttgctggaatgtccattggctacacggtgtttagctgctttgataaagcactg
ctgaaagatccaaggttttggatagcaattgctgcatatttagcttgtgtcttatttgct
gtgtttttcaacattttcctatctccagcaaactga