GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:56:58 Sequence gi568815597r:38774554_38976098 : 201545 bp : 44.55% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8236 8290 55 0 1 83 83 11 0.581 1.55 1.02 Term + 9422 9585 164 0 2 46 49 142 0.764 4.20 1.03 PlyA + 10286 10291 6 1.05 2.00 Prom + 23646 23685 40 -2.76 2.01 Init + 29483 29589 107 1 2 58 78 100 0.155 3.69 2.02 Intr + 29640 29795 156 0 0 49 23 170 0.122 5.73 2.03 Intr + 33239 33373 135 2 0 57 37 138 0.242 5.28 2.04 Intr + 35544 35727 184 0 1 85 39 24 0.309 -3.01 2.05 Intr + 37343 37372 30 1 0 111 99 35 0.916 5.23 2.06 Intr + 41761 42078 318 0 0 117 58 144 0.748 10.45 2.07 Term + 45295 45459 165 0 0 15 48 106 0.106 -2.78 2.08 PlyA + 46260 46265 6 1.05 3.10 PlyA - 46921 46916 6 1.05 3.09 Term - 50925 50825 101 1 2 105 44 67 0.870 2.29 3.08 Intr - 51136 51082 55 1 1 70 105 44 0.059 2.85 3.07 Intr - 65151 65096 56 1 2 121 90 55 0.169 7.70 3.06 Intr - 71534 71386 149 1 2 66 85 94 0.999 6.78 3.05 Intr - 77204 77062 143 2 2 59 106 117 0.995 9.75 3.04 Intr - 77935 77821 115 0 1 83 89 45 0.978 4.55 3.03 Intr - 81354 81155 200 0 2 72 60 93 0.988 3.05 3.02 Intr - 82529 82326 204 2 0 103 66 160 0.972 14.70 3.01 Init - 85093 84857 237 1 0 86 105 281 0.789 27.61 3.00 Prom - 87503 87464 40 -2.46 4.06 PlyA - 88445 88440 6 1.05 4.05 Term - 90161 90117 45 2 0 118 45 60 0.952 1.91 4.04 Intr - 92436 92327 110 1 2 -9 94 155 0.497 6.40 4.03 Intr - 98537 98465 73 2 1 108 84 140 0.959 14.58 4.02 Intr - 99356 99208 149 0 2 91 48 38 0.843 0.05 4.01 Init - 101545 100225 1321 1 1 88 86 390 0.353 30.44 4.00 Prom - 111849 111810 40 -4.96 5.10 PlyA - 111904 111899 6 1.05 5.09 Term - 112130 111951 180 2 0 107 43 89 0.957 3.91 5.08 Intr - 113471 113410 62 0 2 85 117 40 0.879 5.05 5.07 Intr - 118671 118611 61 0 1 63 97 59 0.235 2.61 5.06 Intr - 121516 121416 101 2 2 94 83 4 0.149 0.33 5.05 Intr - 136881 136769 113 2 2 111 82 70 0.632 8.72 5.04 Intr - 144251 144206 46 0 1 131 11 46 0.215 -1.33 5.03 Intr - 144784 144386 399 2 0 91 98 231 0.185 18.88 5.02 Intr - 149357 149264 94 2 1 68 75 63 0.113 2.54 5.01 Intr - 199310 199165 146 0 2 42 50 104 0.282 2.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 51109 51082 28 1 1 79 105 41 0.912 4.66 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:38774554_38976098|GENSCAN_predicted_peptide_1|72_aa MDSPLNYAEASGDERREGTLCWVSRRDSVFAPPTGLAHVSVHTRASFRKRSHRKGSPSGD PGQERRILLINY >gi568815597r:38774554_38976098|GENSCAN_predicted_CDS_1|219_bp atggacagtcctctaaattacgctgaggcatcaggggacgaaaggagagaggggacactt tgctgggtcagccgccgtgactcagtgttcgcgcccccgaccgggctggcacacgtgagt gtgcacacgcgcgcaagcttccggaagcgcagccaccgcaaggggtctccttcgggggat ccaggccaggagcgcaggatcctgctcattaattattga >gi568815597r:38774554_38976098|GENSCAN_predicted_peptide_2|364_aa MRSQGQASAASGLGARRPRPTLPLWPLAPARARAARAAISRAACCPAPRCSRPRATGGNN EAQRHDRGPRLRRRQQRGRRISISEAFLCFGYSPGLRPDLPALRNNAIHIPPGSLEHISG LHFSSEHAYSPVRTALWVRGLEYRIQTAPSLCCNIFRAFSSSTQPDPQPLPTCYGASPGL ESGELSLVFLGPRQGFEQPLVTKQALRSRGGGSSARNRTRCKLLGYPFTRLTPPTQVFPF PQTASPAANKDGCSRPLRAVKGVLLQRLQRRLVRPELELQAPRESHLPPWAVGGLPATLA NCMAVWNNYRVFLNVRHKCIKDDQGSYLYQQADFLHLVGLHQTTKETVIDCTVEEIQASR GGLS >gi568815597r:38774554_38976098|GENSCAN_predicted_CDS_2|1095_bp atgcggagtcaggggcaggcctcggcggcgtccgggctgggggcgaggcgcccacgcccc accctcccgctgtggccgctggcgccggcccgagcgcgcgcagcccgcgcagccatcagc cgcgccgcgtgctgtcccgcgccccgctgctcgcggccgcgggccaccggcggtaataat gaggcacagcgccatgaccgcggccctcggctgcgccgccggcagcagcgcgggcgccgc attagcatctccgaagccttcctgtgtttcggctacagccctggcctgcgtcctgacctg ccagcccttcgcaacaatgccatccacattcctcctgggtctctggaacacatctctggc ttgcacttctcctctgagcacgcttattcacctgtcaggactgccctgtgggtccgaggc ttagaatacaggatccaaactgcaccatccctctgctgcaacatcttcagggctttctca tcatccacacaaccagatcctcagcctcttccgacctgctatggagcaagccctggactg gagtctggagagctgagccttgttttcctgggacctcggcaggggtttgagcagccactg gtgaccaaacaggctctgaggagccgaggcggtggcagcagcgcgaggaacagaacacgc tgcaaactgcttggctacccctttacaagactgacgccgccgacacaagtttttccattt ccccaaacagcctctccggctgccaataaagacggctgctctaggcccttgcgtgctgtc aaaggggtacttctccagcggcttcagaggcggcttgttcgcccagagctggagctgcag gcccctagagagagccaccttcccccttgggctgttggtgggctccctgcgaccctggcc aattgcatggctgtgtggaataattaccgggtctttttgaatgtaagacacaaatgtatc aaggatgatcaaggttcctacctctaccagcaagctgattttttgcatttggtgggcctc catcagaccaccaaggaaacagtcatagattgtactgtggaggaaattcaggcgtcccgc ggaggcctaagctga >gi568815597r:38774554_38976098|GENSCAN_predicted_peptide_3|419_aa MSLQYGAEETPLAGSYGAADSFPKDFGYGVEEEEEEAAAAGGGVGAGAGGGCGPGGADSS KPRILLMGLRRSGKSSIQKVVFHKMSPNETLFLESTNKIYKDDISNSSFVNFQIWDFPGQ MDFFDPTFDYEMIFRGTGALIYVIDAQDDYMEALTRLHITVSKAYKVNPDMNFEVFIHKV DGLSDDHKIETQRDIHQRANDDLADAGLEKLHLSFYLTSIYDHSIFEAFSKVVQKLIPQL PTLENLLNIFISNSGIEKAFLFDVVSKIYIATDSSPVDMQSYELCCDMIDVVIDVSCIYG LKEDGSGSAYDKESMAIIKLNNTTVLYLKEVTKFLALVCILREESFERKGLIDYNFHCFR KAIHEVFESFWNRAEEAMSHVLIGRKGERTLALSWPPNLPGSSPSTSIYRISPNGHAGS >gi568815597r:38774554_38976098|GENSCAN_predicted_CDS_3|1260_bp atgtccctgcagtacggggcggaggagacgcccctcgccggcagttacggcgcggccgat tcgtttccaaaggacttcggctacggcgtggaggaggaggaagaggaggcggcggcggcg ggcggaggggttggggcaggggcaggcggtggctgtggtccggggggcgctgacagctcc aagccgaggattctgctcatgggactccggcgcagcggcaagtcctccatccagaaggtg gtgtttcataagatgtcacccaacgagaccctctttttggaaagtaccaacaagatttat aaggatgacatttccaatagctcctttgtgaatttccagatatgggattttcctgggcaa atggacttttttgacccaacctttgactatgagatgatcttcaggggaacaggagcattg atatacgtcattgacgcacaggatgactacatggaggctttaacaagacttcacattact gtttctaaagcctacaaagttaacccagacatgaattttgaggtttttattcacaaagtt gatggtctgtctgatgatcacaaaatagaaacacagagggacattcatcaaagggccaat gatgaccttgcagatgctgggctagaaaaactccatcttagcttttatctgactagtatc tatgaccattcaatatttgaagcctttagtaaggtggtgcagaaactcattccacaactg ccgaccttggaaaacctattaaatatctttatatcaaattcaggtattgaaaaagctttt ctctttgatgttgtcagcaaaatctacattgcaacagacagttcccctgtggatatgcaa tcttatgaactttgctgtgacatgatcgatgttgtaattgatgtgtcttgtatatatggg ttaaaggaagatggaagtggaagtgcttatgacaaagaatctatggcaattatcaagctg aataatacaactgtcctttatttaaaggaggtgactaaatttttggcactggtctgcatt ctaagggaagaaagctttgaaagaaaaggtttaatagactacaacttccactgtttccga aaagctattcatgaggtttttgagagcttttggaacagggctgaggaagccatgagtcac gtcctcattggccgtaaaggtgaacgaaccctagctctgtcttggcctccaaacctaccc ggttctagcccctccacaagtatctaccgcatcagccccaacgggcatgcagggagctag >gi568815597r:38774554_38976098|GENSCAN_predicted_peptide_4|565_aa MGDWNLLGDTLEEVHIHSTMIGKIWLTILFIFRMLVLGVAAEDVWNDEQSGFICNTEQPG CRNVCYDQAFPISLIRYWVLQVIFVSSPSLVYMGHALYRLRVLEEERQRMKAQLRVELEE VEFEMPRDRRRLEQELCQLEKRKLNKAPLRGTLLCTYVIHIFTRSVVEVGFMIGQYLLYG FHLEPLFKCHGHPCPNIIDCFVSRPTEKTIFLLFMQSIATISLFLNILEIFHLGFKKIKR GLWGKYKLKKEHNEFHANKAKQNVAKYQSTSANSLKRLPSAPDYNLLVEKQTHTAVYPSL NSSSVFQPNPDNHSVNDEKCILDEQETVLSNEISTLSTSCSHFQHISSNNNKDTHKIFGK ELNGNQLMEKRETEGKDSKRNYYSRGHRSIPGVAIDGENNMRQSPQTVFSLPANCDWKPR WLRATWGSSTEHENRGSPPKVPGSKATASSLLLILQRPTSSQPRLKETPKIKAEAKIYDS KHPPRLLQSTAADSKREQFRRYLEKSGVLDTLTKGAATPENPEIELLRLELAEMKEKYEA IVEENKKLKAKLAQYEPPQEEKRAE >gi568815597r:38774554_38976098|GENSCAN_predicted_CDS_4|1698_bp atgggggactggaatctccttggagatactctggaggaagttcacatccactccaccatg attggaaagatctggctcaccatcctgttcatatttcgaatgcttgttctgggtgtagca gctgaagatgtctggaatgatgagcagtctggcttcatctgcaatacagaacaaccaggc tgcagaaatgtatgctacgaccaggcctttcctatctccctcattagatactgggttctg caggtgatatttgtgtcttcaccatccctggtctacatgggccatgcattgtaccgactg agagttcttgaggaagagaggcaaaggatgaaagctcagttaagagtagaactggaggag gtagagtttgaaatgcctagggatcggaggagattggagcaagagctttgtcagctggag aaaaggaaactaaataaagctccactcagaggaaccttgctttgcacttatgtgatacac attttcactcgctctgtggttgaagttggattcatgattggacagtaccttttatatgga tttcacttagagccgctatttaagtgccatggccacccgtgtccaaatataatcgactgt tttgtctcaagaccaacagaaaagacaatattcctattatttatgcaatctatagccact atttcacttttcttaaacattcttgaaattttccacctaggttttaaaaagattaaaaga gggctttggggaaaatacaagttgaagaaggaacataatgaattccatgcaaacaaggca aaacaaaatgtagccaaataccagagcacatctgcaaattcactgaagcgactcccttct gcccctgattataatctgttagtggaaaagcaaacacacactgcagtgtaccctagttta aattcatcttctgtattccagccaaatcctgacaatcatagtgtaaatgatgagaaatgc attttggatgaacaggaaactgtactttctaatgagatttccacacttagtactagttgt agtcattttcaacacatcagttcaaacaataacaaagacactcataaaatatttggaaaa gaacttaatggtaaccagttaatggaaaaaagagaaactgaaggcaaagacagcaaaagg aactactactctagaggtcaccgttctattccaggtgttgctatagatggagagaacaac atgaggcagtcaccccaaacagttttctccttgccagctaactgcgattggaaaccgcgg tggcttagagctacatggggttcctctacagaacatgaaaaccgggggtcacctcctaaa gtgcctggctcaaaagctactgcaagctcgttactgctcatcctccagaggcccacatca agtcagccacgactcaaggagactccaaagataaaagctgaagccaaaatatatgattct aaacaccctcctcggctactgcaaagcactgccgccgactcgaagcgtgagcagttccgg aggtacttggagaagtcgggggtgctggacacgctgaccaagggagctgctactccagaa aatccagaaatagagctgcttcgcctagaactggccgaaatgaaagagaagtatgaagct attgtagaagaaaataaaaaactgaaagcaaagcttgctcagtatgaaccacctcaggag gagaagcgtgctgaatag >gi568815597r:38774554_38976098|GENSCAN_predicted_peptide_5|400_aa XKCHNVTPVSGLLACECQCIPLEVQMPVCSDTDVLLSMSSPFCASFCALDENFSQQCHPV LICVCEAFQKRVGSNLLQIRGKALESSRCARQPREERQPEDLGPPAVPWDSCPSGEEGGP RTMAAVHDLEMESMNLNMGREMKEELEEEEKMREDGGGKDRAKSKKVHRIVSKWMLPEKS RGTYLERANCFPPPVFIISISLAEVNGERGGHRAAFEAFPILTVTANPGVQHILGNLCMQ LVLGIPLEMVHKGLRVGLVYLAGVIAGSLASSIFDPLRYLVGASGGVYALMGGYFMNVLV NFQEMIPAFGIFRLLIIILIIVLDMGFALYRRFFVPEDGSPVSFAAHIAGGFAGMSIGYT VFSCFDKALLKDPRFWIAIAAYLACVLFAVFFNIFLSPAN >gi568815597r:38774554_38976098|GENSCAN_predicted_CDS_5|1203_bp nccaaatgccacaatgttacaccagttagtggcctgctggcatgtgagtgccagtgcatt cctcttgaagtccagatgcctgtgtgttccgacaccgatgtgctcctctcaatgtccagc cccttctgtgcctccttctgtgccttggatgagaacttctcgcagcagtgccacccagtt ttgatctgtgtatgcgaggcctttcaaaaaagagtgggcagcaacttgctgcagataaga ggaaaggccttggaaagcagtcgttgcgccagacagcccagggaagagcggcagcctgag gacctagggccacctgctgttccctgggattcatgtccttctggggaggagggaggaccc aggacaatggctgctgttcatgatctggagatggagagcatgaatctgaatatggggaga gagatgaaagaagagctggaggaagaggagaaaatgagagaggatgggggaggtaaagat cgggccaagagtaaaaaggtccacaggattgtctcaaaatggatgctgcccgaaaagtcc cgaggaacatacttggagagagctaactgcttcccgcctcccgtgttcatcatctccatc agcctggccgaggtgaatggggagcggggtgggcaccgggctgcctttgaggctttccct atactcacggtcactgccaacccaggagttcagcacatcttggggaatctttgtatgcag cttgttttgggtattcccttggaaatggtccacaaaggcctccgtgtggggctggtgtac ctggcaggagtgattgcagggtcccttgccagctccatctttgacccactcagatatctt gtgggagcttcaggaggagtctatgctctgatgggaggctattttatgaatgttctggtg aattttcaagaaatgattcctgcctttggaattttcagactgctgatcatcatcctgata attgtgttggacatgggatttgctctctatagaaggttctttgttcctgaagatgggtct ccggtgtcttttgcagctcacattgcaggtggatttgctggaatgtccattggctacacg gtgtttagctgctttgataaagcactgctgaaagatccaaggttttggatagcaattgct gcatatttagcttgtgtcttatttgctgtgtttttcaacattttcctatctccagcaaac tga