GENSCAN 1.0    Date run: 4-Nov-116    Time: 20:02:19

Sequence gi568815590f:61478220_61679668 : 201449 bp : 38.54% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1324   1434  111  0  0   76   38    72 0.453   0.66
 1.02 Intr +   1584   1791  208  2  1   62    1   156 0.180   2.03
 1.03 Intr +   3757   4002  246  2  0   29  100   137 0.337   5.51
 1.04 Intr +   4529   5307  779  0  2   -2   41   354 0.022  11.96
 1.05 Term +   6004   7236 1233  0  0   23   43   324 0.066  12.00
 1.06 PlyA +   7881   7886    6                               1.05

 2.03 PlyA -   9238   9233    6                               1.05
 2.02 Term -  18274  18065  210  1  0   82   45   201 0.968  11.51
 2.01 Init -  26950  26786  165  1  0   58   31   194 0.885  10.38
 2.00 Prom -  38479  38440   40                              -5.25

 3.05 PlyA -  38506  38501    6                               1.05
 3.04 Term -  39442  39263  180  1  0   64   54   172 0.976   8.03
 3.03 Intr -  39904  39813   92  2  2   69   81    73 0.972   3.49
 3.02 Intr -  47893  47758  136  1  1   25  113   170 0.965  12.42
 3.01 Init -  49836  49762   75  0  0   55  100    68 0.849   5.84
 3.00 Prom -  57300  57261   40                              -6.25

 4.00 Prom +  60259  60298   40                              -6.25
 4.01 Init +  66731  66845  115  1  1   18   83    82 0.702   1.12
 4.02 Term +  67787  68040  254  0  2   80   44   227 0.316  12.32
 4.03 PlyA +  68168  68173    6                               1.05

 5.08 PlyA -  68580  68575    6                               1.05
 5.07 Term -  68768  68700   69  2  0   23   48    52 0.217  -8.34
 5.06 Intr -  69989  69852  138  2  0   70  115   121 0.914  12.74
 5.05 Intr -  74901  74812   90  0  0   84   80    96 0.990   7.67
 5.04 Intr -  77803  77705   99  1  0   48   90   109 0.987   6.49
 5.03 Intr -  84661  84525  137  2  2   56   99    47 0.613   1.97
 5.02 Intr -  89099  88949  151  2  1   29   82   164 0.626   8.71
 5.01 Init -  94533  94264  270  0  0   65  -23   349 0.516  18.51
 5.00 Prom -  95864  95825   40                              -8.85

 6.00 Prom +  96157  96196   40                              -4.05
 6.01 Sngl + 100178 101452 1275  1  0   73   47  1735 0.815 161.56
 6.02 PlyA + 101661 101666    6                               1.05

 7.00 Prom + 103974 104013   40                             -10.75
 7.01 Init + 104749 104757    9  0  0   47  116    11 0.336  -0.27
 7.02 Intr + 106956 107109  154  2  1   28   98   165 0.839  10.22
 7.03 Intr + 130947 131232  286  1  1   54   24   138 0.085  -0.52
 7.04 Intr + 137967 138152  186  0  0   73   50   153 0.630   7.88
 7.05 Intr + 139216 139271   56  1  2   65   72    78 0.419   1.60
 7.06 Term + 148193 148296  104  0  2   73   41   173 0.953   8.56
 7.07 PlyA + 148404 148409    6                               1.05

 8.11 PlyA - 149368 149363    6                               1.05
 8.10 Term - 153666 153502  165  0  0   72   42   207 0.917  11.43
 8.09 Intr - 159784 159741   44  2  2   63   91    73 0.756   2.14
 8.08 Intr - 160144 160103   42  2  0   80  111    28 0.630   1.59
 8.07 Intr - 166413 166381   33  1  0   94   94    30 0.688   1.48
 8.06 Intr - 168659 168531  129  0  0   59   70   211 0.227  16.15
 8.05 Intr - 172905 172831   75  1  0   52   60   119 0.481   4.07
 8.04 Intr - 175441 175349   93  2  0    3  115   157 0.739   8.92
 8.03 Intr - 187387 187094  294  2  0  -34    9   363 0.052  12.26
 8.02 Intr - 198100 197921  180  2  0  101   88   181 0.996  18.32
 8.01 Init - 200647 200629   19  1  1   84  102     4 0.742   1.69

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:61478220_61679668|GENSCAN_predicted_peptide_1|858_aa
LGVVDLLKPSSLNSSKSFSIQLCSVAGEELHSFGGGESGPSAAGLLEFAGSPLQTLFAWV
SAAVAAEQRILVNRKCCCLIIPLEVLSQRSTRPCDVSVLPYWRVPPPRCPSETKLPEEGS
GSNICCSAIFAIVQPPLVIPRQTGSGVDLQQTPTDLQLRVLTVRRKTNKQKGHPHQNPIC
TSPTSKTKDHSAINLELRIKKLTQSHSITWKLRNLLLNDYWVYNEMKAEIKMFFETNENK
DTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDILTSQLKELEKQEQTHSKASRRQEI
TKIRAELKAIETQKILQKINESRSWFFEKINKIDRPLARLIKKKREKNQIDAIKNDKKDI
TTNPTEIQITIREYYKHLYTIKLENLEEMDKFLDTYTLLRLNQEEVESLNTPIIGSEIDA
IINSLPTKKSPGPDGFTAEFYQRYKEELLISNFSKVSGYKINMQKSQAFLYTNNRQTESQ
IMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWRNIPCSWVGRINI
MKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRDHIAKTILTQKNKAGGITL
PDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPLEIIPHIYNHLIFDKPDKNKQWGKDSLF
NKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNGRPKTIKTLEENLGITIQDIGMGK
DFLTKTPKAMATKAKIDKWYLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKRLISR
IYNELKQIYKKKTNNPIKKWTKDMNRHFSKEDIYAVNRHMKKCSSSLAIREMQIKTTMRY
HLTPVRMAIIKKSGNNRC
>gi568815590f:61478220_61679668|GENSCAN_predicted_CDS_1|2577_bp
ctcggagtagttgatcttctgaagccttcttctctcaactcgtcaaagtcattctccatc
cagctttgttccgttgctggtgaggagctgcattcctttggaggaggagagtcaggaccc
tcagctgcaggtctgttggagtttgctggaagtccactccagaccctgtttgcctgggta
tcagcagcggtggctgcagaacagcggatattggtgaaccgcaaatgctgctgcctgatc
attcccctggaagttttgtctcagaggagtacccggccgtgtgacgtgtcagtcctcccc
tactggagggtgcctccccctaggtgcccctctgagacgaagcttccagaggaaggatca
ggcagcaacatctgctgttctgcaatatttgctattgtgcagcctccactggtaataccc
agacaaacagggtctggagtggacctccagcaaactccaacagatctgcagctgagggtc
ctgactgttagaaggaaaactaacaaacagaaaggacatccacaccaaaaccccatctgt
acgtcaccaacatcaaagaccaaagaccacagtgcaatcaacctagaactcaggattaag
aaactcactcaaagccactcaattacatggaaactgaggaacctgctcctaaatgactac
tgggtatataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaa
gacacaacataccagaatctctgggacacatttaaagcagtgtgtagagggaaatttata
gcactaaatgcccacaagagaaagcaggaaagatctaaaattgacatcctaacatcacaa
ttaaaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaata
actaagatcagagcagaactgaaggcgatagagacacaaaagatccttcaaaaaatcaat
gaatccaggagctggttttttgaaaagatcaacaaaattgatagaccactagcaagacta
ataaagaagaaaagagagaagaatcaaatagatgcaataaaaaatgacaaaaaggatatc
accaccaatcccacagaaatacaaattaccatcagagaatactataaacacctctacaca
attaaactagaaaatctagaagaaatggataaatttctggacacatacaccctcctgagg
ctaaaccaggaagaagttgaatccctgaatacaccaataataggttctgaaattgatgca
ataattaatagcctaccaaccaaaaaaagtccaggaccagacggattcacagctgaattc
taccagaggtacaaagaggagctgttgataagcaacttcagcaaagtctcaggatacaaa
atcaacatgcaaaaatcacaagcattcctgtacaccaataacagacaaacagagagccaa
atcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaa
cttacaagggatgtgaaggacttattcaaggagaactacaaaccactgctcaatgaaata
aaagaggatacaaacaaatggaggaacattccatgctcatgggtaggaagaatcaatatc
atgaaaatggccatactgcccaaggtaatttacagattcaacgccatccccatcaagcta
ccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaa
agagaccacattgccaagacaatcctaacccaaaagaacaaagctggaggcatcacgcta
cctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaa
aacagagatatagaccaatggaacagaacagagcccttggaaataataccacacatctac
aaccacctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctattt
aataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttc
cttacaccttatacaaaaattaattcaagatggattaaagacttaaatggtagacctaaa
accataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaag
gacttcttgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatggtat
ctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaa
cccacaaaatgggagaaaattttcgcaacctactcatctgacaaaaggctaatatccaga
atctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgg
acaaaggatatgaacagacacttctcaaaagaagacatttatgcagtcaacagacacatg
aaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagatac
catctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacaggtgctag
>gi568815590f:61478220_61679668|GENSCAN_predicted_peptide_2|124_aa
MQKREPLMKPSDLMILIHYDEDSMGEAAPMIQIISHRVLPTACGNYGSTIQDEIWFKAQC
ACGEGRSVGLYEEQESRKSRGIKTNSKHEDNQESTTRDTGQPPSSTLDSCLPALTYGPCG
KGKY
>gi568815590f:61478220_61679668|GENSCAN_predicted_CDS_2|375_bp
atgcaaaagcgggaacccctgatgaaaccatcagatctcatgatacttattcactatgat
gaggacagtatgggggaagctgcccccatgattcaaattatctctcatcgggtccttccc
acagcatgtgggaattatgggagtacaattcaagatgagatttggtttaaggcccagtgt
gcatgtggggaagggagaagtgtaggtttgtacgaggaacaggaaagcagaaaatccaga
ggcattaaaacaaacagcaagcatgaagacaaccaagagagcacaaccagagacacaggc
caacctccctccagcaccttggacagctgccttcctgctttgacctacggaccctgtgga
aaaggaaaatactga
>gi568815590f:61478220_61679668|GENSCAN_predicted_peptide_3|160_aa
MPPVGKQDELPLAQKLKAELRGHVKSLERNWKLIRDEGLAVMDKAKGLFLPEDENLREKG
DWSQFTLWQQGRRNENACKGAPKTCTLLEKFPETTGCRRGQIKYSIMHPGTHVWPHTGPT
NCRLRMHLGLVIPKEGCKIRCANETKYGSSVIRHIRHLRG
>gi568815590f:61478220_61679668|GENSCAN_predicted_CDS_3|483_bp
atgcccccagttggcaaacaagatgaactccctctggctcaaaagttaaaagcagaacta
agaggccatgtcaagtctttagaaagaaactggaagttaatccgagatgaaggccttgca
gtgatggataaagccaaaggtctcttcctgcctgaggatgaaaacctgagggaaaaaggg
gactggagccagttcacgctgtggcagcaaggaagaagaaatgaaaatgcctgcaaagga
gctcctaaaacctgtaccttactagaaaagttccccgagacaacaggatgcagaagagga
cagatcaaatattccatcatgcaccccgggactcacgtgtggccgcacacagggcccaca
aactgcaggctccgaatgcacctgggcttggtgattcccaaggaaggctgcaagattcga
tgtgccaacgagaccaagtatgggtcctctgttatccgtcatatccgtcacctcagagga
tga
>gi568815590f:61478220_61679668|GENSCAN_predicted_peptide_4|122_aa
MREGKQKEVKKESSRKEGYRMGRSVGRILSGMRMVKGSDSLVTVSAHETWFPPIEEGLPG
GFQKAIYGWGGSSAMSLSPPLFQWLLAAILQSASTRTKLTKEEVQSQEHLPLIILPRSLP
YC
>gi568815590f:61478220_61679668|GENSCAN_predicted_CDS_4|369_bp
atgagagaaggaaaacagaaagaggtaaagaaagagtcctccaggaaagagggctaccgt
atgggaaggtctgttggaagaattttgtctggaatgaggatggtaaagggcagtgactcc
cttgtaactgtgagtgcccatgagacatggttcccaccaatagaagagggtctgccagga
ggtttccagaaggccatttacggctggggtggtagctcagcaatgtcactctctccccct
ctcttccagtggctgctggcagccatcctgcagtcagccagcacaaggacaaagctgacc
aaggaggaagtgcagagccaagaacatcttcccttgattatactgccccgaagtctgccc
tactgctag
>gi568815590f:61478220_61679668|GENSCAN_predicted_peptide_5|317_aa
MDYGGKSRCKKNRRLLQVYTRDAGGLDCVHGHGDEGSGQLVELKGLEDGLDVKTSEEEDS
RLTPEFLVRPAEWRIMPVTDQEEQVWSDAECEDDLAEKRRSNEVLRGAIETYQEVASLPD
VPADLLKLSLKRRSDRQQFLGHMRGSLLTLQRLVQLFPNDTSLKNDLGVGYLLIGDNDNA
KKVYEEVLSVTPNDGFAKVHYGFILKAQNKIAESIPYLKEGIESGDPGTDDGRFYFHLGD
AMQRVGNKEAYKWYELGHKRGHFASVWQRSLYNVNGLKAQPWWTPKETGYTELVKKIYSD
VQTIDFTCARSRDEEEE
>gi568815590f:61478220_61679668|GENSCAN_predicted_CDS_5|954_bp
atggattatggaggcaaaagtagatgcaaaaagaatagaaggctattgcaagtgtacaca
agagatgctggtggcttggactgcgttcacggccatggagatgagggaagtgggcagttg
gtggagctgaaaggacttgaggatggattagatgtaaaaacatcagaggaagaggactca
aggctgacccctgagtttttggttcggccagctgagtggaggataatgccagttactgac
caggaagagcaagtctggagtgatgcagaatgtgaggatgatttggctgagaagaggaga
agtaatgaggtgctacgtggagccatcgagacctaccaagaggtggccagcctacctgat
gtccctgcagacctgctgaagctgagtttgaagcgtcgctcagacaggcaacaatttcta
ggtcatatgagaggttccctgcttaccctgcagagattagttcaactatttcccaatgat
acttccttaaaaaatgaccttggcgtgggatacctcttgataggagataatgacaatgca
aagaaagtttatgaagaggtgctgagtgtgacacctaatgatggctttgctaaagtccat
tatggcttcatcctgaaggcacagaacaaaattgctgagagcatcccatatttaaaggaa
ggaatagaatccggagatcctggcactgatgatgggagattttatttccacctgggggat
gccatgcagagggttgggaacaaagaggcatataagtggtatgagcttgggcacaagaga
ggacactttgcatctgtctggcaacgctcactctacaatgtgaatggactgaaagcacag
ccttggtggaccccaaaagaaacgggctacacagagttagtaaagaaaatatattctgat
gttcaaacaattgattttacatgtgcaagaagcagggatgaagaggaagagtga
>gi568815590f:61478220_61679668|GENSCAN_predicted_peptide_6|424_aa
MGGITAVMVNQSLLSPLVLEVDPNIQAVCTQEKEQIKTLNNKFASFTDKVRFLEQQNKML
ETKWSLLQQQKMARSNMDNMFESYINNLRRQLETLGQEKLKLEAELGNMQRLVEDFKNKY
EDEINKHTEMENEFVLIKKDVDEAYMNKVELESRLEGLTDEINFLRQLYEKEIRELQSQI
SDTSVVLSMDNSRSLDMDSIIAEVKAQYEDIANQRRAEAESMYQIKYEELQSLAGKHGDD
LWRTKTEISEMNWNISQLQAEIEGLKGQRASLEAAIADAEQRGELAIKDANAKLSELEAA
LQRAKQDMARQLREYQELMNVKLALDIEIATYRKLLEGKESRLKSGMQNMSIHTKTTSGY
AGGLSSAYGGLTSPGLSYGLGSSFGSGAGSSSLSRTSSSRAVVVKKIETCDGKLVSESSD
ILPK
>gi568815590f:61478220_61679668|GENSCAN_predicted_CDS_6|1275_bp
atgggaggcatcaccgcagtcatggtcaaccagagcctactgagcccccttgtcctggag
gtggaccccaacatccaggctgtgtgcacccaggagaaggagcagatcaagaccctcaac
aacaagtttgcctccttcacagacaaggtacggttcctggagcagcagaacaagatgctg
gagaccaagtggagcctcctgcagcagcagaagatggctcggagcaacatggacaacatg
ttcgagagctacatcaacaaccttaggcggcagctggagactctgggccaggagaagctg
aagctggaggcggagcttggcaacatgcagaggctggtggaggacttcaagaacaagtat
gaggatgagatcaataagcatacagagatggagaatgaatttgtcctcatcaagaaggat
gtggatgaagcttacatgaacaaggtagagctggagtctcgcctggaagggctgactgac
gagatcaacttcctcaggcagctgtacgaaaaggagatccgggagctgcagtcccagatc
tcggacacatctgtggtgctgtccatggacaacagccgctccctggacatggacagcatc
attgctgaggtcaaggcacagtacgaggatattgccaaccaaagacgggctgaggctgag
agcatgtaccagatcaagtatgaggagctgcagagtctggctgggaagcacggggatgac
ctgtggcgcacaaagactgagatctctgagatgaactggaacatcagccagctccaggct
gagattgagggcctcaaaggccagagggcttccctggaggccgccattgcagatgccgag
cagcgtggagagctggccattaaggatgccaacgccaagttgtccgagctggaggctgcc
ctgcagcgggccaagcaggacatggcacggcagctgcgtgagtaccaggagctgatgaac
gtcaagctggccctggacatcgagatcgccacctacaggaagctgctggagggcaaagag
agccggctgaagtctgggatgcagaacatgagtattcatacgaagaccaccagcggctat
gcaggtggtctgagctcggcctatgggggcctcacaagccccggcctcagctacggcctg
ggctccagctttggctctggcgcaggctccagctccctcagccgcaccagttcctccagg
gccgtggttgtgaagaagatcgagacatgtgatgggaagctggtgtccgagtcctctgac
atcctgcccaagtga
>gi568815590f:61478220_61679668|GENSCAN_predicted_peptide_7|264_aa
MTKHPANRSPPPEGKAHPAGERASPAGAQLAVPAEWMDAEHFSEAPSDKSGSLKAPNDVA
GLATGLSVLFQYFEDPMLPLHVEILLSVSKHPLLAALLVVCSRIPMRLGFSVFLRQICLL
PPAWSPDTACEMPNLHSMLTHISVTADRAWFQRFQPLHFPGKPPLGSASLEVRPVHRGYP
TVQGKVRAAKTRMGGLEHERQRYEELTIATALRYRYLIYSYVLNISEDPGRQLGNRIINY
RKKPLGMACKAADENLRFFRALEN
>gi568815590f:61478220_61679668|GENSCAN_predicted_CDS_7|795_bp
atgaccaagcacccagcaaacaggagccctcctcccgaaggcaaggcacacccagcaggt
gagagggccagtcctgctggtgcacaactggcagtgcctgcagagtggatggatgcagag
cacttctcagaagccccaagtgacaagagtggaagcttgaaggctcctaacgacgtggca
gggctagcgacggggctttctgtgctattccagtactttgaggatccaatgctacccctc
cacgtggaaatcctgctttctgttagcaaacatcctctgctggcagctttacttgtggtc
tgttccagaatccctatgaggctaggtttctccgtatttctgagacagatttgcctgctg
cctcccgcgtggtctccagacactgcctgtgagatgcctaatcttcattccatgctcacc
cacatctctgtcactgctgaccgggcatggttccagaggttccagcctctacacttccct
ggcaaaccccctttaggatcagcatccctggaggtcagacctgtgcaccgaggatacccg
acagtgcaggggaaggtacgggctgctaagactagaatgggaggtctggagcatgagagg
cagagatatgaggagctcacaattgcaactgccctcaggtatcgctatcttatttatagt
tatgtcctaaatatatcggaagatccaggaagacaacttggtaacagaataatcaactac
cgaaagaagcccctggggatggcttgcaaagctgctgatgaaaatctcaggttctttcga
gcacttgagaattaa
>gi568815590f:61478220_61679668|GENSCAN_predicted_peptide_8|357_aa
MKPDFQGLTKDGSNENIDSLEEVLNILAEESSDWFYGFLSFLYDIMTPFEMLEEEEEESE
TADGVDVKELTKEELKKEKEKPESRKESKNEERKKGKKEDVRKDKKIADADLSRKESPKG
KKDREKEKVDLEKSAKTKENRKKSTNMKDVSSKMASRDKDDRKERLKERSTSEPAVPPEE
AEPHTEPEEQVPVEAEPQNIEDEAKEQIQSLLHEMVHAEHVEGEDLQQEDGPTGEPQQED
DEFLMATDVDDRFETLEPEVSHEETEHSYHVEETAVYEPLENEGIEITEVTAPPEDNPVE
DSQLHAQLTSMTVPDTGGPVFSQGYMSSTPAGTSKQDQEKLHSERPHGEVVTCNLVP
>gi568815590f:61478220_61679668|GENSCAN_predicted_CDS_8|1074_bp
atgaagccagacttccaaggcctgaccaaagatggcagtaatgaaaatattgattctctt
gaggaagtccttaatattttagcagaggaaagttcagattggttttatggtttcctctca
tttctctatgatataatgactccttttgaaatgctagaagaagaagaagaagaaagcgaa
accgcagatggtgttgatgttaaagaactcactaaagaagagctcaagaaggagaaagag
aaacctgagtcaaggaaggaaagtaagaatgaagagagaaaaaaggggaagaaagaggat
gtccgaaaggataagaaaattgctgatgcagacctatccaggaaggagtctcctaagggt
aaaaaggacagagaaaaagagaaagtggacctagaaaaaagtgctaaaaccaaggaaaat
aggaaaaaatcaacaaatatgaaggatgtttctagtaaaatggcatcccgagacaaagat
gacagaaaggaaagacttaaagagagatctacttcagagccagcagtcccgccagaagag
gctgagccacacactgagcccgaggagcaggttcctgtggaggcagaaccccagaatatc
gaagatgaagcaaaagaacaaattcagtcccttctccatgaaatggtacacgcagaacat
gttgagggagaagacttgcaacaagaagatggacccacaggagaaccacaacaagaggat
gatgagtttcttatggcgactgatgtagatgatagatttgagaccctggaacctgaagta
tctcatgaagaaaccgagcatagttaccacgtggaagagacagcagtatatgaacctcta
gaaaatgaagggatagaaatcacagaagtaactgctccccctgaggataatcctgtagaa
gattcacagctgcatgctcagttgacgagcatgaccgtccccgatacaggaggaccagtc
ttcagtcaagggtatatgagtagcacccctgctggaacctccaaacaagatcaagagaag
ctacattctgaaagacctcatggagaagttgtaacctgtaacttggtaccataa