GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:02:06 Sequence gi568815589r:38311403_38524358 : 212956 bp : 44.97% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4452 4532 81 0 0 54 56 136 0.027 5.75 1.02 Intr + 19365 19431 67 0 1 63 84 40 0.010 0.01 1.03 Intr + 37239 37268 30 2 0 101 83 46 0.396 3.73 1.04 Intr + 40544 40625 82 1 1 87 74 50 0.700 2.71 1.05 Term + 41632 41798 167 2 2 71 52 58 0.034 -1.42 1.06 PlyA + 42023 42028 6 1.05 2.07 PlyA - 42283 42278 6 1.05 2.06 Term - 43530 43433 98 1 2 84 42 238 0.093 16.93 2.05 Intr - 52071 52015 57 1 0 92 82 17 0.094 0.36 2.04 Intr - 61567 61508 60 2 0 97 87 55 0.707 5.01 2.03 Intr - 67432 67348 85 1 1 24 113 66 0.011 2.09 2.02 Intr - 74404 74273 132 1 0 62 77 71 0.056 4.24 2.01 Init - 81558 81415 144 0 0 55 18 114 0.066 1.22 2.00 Prom - 83660 83621 40 0.24 3.00 Prom + 84070 84109 40 -9.95 3.01 Sngl + 84347 85900 1554 1 0 82 48 2328 0.937 221.58 3.02 PlyA + 86032 86037 6 1.05 4.09 PlyA - 87600 87595 6 1.05 4.08 Term - 100147 99998 150 1 0 106 45 136 0.963 9.01 4.07 Intr - 101951 101835 117 2 0 121 97 91 0.999 13.96 4.06 Intr - 102801 102692 110 1 2 99 63 112 0.780 9.80 4.05 Intr - 103676 103604 73 2 1 15 94 51 0.154 -2.62 4.04 Intr - 107778 107697 82 2 1 105 75 10 0.917 1.04 4.03 Intr - 108529 108416 114 0 0 64 61 179 0.970 12.36 4.02 Intr - 111092 111020 73 0 1 75 92 15 0.973 -0.94 4.01 Init - 113022 112563 460 0 1 83 109 728 0.995 68.32 4.00 Prom - 114234 114195 40 -5.16 5.04 PlyA - 118418 118413 6 1.05 5.03 Term - 121073 120886 188 0 2 64 50 145 0.404 5.95 5.02 Intr - 126953 126860 94 2 1 100 72 8 0.185 -0.06 5.01 Init - 128357 128301 57 2 0 75 121 19 0.229 5.21 5.00 Prom - 138456 138417 40 -4.26 6.00 Prom + 139647 139686 40 -5.36 6.01 Init + 142418 142428 11 1 2 117 43 18 0.030 -0.92 6.02 Intr + 144716 144791 76 2 1 91 89 57 0.031 5.62 6.03 Intr + 147806 147903 98 1 2 13 94 28 0.030 -5.29 6.04 Intr + 150701 150831 131 2 2 132 79 160 0.944 19.84 6.05 Intr + 158445 158563 119 1 2 76 99 96 0.903 9.68 6.06 Intr + 163025 163138 114 1 0 78 115 7 0.297 3.04 6.07 Intr + 167045 167367 323 1 2 -26 4 260 0.264 0.86 6.08 Term + 167450 167948 499 2 1 4 50 288 0.440 10.60 6.09 PlyA + 170026 170031 6 1.05 7.03 PlyA - 170418 170413 6 1.05 7.02 Term - 176894 176479 416 0 2 9 43 406 0.465 23.62 7.01 Init - 193736 193514 223 2 1 76 32 116 0.047 3.76 7.00 Prom - 199363 199324 40 -2.06 8.04 PlyA - 201363 201358 6 1.05 8.03 Term - 207343 207216 128 2 2 53 48 104 0.847 1.34 8.02 Intr - 207690 207610 81 1 0 60 57 66 0.478 0.31 8.01 Intr - 212013 211950 64 0 1 116 91 5 0.663 1.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 43380 43516 137 1 2 7 50 276 0.878 13.88 S.002 Init - 62948 62798 151 2 1 58 69 83 0.818 3.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:38311403_38524358|GENSCAN_predicted_peptide_1|142_aa XSSANHQAASQRAGPYSSLDFRDPIGIVIKKNDYKYPDKEKRKEKDFYEGVIGKQVVFGY SCPQGWAVSCHVLEAELTLLATGDGTRREFEVTNLEMLAADVFLLLRRLLVLWKRITNPE TMLTCTTSGVPAVRPAAHAPPL >gi568815589r:38311403_38524358|GENSCAN_predicted_CDS_1|429_bp nngtccagtgccaaccaccaggctgccagtcaacgagcaggtccctactcctctctggac ttccgtgaccccatcggaatagtgataaagaaaaatgattataaatatccagataaggaa aaaagaaaagaaaaggatttctatgaaggggttattgggaaacaggtggtgttcggttac tcctgcccacaaggatgggctgtcagctgccatgtgttagaagctgagctgaccctcttg gctacaggtgatggcaccagaagggagtttgaagtcacaaacctggagatgttggcagca gatgttttcttactcctccgcaggctgctggtattgtggaaaaggataacaaacccagag acaatgcttacctgcaccacctctggtgtgcctgctgtgagacctgcagcacacgcgcca cccctctga >gi568815589r:38311403_38524358|GENSCAN_predicted_peptide_2|191_aa MGLGDVERVLQCNGGKASPQMRTPRPREESPHAEPGAAASRGSRGETSTGEADRHGDWHV SLTQCEKENPESGAKRAEKQEMGQNNYTKPGKHMVDVAVVCPELVGSLLTDFKNEAVDPH DRREDLLINVCTSASEEKQGYPPSDPASVLSPPHHEAAAVPKVISEQQQQQQQQQQQQQQ QTSFEGPKKQA >gi568815589r:38311403_38524358|GENSCAN_predicted_CDS_2|576_bp atgggattaggggacgtggagagagtcctacagtgcaacggcggaaaggccagcccccag atgcggacgccgcggcccagagaggaaagcccccacgcggagccaggagctgcggcttcc cgagggagccgaggggagaccagcactggggaggcagacagacatggagactggcatgtt tccctcacacagtgtgagaaagagaacccagagagtggtgccaagagagcagagaaacag gaaatgggacaaaacaactacacaaagcctgggaagcatatggtagatgtagcggtggtg tgcccggaattggtgggttctttgctcactgacttcaagaatgaagccgtggaccctcac gaccgccgtgaagatttactaataaatgtctgcacatcagccagtgaagagaagcagggc tacccacccagtgatcctgcctctgtcctctcacccccgcatcatgaagcagctgccgtc ccaaaggtcatctcagagcagcagcagcagcagcagcagcagcagcagcagcagcagcag cagacaagctttgaggggcccaaaaagcaggcctag >gi568815589r:38311403_38524358|GENSCAN_predicted_peptide_3|517_aa MLRFLAPRLLSLQGRTARYSSAAALPSPILNPDIPYNQLFINNEWQDAVSKKTFPTVNPT TGEVIGHVAEGDRADVDRAVKAAREAFRLGSPWRRMDASERGRLLNRLADLVERDRVYLA SLETLDNGKPFQESYALDLDEVIKVYRYFAGWADKWHGKTIPMDGQHFCFTRHEPVGVCG QIIPWNFPLVMQGWKLAPALATGNTVVMKVAEQTPLSALYLASLIKEAGFPPGVVNIITG YGPTAGAAIAQHVDVDKVAFTGSTEVGHLIQKAAGDSNLKRVTLELGGKSPSIVLADADM EHAVEQCHEALFFNMGQCCCAGSRTFVEESIYNEFLERTVEKAKQRKVGNPFELDTQQGP QVDKEQFERVLGYIQLGQKEGAKLLCGGERFGERGFFIKPTVFGGVQDDMRIAKEEIFGP VQPLFKFKKIEEVVERANNTRYGLAAAVFTRDLDKAMYFTQALQAGTVWVNTYNIVTCHT PFGGFKESGNGRELGEDGLKAYTEVKTVTIKVPQKNS >gi568815589r:38311403_38524358|GENSCAN_predicted_CDS_3|1554_bp atgctgcgcttcctggcaccccggctgcttagcctccagggcaggaccgcccgctactcc tcggcagcagccctcccaagccccattctgaacccagacatcccctacaaccagctgttc atcaacaatgaatggcaagatgcagtcagcaagaagaccttcccgacggtcaaccctacc accggggaggtcattgggcacgtggctgaaggtgaccgggctgatgtggatcgggccgtg aaagcagcccgggaagccttccgcctggggtccccatggcgccggatggatgcctctgag cggggccggctgctgaaccgcctggcagacctagtggagcgggatcgagtctacttggcc tcactcgagaccttggacaatgggaagcctttccaagagtcttacgccttggacttggat gaggtcatcaaggtgtatcggtactttgctggctgggctgacaagtggcatggcaagacc atccccatggatggccagcatttctgcttcacccggcatgagcccgttggtgtctgtggc cagatcatcccgtggaacttccccttggtcatgcagggttggaaacttgccccggcactc gccacaggcaacactgtggttatgaaggtggcagagcagacccccctctctgccctgtat ttggcctccctcatcaaggaggcaggctttccccctggggtggtgaacatcatcacgggg tatggcccaacagcaggtgcggccatcgcccagcacgtggatgttgacaaagttgccttc accggttccaccgaggtgggccacctgatccagaaagcagctggcgattccaacctcaag agagtcaccctggagctgggtggtaagagccccagcatcgtgctggccgatgctgacatg gagcatgccgtggagcagtgccacgaagccctgttcttcaacatgggccagtgctgctgt gctggctcccggaccttcgtggaagaatccatctacaatgagtttctcgagagaaccgtg gagaaagcaaagcagaggaaagtggggaacccctttgagctggacacccagcaggggcct caggtggacaaggagcagtttgaacgagtcctaggctacatccagcttggccagaaggag ggcgcaaaactcctctgtggcggagagcgtttcggggagcgtggtttcttcatcaagcct actgtctttggtggcgtgcaggatgacatgagaattgccaaagaggagatctttgggcct gtgcagcccctgttcaagttcaagaagattgaggaggtggttgagagggccaacaacacc aggtatggcctggctgcggctgtgttcacccgggatctggacaaggccatgtacttcacc caggcactccaggccgggaccgtgtgggtaaacacctacaacatcgtcacctgccacacg ccatttggagggtttaaggaatctggaaacgggagggagctgggtgaggatgggcttaag gcctacacagaggtaaagacggtcaccatcaaggttcctcagaagaactcgtaa >gi568815589r:38311403_38524358|GENSCAN_predicted_peptide_4|392_aa MPRLSLLLPLLLLLLLPLLPPLSPSLGIRDVGGRRPKCGPCRPEGCPAPAPCPAPGISAL DECGCCARCLGAEGASCGGRAGGRCGPGLVCASQAAGAAPEGTGLCVCAQRGTVCGSDGR SYPSVCALRLRARHTPRAHPGHLHKARDGPCEFVPITRFYNCFPQPLIHRQFSLSPDRRQ SETLSKKKKKKEEEEEEEEEGEEEKEEEGCKSNFQHTINFKEISEGFGKIFSFQPSMIDI IDEASTLHVAQHAVVLDARVAELLSNAAPVVVVPPRSVHNVTGAQVGLSCEVRAVPTPVI TWRKVTKSPEGTQALEELPGDHVNIAVQVRGGPSDHEATAWILINPLRKEDEGVYQCHAA NMVGEAESHSTVTVLDLSKYRSFHFPAPDDRM >gi568815589r:38311403_38524358|GENSCAN_predicted_CDS_4|1179_bp atgccgcgcttgtctctgctcttgccgctgctgcttctgctgctgctgccgctgctgccg ccgctgtccccgagccttgggatccgcgacgtgggcggccggcgccccaagtgtggtccg tgccggccagagggctgcccggcgcctgcgccctgcccggcgcccgggatctcggcgctc gacgagtgcggctgctgcgcccgctgcctgggagccgagggcgcgagctgcgggggccgc gccggcgggcgctgtggccccggcctggtatgcgcgagccaggccgctggggcagcgccc gagggcaccgggctctgcgtgtgcgcgcagcgcggcaccgtctgcggctccgacggtcgc tcgtaccccagcgtctgcgcgctgcgcctgcgcgctcggcacacgccccgcgcgcacccc ggtcacctgcacaaggcgcgcgacggcccttgcgagttcgttcctatcactcgtttttat aactgctttcctcagccgttaattcacaggcaattctctttgtctccagacaggagacag agtgagaccctgtctaaaaagaagaagaagaaggaggaggaggaggaggaggaggaggag ggggaggaggagaaggaagaagaaggatgcaaaagcaatttccaacacaccattaacttt aaagaaatctcagagggatttgggaagattttttcattccagccatcaatgatcgatata attgacgaggcctctacactgcacgttgcccaacacgctgtggtgctggatgccagggtg gctgagttgctgtccaatgcagctcctgtggtcgtcgttcctccccgaagtgttcacaac gtcaccggggcgcaggtgggcctgtcctgtgaagtgagggctgtgcctaccccagtcatc acgtggagaaaggtcacgaagtcccctgagggcacccaagcactggaggagctgcctggg gaccatgtcaatatagctgtccaagtgcgagggggcccttctgaccatgaggccacggcc tggattttgatcaaccccctgcgaaaggaggatgagggtgtgtaccagtgccatgcagcc aacatggtgggagaggctgagtcccacagcacagtgacggttctagatctgagtaaatac aggagcttccacttcccagctcccgatgaccgcatgtga >gi568815589r:38311403_38524358|GENSCAN_predicted_peptide_5|112_aa MVLTSKEHQGSKGHSWHQKFKTDIVTPEVMVGFFVIFWSRLSVLGSKTFEVVTFIELKLL PYPPYLGHLLEHTEHGFLSYSPVSSTGQVYSEPSESLLDEEQMSQCISESTD >gi568815589r:38311403_38524358|GENSCAN_predicted_CDS_5|339_bp atggtgctgacttcaaaggaacatcaagggtccaaggggcacagctggcaccaaaagttc aaaacagatattgtgacgccagaagtgatggttggattttttgttatattttggagtagg ctttcagttcttggatctaaaacatttgaagtagtaacattcattgagctgaaattgctt ccctatccaccctacctgggccaccttcttgagcacacagagcatggcttcctcagctac tccccagtgtccagcacaggccaggtgtacagcgagccttctgagagcttgctggatgaa gaacaaatgagtcaatgcatcagtgaatctacagattag >gi568815589r:38311403_38524358|GENSCAN_predicted_peptide_6|456_aa MAMRTRQLNGTCTSSVDMELFLHYSLIPSAQAVDMLRSQPRALEKRESTDVRTEARGDLS CRPLDFVGTFSNRWSYGITFGATANKVMFLFSEGYQPLQIPQWAQAFELLIGGIEVGLSH FPFFACLSSEFQLVSSILGFCYSDLTAWAVPCIPSSGEMMVGFHICVASPALEITLSPCI SHQARRRPAGATEDEVARSAKKMDEIVQEKNTAGALGLLTELQNVLELLRSTGTGMLLNV SLKQSTDEEVTSLAKSFVKSWKTLPDEPSTEKDPNEKRIEPAMTSQNSKRNASNSVRMKY REMLAAALRTGDDCIEMGADEEELGSRIEEAVDPERGNTGMKYKNRVQSKISNLTDAKNP NLRKNASCGNIPPDLLARMSAEEMASDELKEMHKNLTKEAIREHQMAKTGGTQPDSLTCG KCKKNCTSTQVQACSVGEPMATFVDCNECGNQQKFC >gi568815589r:38311403_38524358|GENSCAN_predicted_CDS_6|1371_bp atggcgatgaggaccaggcagttaaatggcacttgtaccagctcggtggacatggagctc ttccttcattattccctcatcccatcagctcaagctgtggacatgctgaggtctcagcca agggctctggagaagagagaaagcacagatgtcagaactgaggcaaggggtgacctgagc tgcaggcctctggacttcgtgggcactttcagtaaccgatggtcctatggaatcaccttt ggggccacagctaataaggtcatgtttttgttctcagaaggctaccagcccctgcagatc ccgcagtgggcccaagcctttgagcttctgattggaggcattgaagtcggcctgtcccac ttccccttctttgcctgcctctcgtcggaattccagctggtcagctccatcttgggcttc tgctactctgatctgacagcatgggctgtaccgtgcattccctcctctggagaaatgatg gtaggtttccacatctgtgtagcatctcccgctctggagatcaccctgagcccttgcatc tcccaccaggcacggaggagacctgccggagccacggaggatgaggtggcccgcagtgct aagaagatggacgagatagtgcaggagaagaacacggccggagcactgggtttgttaacc gagcttcagaatgttctggaattactgcggtccacaggaactggaatgttacttaatgtt agtctcaagcagagtacagatgaagaagttacatctctagcaaagtctttcgtcaaatcc tggaaaacgttaccagatgagccatcaactgagaaagaccccaacgaaaagcgaatagaa cctgcaatgacatcacagaatagcaaaagaaacgcttctaattctgtgcggatgaagtac agggagatgcttgctgcagctcttcgaacaggagatgactgcattgaaatgggagctgat gaggaagaattaggatctcgaattgaggaagctgtagatccagaaagagggaatacaggc atgaagtacaaaaatagagtccaaagtaagatatcaaatcttacagatgcaaagaatcca aatttaaggaaaaatgcatcgtgtgggaatattcctcctgacttacttgctagaatgtcc gcagaagaaatggctagcgatgagctcaaagagatgcacaaaaacttgacgaaagaagcc atcagagagcatcagatggccaagacaggtggaacccagcctgattcgctcacatgtggc aaatgtaaaaagaattgcacttccacacaggtacaagcctgcagtgttggtgaaccaatg gcaacgtttgttgactgtaatgaatgtggaaatcaacagaagttctgttga >gi568815589r:38311403_38524358|GENSCAN_predicted_peptide_7|212_aa MPPKHGAGPLWEPQQIQVGVAERQNAQNTTMLLLKLWVINSLIYVRLVPFVDTIKYGSVT KNLSIKMCLATPHMGWLALRFFVARDSAATFIGQWRTDLGEPEMLRELARETLGLCTRSA PQDKGLGGEVPRGRLRPVDLCEGAREPPDGAPGCGDPLERYLDSQDPRRRSSTTHRPPPS RVRTLSPQRVSPTPSPCPGSLVPARERRRSGP >gi568815589r:38311403_38524358|GENSCAN_predicted_CDS_7|639_bp atgcctcctaagcatggtgcaggccctctgtgggagccccagcaaatccaggtgggcgtg gctgaaagacaaaatgctcaaaacaccactatgcttcttcttaaactctgggtcatcaac agtctcatctacgtcaggcttgtgccgtttgttgacactatcaaatatggcagtgtcacc aagaatctctccataaaaatgtgtttagctacaccacacatggggtggctggcgctgcgc ttcttcgtggcccgcgacagtgcggccaccttcatcggccagtggcgcactgatctgggc gagcccgaaatgctccgcgagctggcgagggagacactgggcctctgcacccggtcagct ccccaagataaaggcctcggaggggaagttccgcgtgggagactccggcctgttgatctc tgtgagggtgcccgggagccgcctgacggcgcgcctgggtgtggcgacccgctggagcgc tacctggacagtcaggacccgcgccgccgctcctccaccacccaccgcccgcccccgtcg agggtccgcaccctgtccccgcagagggtgtcgcccacccctagcccctgccctggtagc ctggtccccgcgagagagcgccggcgctccggaccctag >gi568815589r:38311403_38524358|GENSCAN_predicted_peptide_8|90_aa APALKGSVCVRRRLQNSRLPFGRTLEFQLFHVISSVLGSQAFISAFLVVILDLLERTALG SPENSVMSAVNIFVSPGVCGIVSLNIWAKF >gi568815589r:38311403_38524358|GENSCAN_predicted_CDS_8|273_bp gcccctgctctcaaaggctctgtgtgtgtgaggaggcggctgcagaacagcagacttccc tttggcaggactttagagttccagttgttccacgtcatctccagcgtgctgggttctcag gccttcatctcagcctttctagtggtcattttggatctgctggagaggactgccctgggc tccccagagaattcagttatgtcggcagtgaacatttttgtatctcctggtgtttgtggc attgtcagtctcaacatctgggccaaattttga