GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:12:07 Sequence gi568815593r:138456040_138675318 : 219279 bp : 46.97% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 4496 4535 40 -2.96 1.01 Init + 7395 7515 121 2 1 94 38 93 0.861 5.42 1.02 Intr + 7796 7872 77 0 2 75 72 29 0.913 -0.77 1.03 Intr + 8281 8406 126 0 0 58 85 85 0.948 6.08 1.04 Intr + 9663 10029 367 2 1 88 115 612 0.946 58.62 1.05 Intr + 10368 10556 189 1 0 80 54 61 0.709 1.46 1.06 Term + 10718 12042 1325 0 2 88 43 1483 0.811 135.53 1.07 PlyA + 13256 13261 6 1.05 2.03 PlyA - 15350 15345 6 1.05 2.02 Term - 32666 32650 17 0 2 114 44 1 0.245 -3.30 2.01 Init - 36042 35931 112 0 1 86 60 157 0.632 11.08 2.00 Prom - 50190 50151 40 -3.76 3.28 PlyA - 51426 51421 6 1.05 3.27 Term - 52348 52266 83 2 2 106 43 108 0.999 5.96 3.26 Intr - 52777 52630 148 1 1 71 105 113 0.990 11.11 3.25 Intr - 54590 54526 65 0 2 68 87 -4 0.952 -4.06 3.24 Intr - 55161 55006 156 1 0 44 86 145 0.989 9.88 3.23 Intr - 55565 55436 130 2 1 51 42 88 0.934 0.77 3.22 Intr - 56915 56725 191 0 2 88 111 103 0.960 11.90 3.21 Intr - 57667 57529 139 1 1 86 42 65 0.917 1.74 3.20 Intr - 61661 61522 140 0 2 38 94 76 0.925 3.38 3.19 Intr - 62828 62653 176 1 2 114 84 41 0.517 5.88 3.18 Intr - 86897 86794 104 2 2 25 88 207 0.657 13.37 3.17 Intr - 87769 87509 261 1 0 71 -15 177 0.063 3.38 3.16 Intr - 100553 100413 141 2 0 42 82 78 0.894 3.15 3.15 Intr - 100827 100735 93 0 0 25 105 132 0.997 8.76 3.14 Intr - 101457 101363 95 1 2 79 55 113 0.998 6.78 3.13 Intr - 101947 101830 118 1 1 40 89 134 0.954 8.74 3.12 Intr - 102618 102514 105 0 0 93 74 52 0.956 4.71 3.11 Intr - 104052 103825 228 0 0 47 110 240 0.812 20.17 3.10 Intr - 105750 105541 210 0 0 80 86 106 0.821 8.71 3.09 Intr - 110679 110587 93 0 0 45 94 79 0.925 4.36 3.08 Intr - 111124 110962 163 0 1 38 105 160 0.999 12.68 3.07 Intr - 111522 111416 107 0 2 89 84 124 0.999 11.11 3.06 Intr - 111683 111610 74 0 2 90 94 12 0.998 1.13 3.05 Intr - 113010 112886 125 2 2 63 109 66 0.920 6.43 3.04 Intr - 115102 114921 182 1 2 -3 84 227 0.778 11.87 3.03 Intr - 117811 117724 88 0 1 78 105 73 0.472 7.97 3.02 Intr - 118087 118029 59 1 2 108 85 -44 0.508 -5.02 3.01 Init - 119279 119199 81 2 0 74 94 120 0.613 10.21 3.00 Prom - 120504 120465 40 -4.16 4.00 Prom + 121245 121284 40 -5.06 4.01 Init + 123504 123557 54 2 0 71 26 61 0.209 -0.52 4.02 Intr + 130964 131005 42 1 0 106 98 50 0.275 6.24 4.03 Intr + 139393 139589 197 0 2 51 58 162 0.081 7.71 4.04 Intr + 150287 150326 40 2 1 96 94 -26 0.146 -2.97 4.05 Intr + 154906 155050 145 0 1 93 64 144 0.860 12.36 4.06 Intr + 160435 160546 112 2 1 79 37 16 0.222 -4.96 4.07 Intr + 164470 164746 277 1 1 73 72 383 0.881 32.72 4.08 Term + 164791 164925 135 0 0 29 48 162 0.995 4.32 4.09 PlyA + 164936 164941 6 1.05 5.00 Prom + 186434 186473 40 -5.46 5.01 Init + 191290 191371 82 0 1 87 80 79 0.753 6.14 5.02 Intr + 196611 196675 65 1 2 91 71 42 0.262 1.24 5.03 Term + 207185 207364 180 1 0 87 31 83 0.114 0.11 5.04 PlyA + 211561 211566 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100075 99998 78 1 0 68 38 96 0.918 0.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:138456040_138675318|GENSCAN_predicted_peptide_1|734_aa MDIQTQGHLRCNSLAFLPLRQESSERRITPAPMDNSVDSGKKTAPPLDSELEQEEPSLPG IPVPFGPGVPRQPIGKPLFRIPAVWAGPPPGLDKGGKVTPHHKDHYLLLQPRAAPPRPDT SSPACSSRMAAAKAEMQLMSPLQISDPFGSFPHSPTMDNYPKLEEMMLLSNGAPQFLGAA GAPEGSGSNSSSSSSGGGGGGGGGSNSSSSSSTFNPQADTGEQPYEHLTAGRTVILGRGC PGSPGLGARTSRGHGGAGGNPSPAQPPLRSAASCSGGGFSVFASAVVEMGSATESFPDIS LNNEKVLVETSYPSQTTRLPPITYTGRFSLEPAPNSGNTLWPEPLFSLVSGLVSMTNPPA SSSSAPSPAASSASASQSPPLSCAVPSNDSSPIYSAAPTFPTPNTDIFPEPQSQAFPGSA GTALQYPPPAYPAAKGGFQVPMIPDYLFPQQQGDLGLGTPDQKPFQGLESRTQQPSLTPL STIKAFATQSGSQDLKALNTSYQSQLIKPSRMRKYPNRPSKTPPHERPYACPVESCDRRF SRSDELTRHIRIHTGQKPFQCRICMRNFSRSDHLTTHIRTHTGEKPFACDICGRKFARSD ERKRHTKIHLRQKDKKADKSVVASSATSSLSSYPSPVATSYPSPVTTSYPSPATTSYPSP VPTSFSSPGSSTYPSPVHSGFPSPSVATTYSSVPPAFPAQVSSFPSSAVTNSFSASTGLS DMTATFSPRTIEIC >gi568815593r:138456040_138675318|GENSCAN_predicted_CDS_1|2205_bp atggacattcagacacaaggccatctgcgctgcaacagcctggccttcctgcccttgcgg caggagtcctctgagaggcgcatcactcctgccccaatggacaactcggtagacagtggg aaaaaaacagcacctcctctggattcagagctagagcaggaggagccttcccttcccgga atccctgttccctttgggcccggggtccccagacagcccatagggaagcccctctttcgg attcccgcagtgtgggccggccctccacctggactggataaaggggggaaagtgacccct caccacaaggaccattatctcctgctccagccccgggctgcacccccccgccccgacacc agctctccagcctgctcgtccaggatggccgcggccaaggccgagatgcagctgatgtcc ccgctgcagatctctgacccgttcggatcctttcctcactcgcccaccatggacaactac cctaagctggaggagatgatgctgctgagcaacggggctccccagttcctcggcgccgcc ggggccccagagggcagcggcagcaacagcagcagcagcagcagcgggggcggtggaggc ggcgggggcggcagcaacagcagcagcagcagcagcaccttcaaccctcaggcggacacg ggcgagcagccctacgagcacctgaccgcaggaaggaccgtgatccttggccgtggatgt cccggcagcccgggtttgggggcgcgcactagccgcggccatgggggtgctggcgggaat ccctcgcccgcacagccgccgctgcggagcgctgcgagctgcagtggagggggattctcc gtatttgcgtcagctgttgttgaaatgggctctgccactgagtcttttcctgacatctct ctgaacaacgagaaggtgctggtggagaccagttaccccagccaaaccactcgactgccc cccatcacctatactggccgcttttccctggagcctgcacccaacagtggcaacaccttg tggcccgagcccctcttcagcttggtcagtggcctagtgagcatgaccaacccaccggcc tcctcgtcctcagcaccatctccagcggcctcctccgcctccgcctcccagagcccaccc ctgagctgcgcagtgccatccaacgacagcagtcccatttactcagcggcacccaccttc cccacgccgaacactgacattttccctgagccacaaagccaggccttcccgggctcggca gggacagcgctccagtacccgcctcctgcctaccctgccgccaagggtggcttccaggtt cccatgatccccgactacctgtttccacagcagcagggggatctgggcctgggcacccca gaccagaagcccttccagggcctggagagccgcacccagcagccttcgctaacccctctg tctactattaaggcctttgccactcagtcgggctcccaggacctgaaggccctcaatacc agctaccagtcccagctcatcaaacccagccgcatgcgcaagtaccccaaccggcccagc aagacgcccccccacgaacgcccttacgcttgcccagtggagtcctgtgatcgccgcttc tcccgctccgacgagctcacccgccacatccgcatccacacaggccagaagcccttccag tgccgcatctgcatgcgcaacttcagccgcagcgaccacctcaccacccacatccgcacc cacacaggcgaaaagcccttcgcctgcgacatctgtggaagaaagtttgccaggagcgat gaacgcaagaggcataccaagatccacttgcggcagaaggacaagaaagcagacaaaagt gttgtggcctcttcggccacctcctctctctcttcctacccgtccccggttgctacctct tacccgtccccggttactacctcttatccatccccggccaccacctcatacccatcccct gtgcccacctccttctcctctcccggctcctcgacctacccatcccctgtgcacagtggc ttcccctccccgtcggtggccaccacgtactcctctgttccccctgctttcccggcccag gtcagcagcttcccttcctcagctgtcaccaactccttcagcgcctccacagggctttcg gacatgacagcaaccttttctcccaggacaattgaaatttgctaa >gi568815593r:138456040_138675318|GENSCAN_predicted_peptide_2|42_aa MEPFPPEPLRAPSLAAAGGAQGRRGSGVLRGLDACGPGLLLR >gi568815593r:138456040_138675318|GENSCAN_predicted_CDS_2|129_bp atggagccgtttccgccggagcctctgcgggcaccgagcctcgcagccgcgggcggggcg caggggcgcaggggctccggggttctacgcggcctcgacgcctgcgggcccgggctcctg ctcagatag >gi568815593r:138456040_138675318|GENSCAN_predicted_peptide_3|1184_aa MISASRAAAARLVGAAASRGPTAARHQDSWNGLSHEAFRLVSRRDYASEAIKGAVVGIDL GTTNSCVAVMEGKQAKVLENAEGARTTPSVVAFTADGERLVGMPAKRQAVTNPNNTFYAT KRLIGRRYDDPEVQKDIKNVPFKIVRASNGDAWVEAHGKLYSPSQIGAFVLMKMKETAEN YLGHTAKNAVITVPAYFNDSQRQATKDAGQISGLNVLRVINEPTAAALAYGLDKSEDKVI AVYDLGGGTFDISILEIQKGVFEVKSTNGDTFLGGEDFDQALLRHIVKEFKRETGVDLTK DNMALQRVREAAEKAKCELSSSVQTDINLPYLTMDSSGPKHLNMKLTRAQFEGIVTDLIR RTIAPCQKAMQDAEVSKSDIGEVILVGGMTRMPKVQQTVQDLFGRAPSKAVNPDEAVAIG AAIQGGVLAGDVTDVLLLDVTPLSLGIETLGGVFTKLINRNTTIPTKKSQVFSTAADGQT QVEIKVCQGEREMAGDNKLLGQFTLIGIPPAPRGVPQIEVTFDIDANGIVHVSAKDKGTG REQQIVIQSSGGLSKDDIENMVKNAEKYAEEDRRKKERVEAVNMAEGIIHDTETKMEEFK DQLPADECNKLKEEISKMRELLARKDSETGENIRQAASSLQQASLKLFEMAYKKVRVPDR PEAGSSSRPTGQDSSTLFRAAMTHADRSTSSAKKPLSARRFFAASQAPFKPRVGPKCLAS SRKASDVHPIQGEWTEKTTTPGGGGEKMADDPSAADRNVEIWKIKKLIKSLEAARGNGTS MISLIIPPKDQISRVAKMLADEFGTASNIKSRVNRLSVLGAITSVQQRLKLYNKVPPNGL VVYCGTIVTEEGKEKKVNIDFEPFKPINTSLYLCDNKFHTEALTALLSDDSKFGFIVIDG SGALFGTLQGNTREVLHKFTVDLPKKHGRGGQSALRFARLRMEKRHNYVRKVAETAVQLF ISGDKVNVAGLVLAGSADFKTELSQSDMFDQRLQSKVLKLVDISYGGENGFNQAIELSTE VLSNVKFIQEKKLIGRYFDEISQDTGKYCFGVEDTLKALEMGAVEILIVYENLDIMRYVL HCQGTEEEKILYLTPEQEKDKSHFTDKETGQEHELIESMPLLEWFANNYKKFGATLEIVT DKSQEGSQFVKGFGGIGGILRYRVDFQGMEYQGGDDEFFDLDDY >gi568815593r:138456040_138675318|GENSCAN_predicted_CDS_3|3555_bp atgataagtgccagccgagctgcagcagcccgtctcgtgggcgccgcagcctcccggggc cctacggccgcccgccaccaggatagctggaatggccttagtcatgaggcttttagactt gtttcaaggcgggattatgcatcagaagcaatcaagggagcagttgttggtattgatttg ggtactaccaactcctgcgtggcagttatggaaggtaaacaagcaaaggtgctggagaat gccgaaggtgccagaaccaccccttcagttgtggcctttacagcagatggtgagcgactt gttggaatgccggccaagcgacaggctgtcaccaacccaaacaatacattttatgctacc aagcgtctcattggccggcgatatgatgatcctgaagtacagaaagacattaaaaatgtt ccctttaaaattgtccgtgcctccaatggtgatgcctgggttgaggctcatgggaaattg tattctccgagtcagattggagcatttgtgttgatgaagatgaaagagactgcagaaaat tacttggggcacacagcaaaaaatgctgtgatcacagtcccagcttatttcaatgactcg cagagacaggccactaaagatgctggccagatatctggactgaatgtgcttcgggtgatt aatgagcccacagctgctgctcttgcctatggtctagacaaatcagaagacaaagtcatt gctgtatatgatttaggtggtggaacttttgatatttctatcctggaaattcagaaagga gtatttgaggtgaaatccacaaatggggataccttcttaggtggggaagactttgaccag gccttgctacggcacattgtgaaggagttcaagagagagacaggggttgatttgactaaa gacaacatggcacttcagagggtacgggaagctgctgaaaaggctaaatgtgaactctcc tcatctgtgcagactgacatcaatttgccctatcttacaatggattcttctggacccaag catttgaatatgaagttgacccgtgctcaatttgaagggattgtcactgatctaatcaga aggactatcgctccatgccaaaaagctatgcaagatgcagaagtcagcaagagtgacata ggagaagtgattcttgtgggtggcatgactaggatgcccaaggttcagcagactgtacag gatctttttggcagagccccaagtaaagctgtcaatcctgatgaggctgtggccattgga gctgccattcagggaggtgtgttggccggcgatgtcacggatgtgctgctccttgatgtc actcccctgtctctgggtattgaaactctaggaggtgtctttaccaaacttattaatagg aataccactattccaaccaagaagagccaggtattctctactgccgctgatggtcaaacg caagtggaaattaaagtgtgtcagggtgaaagagagatggctggagacaacaaactcctt ggacagtttactttgattggaattccaccagcccctcgtggagttcctcagattgaagtt acatttgacattgatgccaatgggatagtacatgtttctgctaaagataaaggcacagga cgtgagcagcagattgtaatccagtcttctggtggattaagcaaagatgatattgaaaat atggttaaaaatgcagagaaatatgctgaagaagaccggcgaaagaaggaacgagttgaa gcagttaatatggctgaaggaatcattcacgacacagaaaccaagatggaagaattcaag gaccaattacctgctgatgagtgcaacaagctgaaagaagagatttccaaaatgagggag ctcctggctagaaaagacagcgaaacaggagaaaatattagacaggcagcatcctctctt cagcaggcatcactgaagctgttcgaaatggcatacaaaaaggtgagggtcccggaccgc ccggaggcaggttcctcatccagacctactggccaggactcctccacactattccgtgca gccatgacgcacgcagaccgcagcacttcgtcagcgaagaagccgctgagcgcgaggcgc ttcttcgcggcttctcaagcaccgttcaaaccgcgcgtcgggcccaagtgcctagcttcg agtcgcaaagcctcagatgtccaccctattcaaggagaatggactgaaaagacgacgacc ccgggaggaggaggcgagaagatggcggacgaccccagtgctgccgacaggaacgtggag atctggaagatcaagaagctcattaagagcttggaggcggcccgcggcaatggcaccagc atgatatcattgatcattcctcccaaagaccagatttcacgagtggcaaaaatgttagcg gatgagtttggaactgcatctaacattaagtcacgagtaaaccgcctttcagtcctggga gccattacatctgtacaacaaagactcaaactttataacaaagtacctccaaatggtctg gttgtatactgtggaacaattgtaacagaagaaggaaaggaaaagaaagtcaacattgac tttgaacctttcaaaccaattaatacgtcattgtatttgtgtgacaacaaattccataca gaggctcttacagcactactttcagatgatagcaagtttggattcattgtaatagatggt agtggtgcactttttggcacactccaaggaaacacaagagaagtcctgcacaaattcact gtggatctcccaaagaaacacggtagaggaggtcagtcagccttgcgttttgcccgttta agaatggaaaagcgacataactatgttcggaaagtagcagagactgctgtgcagctgttt atttctggggacaaagtgaatgtggctggtctagttttagctggatccgctgactttaaa actgaactaagtcaatctgatatgtttgatcagaggttacaatcaaaagttttaaaatta gttgatatatcctatggtggtgaaaatggattcaaccaagctattgagttatctactgaa gtcctctccaacgtgaaattcattcaagagaagaaattaataggacgatactttgatgaa atcagccaggacacgggcaagtactgttttggcgttgaagatacactaaaggctttggaa atgggagctgtagaaattctaatagtctatgaaaatctggatataatgagatatgttctt cattgccaaggcacagaagaggagaaaattctctatctaactccagagcaagaaaaggat aaatctcatttcacagacaaagagaccggacaggaacatgagcttatcgagagcatgccc ctgttggaatggtttgctaacaactataaaaaatttggagctacgttggaaattgtcaca gataaatcacaagaagggtctcagtttgtgaaaggatttggtggaattggaggtatcttg cggtaccgagtagatttccagggaatggaataccaaggaggagacgatgaattttttgac cttgatgactactag >gi568815593r:138456040_138675318|GENSCAN_predicted_peptide_4|333_aa MGYSLKAQGNEPESKNIKKSIGGEYVNSYPFKVGGVDVIAFDKMAEYLYFIMMKRLLGLP VQSQSLPCKEFKGSDLELLSSQQMESQSSVCCWAYLGWENLRLILLTCRSQGVLLEPWEE GVGQSLALPLVVPVLEVPVVSESMHPRPSTQQHSLAQQRVDEISGDGTHDNGVLLGESVL RQMKFRESLSLHLLFPRFSTCVLGDQQHRVEAKPVGISYVDIEVLKKLDRNKKLLNKYDA FLASGSLIKQIPQILGPGLNKEGRFPSLLMYSEDMVAKVDEGKSTIKFQMTDDELVYNVH LAVNFLVSLLRKNWQNIWALHIKNTIGKPQCLY >gi568815593r:138456040_138675318|GENSCAN_predicted_CDS_4|1002_bp atgggctacagcctgaaggctcaaggaaatgaacctgaatctaaaaacataaagaagtcc attggtggtgaatatgtgaacagctaccccttcaaggtgggaggggtagatgtcatcgcc tttgacaaaatggctgaatatctttactttatcatgatgaaaagacttcttggtcttcca gttcagagccagagcctgccttgcaaggaattcaaaggctctgatctggagctgctgtca agccagcagatggagtcacagtcctctgtgtgctgctgggcctacctgggctgggagaat ctcaggctaattctgctgacctgcagaagccagggtgtgctgctggagccctgggaggag ggggttgggcagtcgctggcgctgccgctggtggtccctgtgcttgaagtgccggtggta tcggagtccatgcacccaaggcccagcacccagcaacattctctggcccagcagagggtt gatgagatttctggggacgggactcatgacaatggagttcttcttggagaatctgtcctt aggcagatgaagttcagagaaagcctctccctgcatttgctgtttcccaggttctccacg tgtgtcctgggggaccagcagcacagggttgaggccaagcctgtgggtatctcctacgtg gacatcgaggtgctgaagaaactcgacaggaacaagaagctgctcaacaagtatgatgcc tttctggcctcagggtctctgatcaagcagatcccacaaatcctgggcccaggcctaaat aaggaaggcaggttcccttccctgctgatgtacagtgaagacatggtggccaaagttgat gaggggaagtccacaatcaagttccagatgacagatgatgagcttgtgtacaacgtccat ctggctgtcaacttcctggtatcattgctcaggaagaattggcagaacatctgggcttta cacatcaagaacaccataggcaagccccagtgcctgtattaa >gi568815593r:138456040_138675318|GENSCAN_predicted_peptide_5|108_aa MEGRAGQAVLLTLDGSPPILLVRKTAAGFEFLVQRDILVLSGDEKRVRQAPLVHHNQLHA GFNLFMRDLAVPHPQAPEFTPLLFAPGLLCWQETAWLSCLGSPVVQGS >gi568815593r:138456040_138675318|GENSCAN_predicted_CDS_5|327_bp atggaggggagggcagggcaagctgttctgctgacactggatggctcccctccgatcctc ctggtcaggaagaccgcggcaggctttgagttcctggttcagagggacatattagtgctt agtggtgatgagaaacgggtcagacaggcacctcttgtgcaccacaaccagctccacgct ggcttcaacctgttcatgcgtgatctggcagtgccacacccgcaggcccctgaatttacg cccctgctctttgccccagggcttctctgttggcaggaaactgcctggctttcctgcttg ggcagcccagtggtgcagggcagttaa