GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:56:00 Sequence gi568815587f:4486891_4594038 : 107148 bp : 40.99% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 2011 2955 945 0 0 82 37 563 0.800 46.89 1.02 PlyA + 4799 4804 6 1.05 2.09 PlyA - 4994 4989 6 1.05 2.08 Term - 6940 6884 57 1 0 62 45 86 0.682 -1.59 2.07 Intr - 8059 7622 438 1 0 -7 87 177 0.163 0.89 2.06 Intr - 8891 8328 564 2 0 34 88 259 0.234 12.56 2.05 Intr - 11187 11072 116 1 2 59 82 97 0.112 5.45 2.04 Intr - 19581 19465 117 1 0 57 50 83 0.028 0.92 2.03 Intr - 24539 24406 134 1 2 69 37 97 0.245 2.07 2.02 Intr - 28355 28056 300 1 0 73 62 204 0.629 11.22 2.01 Init - 29034 28409 626 0 2 19 27 305 0.340 12.36 2.00 Prom - 29804 29765 40 -5.85 3.00 Prom + 33111 33150 40 -6.05 3.01 Init + 36325 36404 80 0 2 75 98 21 0.203 2.40 3.02 Intr + 44586 44691 106 0 1 24 84 95 0.883 2.00 3.03 Term + 45652 45810 159 0 0 48 42 147 0.804 3.06 3.04 PlyA + 45827 45832 6 1.05 4.00 Prom + 50491 50530 40 -3.65 4.01 Init + 55668 55809 142 2 1 77 95 63 0.777 6.24 4.02 Term + 58362 59254 893 1 2 59 38 592 0.754 42.90 4.03 PlyA + 62223 62228 6 1.05 5.00 Prom + 62387 62426 40 -12.03 5.01 Init + 63388 63549 162 0 0 81 80 72 0.518 5.53 5.02 Intr + 63790 63886 97 0 1 65 29 145 0.408 4.96 5.03 Intr + 66482 66696 215 0 2 68 50 160 0.415 7.71 5.04 Intr + 69733 69811 79 0 1 75 97 88 0.251 6.61 5.05 Intr + 71577 71692 116 1 2 64 47 74 0.091 0.15 5.06 Intr + 76558 76701 144 0 0 45 107 34 0.009 0.56 5.07 Term + 80865 81008 144 2 0 61 47 177 0.951 7.83 5.08 PlyA + 81456 81461 6 1.05 6.05 PlyA - 81634 81629 6 1.05 6.04 Term - 84739 84533 207 1 0 63 41 138 0.882 2.96 6.03 Intr - 85404 85266 139 2 1 39 72 60 0.497 -0.95 6.02 Intr - 86582 86416 167 2 2 21 93 114 0.523 3.04 6.01 Init - 90930 90790 141 0 0 97 84 106 0.990 11.29 6.00 Prom - 93857 93818 40 -3.95 7.00 Prom + 97344 97383 40 -6.55 7.01 Sngl + 100028 100975 948 1 0 68 47 454 0.857 33.61 7.02 PlyA + 103273 103278 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:4486891_4594038|GENSCAN_predicted_peptide_1|314_aa MLPSNITSTHPAVFLLVGIPGLEHLHAWISIPFCFAYTLALLGNCTLLFIIQADAALHEP MYLFLAMLATIDLVLSSTTLPKMLAIFWFRDQEINFFACLVQMFFLHSFSIMESAVLLAM AFDRYVAICKPLHYTTVLTGSLITKIGMAAVARAVTLMTPLPFLLRRFHYCRGPVIAHCY CEHMAVVRLACGDTSFNNIYGIAVAMFIVVLDLLFVILSYVFILQAVLQLASQEARYKAF GTCVSHIGAILSTYTPVVISSVMHRVARHAAPRVHILLAIFYLLFPPMVNPIIYGVKTKQ IREYVLSLFQRKNM >gi568815587f:4486891_4594038|GENSCAN_predicted_CDS_1|945_bp atgcttccctctaatatcacctcaacacatccagctgtctttttgttggtaggaattcct ggtttggaacacctgcatgcctggatctccatccccttctgctttgcttatactctggcc ctgctaggcaactgtacccttctcttcattatccaggctgatgcagccctccatgaaccc atgtacctctttctggccatgttggcaaccattgacttggttctttcttctacaacgctg cccaaaatgcttgccatattctggttcagggatcaggagatcaacttctttgcctgtctg gtccagatgttcttccttcactccttctccatcatggagtcagcagtgctgctggccatg gcctttgaccgctatgtggccatctgcaagccattgcactacacgacggtcctgactggg tccctcatcaccaagattggcatggctgctgtggcccgggctgtgacactaatgactcca ctccccttcctgctcagacgcttccactactgccgaggcccagtgattgcccattgctac tgtgaacacatggctgtggtaaggctggcgtgtggggacactagcttcaacaatatctat ggcattgctgtggccatgtttattgtggtgttggacctgctctttgttatcctgtcttat gtcttcatccttcaggcagttctccagcttgcctctcaggaggcccgctacaaggcattt gggacatgtgtgtctcacataggtgccatcctgtccacctacactccagtagtcatctct tcagtcatgcaccgtgtagcccgccatgctgcccctcgtgtccacatactccttgctatt ttctatctccttttcccacccatggtcaatcctatcatatatggagtcaagaccaagcag attcgtgagtatgtgctcagtctattccagagaaagaacatgtag >gi568815587f:4486891_4594038|GENSCAN_predicted_peptide_2|783_aa MLTPNNACSVPTSFRLTGIPGLESLHIWLSIPFGSMYLVAVLGNITILAVVRMEYSLHQP MYFFLCMLAVIDLVLSTSTMPKLLAIFWFGAHNIGVNACLAQMFFIHCFATVESGIFLAM AFDHYVAICDPLHHTLLLTHAVVGRLGLAALLRGVIYIGPLPLVICLRLPLYHTQIIAHS YCEHMAVVTLACGVTQGSTTYMEWGLAFWAVMGLATSEARLKTLGTCGSHICAILVFYIP IAVSSLTHRFGHRVPPHIHIHIHIHIHIHILLANIYLLIPPILNPIVYAVRTKQIREALL HIKARTQTRTLCKLASMIIVALSQVPHLAVPQERSEFTPQHWESWEPQLEGVGVCDSSHK TKDKNHMIIPIDAEKAFDKIQHPFMLKTLSTEGNPVQYSQFREVGMITDPVRDNSASEKE KSRLGLQGKIKIKSSQKQKAKKRQKLKGKLCRLTHKFHDLNIMTQSHRVTYNTSHSTPPT WGQIKVLSHQTEKLLQEKGIPKTGNIILAAFMVVSVVVSIPAVGATQNYASCTYVPFPPL IQSVSWMDSSVEVYTNNSAFMPAPNDDSFPAQPEGDVHFNLSIGYKYPPLCIGMSPGCLA YSYQNWMWTIPSLSNDSYQPKYDMLRVYLLEYVEDGWQSYRLREWVAPYPFKLMYTGIIP PRPKMIHPIVTPEHPELWKLAAAMTGIRLWNTTYQLFATNTKTPIFNITLMSKRVIPIMR CVKPPHMLLVGNTIIIPNTQTIECDNCKLFTCIDATFNPRTSILLPADASKESEEKSEVN DKQ >gi568815587f:4486891_4594038|GENSCAN_predicted_CDS_2|2352_bp atgttaacccctaataatgcctgctccgtgcctacctctttccggctcactggcatccct ggcctggaatccctgcacatctggctctccatcccctttggctccatgtacctggtagct gtgctggggaacataaccatcctggcagtggtaaggatggagtacagcctgcatcagccc atgtacttcttcctgtgcatgttggctgtcattgacttggtcctgtcaacctctaccatg cccaaactactggccatcttctggtttggtgcccacaacattggtgttaatgcctgtttg gcccagatgttcttcattcattgctttgccactgttgagtcaggcatcttccttgccatg gcttttgatcactatgtggccatctgtgacccactgcatcataccttgttgctcacccat gctgtggtgggtcgtttggggctggctgccctcctccggggggtaatctacattggacct ctgcccctagtgatttgtctgaggttgcccctttaccacacccaaatcattgcccattcg tactgtgagcacatggctgtggtcaccttggcatgtggtgtgacacaagggtcaacaact tatatggaatggggattggctttctgggctgtaatgggcttggccacctctgaagccagg cttaaaaccttagggacatgtggctctcacatctgtgccatcctcgtcttctacatcccc attgctgtttcctctctcacacaccgctttggccatcgtgtgcctccccatatccatatc catatccatatccatatccatatccatatccttttggccaacatttacctcctcatccca cctatcctcaacccaatagtctatgctgtccgcacaaagcagatccgagaggctcttctc catattaaggcaaggactcaaaccaggactctgtgcaaactagccagtatgatcattgtt gctcttagccaagttcctcatctcgctgtgccccaggagcgctcagaattcaccccacag cactgggagtcttgggagcctcagcttgagggtgttggggtatgtgattcatcacataaa actaaagacaaaaaccacatgattatcccaatagatgcagaaaaggcttttgataaaatt caacatcccttcatgttaaaaactctaagtactgaagggaatccagtacaatatagtcag ttcagagaagttggcatgattacggatcctgtgagggacaattctgcttcagagaaagag aaaagcaggctagggctccaggggaaaatcaagataaaaagctctcagaaacaaaaggcg aagaaacgtcagaaattgaaaggcaagctttgccgcctgacacataaattccatgacctc aacattatgacacaatctcaccgtgtgacctacaataccagtcattctactccaccaaca tggggtcaaataaaggtcttatcacatcaaacggaaaaattactacaagaaaaaggaatt ccaaaaacaggtaatataattctggctgcctttatggtagtcagtgtagtggtgagtata ccagcagttggggcaactcaaaattatgcttcttgcacatatgttccttttcccccttta attcagtctgtctcctggatggactcctcagtagaagtttacactaataatagtgccttc atgccagcccctaatgatgacagctttccagctcaaccagaaggagatgtgcactttaat ttgtcaatcggctataaatatccaccactgtgcattggaatgtcacctggctgtttagct tattcttatcagaattggatgtggaccataccatccttgagtaatgattcttatcagcca aaatatgacatgctcagagtttatttgttagaatatgtagaggacggatggcagtcctac aggttgagggaatgggtagctccttacccatttaaattgatgtacacaggcatcattcct cctagaccaaaaatgattcatccaattgttaccccagaacatcctgaattatggaaatta gctgcagccatgacaggaataaggctatggaacactacctatcaactctttgctactaat accaagacacccatattcaacatcaccttgatgtctaaacgggtgatacctatcatgaga tgtgtcaaaccccctcatatgctgttggttggaaatacaattatcattcccaatacacaa actatagaatgtgataactgtaagctgttcacgtgcattgatgctacttttaatcccaga acaagtattctcttgcctgcagatgcctccaaagagtctgaggaaaagtccgaagtcaat gacaagcaatga >gi568815587f:4486891_4594038|GENSCAN_predicted_peptide_3|114_aa MLPSGKTQPITSCGSYGAKLPFEKSRSSQSDECGRWVISAFPTEVSGSSHRDWLDSGYSP RRITAPCQQWNKAGWRMTDELTEVGFRKSVITNFSELKEDVQTHRKEAKTLKKD >gi568815587f:4486891_4594038|GENSCAN_predicted_CDS_3|345_bp atgctgccttcaggtaagactcagcccattaccagctgtggtagctatggggcaaaactc ccatttgagaaaagcagaagctcccagagtgatgagtgcggacgatgggtgatttctgca tttccaactgaggtatctggttcatctcaccgggactggttggacagtgggtacagccca cggaggatcacagcgccttgccagcaatggaacaaagcaggatggagaatgactgatgag ttgacagaagtaggcttcagaaagtcggtaataacaaacttctctgagctaaaggaggat gttcaaacccatcgcaaggaagctaagaccttgaaaaaagattag >gi568815587f:4486891_4594038|GENSCAN_predicted_peptide_4|344_aa MAGTRSVIPETPIGFHEKSLPLVSTIHQAIQITDETSSHLVSSVICAWLESLHVWLSIPF GSMYLVAVVGNVTILAVVKIERSLHQPMYFFLCMLAAIDLVLSTSTIPKLLGIFWFGACD IGLDACLGQMFLIHCFATVESGIFLAMAFDRYVAICNPLRHSMVLTYTVVGRLGLVSLLR GVLYIGPLPLMIRLRLPLYKTHVISHSYCEHMAVVALTCGDSRVNNVYGLSIGFLVLILD SVAIAASYVMIFRAVMGLATPEARLKTLGTCASHLCAILIFYVPIAVSSLIHRFGQCVPP PVHTLLANFYLLIPPILNPIVYAVRTKQIRESLLQIPRIEMKIR >gi568815587f:4486891_4594038|GENSCAN_predicted_CDS_4|1035_bp atggctggaacaaggtctgtgataccagagacacctattggatttcatgaaaagtcactt cctcttgtgtccacaattcaccaagcaattcagatcacagatgagaccagttcacacttg gtctcatctgtgatttgtgcctggctggagtccctacacgtctggctctccatccccttt ggctccatgtacctggtggctgtggtggggaatgtgaccatcctggctgtggtaaagata gaacgcagcctgcaccagcccatgtactttttcttgtgcatgttggctgccattgacctg gttctgtctacttccactatacccaaacttctgggaatcttctggttcggtgcttgtgac attggcctggacgcctgcttgggccaaatgttccttatccactgctttgccactgttgag tcaggcatcttccttgccatggcttttgatcgctacgtggccatctgcaacccactacgt catagcatggtgctcacttatacagtggtgggtcgtttggggcttgtttctctcctccgg ggtgttctctacattggacctctgcctctgatgatccgcctgcggctgcccctttataaa acccatgttatctcccactcctactgtgagcacatggctgtagttgccttgacatgtggc gacagcagggtcaataatgtctatgggctgagcatcggctttctggtgttgatcctggac tcagtggctattgctgcatcctatgtgatgattttcagggccgtgatggggttagccact cctgaggctaggcttaaaaccctggggacatgcgcttctcacctctgtgccatcctgatc ttttatgttcccattgctgtttcttccctgattcaccgatttggtcagtgtgtgcctcct ccagtccacactctgctggccaacttctatctcctcattcctccaatcctcaatcccatt gtctatgctgttcgcaccaagcagatccgagagagccttctccaaataccaaggatagaa atgaagattagatga >gi568815587f:4486891_4594038|GENSCAN_predicted_peptide_5|318_aa MAWEGMPLRMLKLQYPECFGLSEVTHFSAVKGNTLVQQQVFPQELLSPLLAARMTPQADE EMNRRMAEQQRGEKQHLNAESSLAGGVKEQLIMKDKFITQAAPDIRRTLQKWALGPDGTL EDLWKAATSVFYNKDGETQKQKQLLWPPSKPTNPRIPKTSTIPVQVYEMEVVNLGPSLTK PESITREAHWSYEDAPHICLKEIHAVQEQTVVAMKTCLSDLNAYWKFWLGQSGKRKKNGI QIEKKSNCLFADDMIVNLENPIVSASKLLKLHSFTLLVGAGKSVVPVHYLIPPDYVFYPK CNGTKQTFEKQVLQKQLN >gi568815587f:4486891_4594038|GENSCAN_predicted_CDS_5|957_bp atggcatgggaagggatgcccttgaggatgttgaagctgcaataccctgaatgctttggg ctttcagaggtgacccacttttctgcagtaaaaggtaacactttggtgcagcaacaggta ttccctcaggagctgctctcacctcttctggctgccaggatgaccccacaagcagatgag gagatgaacagaagaatggcagaacagcagagaggagagaagcagcatctgaacgctgaa agcagtttggctgggggtgtcaaggaacaactaattatgaaggataagtttattactcag gcagcccctgatatcaggaggacattgcagaaatgggccctgggaccagacggtacttta gaggacctctggaaagcggctacctcagtcttttacaataaagatggggagacacaaaag cagaagcagcttctgtggccaccaagcaagcccacaaaccccagaattcccaagacttcc actatacctgtccaggtctatgagatggaagtggttaatctgggaccaagcctgacaaaa ccagagagcataacaagggaggcccactggagctatgaagatgctccccacatctgtttg aaggagatccatgcagtacaagaacagactgtggtggccatgaaaacatgcctttcagat ttgaatgcatattggaagttctggctagggcaatcaggcaagaggaagaaaaacggtatt caaatagaaaagaagtcaaattgtctgtttgcagatgacatgattgtgaatttagaaaac cccatagtctcagcctcaaaactccttaagctgcactcatttacccttctcgttggtgct gggaagtctgtggtgcccgtgcattacctgataccacctgactatgtgttctatcctaaa tgcaacgggacaaaacagacatttgagaaacaagtacttcagaagcagctgaattga >gi568815587f:4486891_4594038|GENSCAN_predicted_peptide_6|217_aa MALVQALVPREREPKLSILQMDRGDPQHSSHWCPEREKVKLLTLKPRETSKNILINFYRA FNLDKDVFIHQANHPLTVPSSVVMGDNGHTLAEDDKRPCFRVLPCYLERVSSGISISWIS APLPVGAMKHQLLCDLMDLITLSFWLAGQCMSLKATNMQHCKCSIATSDWAIELDRTDYK TLPSEYSILALLQVFAGKNCMDRVLLHVDVNYLKSLP >gi568815587f:4486891_4594038|GENSCAN_predicted_CDS_6|654_bp atggctctagtgcaggcgctggtgcccagagaaagagagccaaagctgtccattttgcag atggacagaggggacccacaacacagctcacactggtgcccagagagagaaaaagttaag ctgctgaccctgaagccaagggaaacatccaaaaatatccttataaacttctacagagcc tttaacttagataaggatgtgttcatacaccaggccaaccacccactaacagtgccgtct agtgtggtgatgggtgataacggccacaccttggcagaagatgacaaaagaccttgcttt agggtactaccttgctatctggaacgtgtttccagtggaatctctatttcctggatctca gcccctctacctgtgggagcaatgaaacaccagcttctatgtgacttaatggatcttatc acgctttctttctggttagcaggtcagtgtatgagccttaaagctaccaacatgcaacac tgcaagtgttcaatagccacaagtgactgggccattgaattggaccgcactgattacaaa acacttccatcagaatattccatcctagcactgctccaggtctttgcagggaaaaactgt atggatcgtgttctacttcatgtggatgtgaactacttgaagtccctgccctga >gi568815587f:4486891_4594038|GENSCAN_predicted_peptide_7|315_aa METPASFLLVGIPGLQSSHLWLAISLSAMYIIALLGNTIIVTAIWMDSTRHEPMYCFLCV LAAVDIVMASSVVPKMVSIFCSGDSSISFSACFTQMFFVHLATAVETGLLLTMAFDRYVA ICKPLHYKRILTPQVMLGMSMAITIRAIIAITPLSWMVSHLPFCGSNVVVHSYCEHIALA RLACADPVPSSLYSLIGSSLMVGSDVAFIAASYILILKAVFGLSSKTAQLKALSTCGSHV GVMALYYLPGMASIYAAWLGQDVVPLHTQVLLADLYVIIPATLNPIIYGMRTKQLRERIW SYLMHVLFDHSNLGS >gi568815587f:4486891_4594038|GENSCAN_predicted_CDS_7|948_bp atggaaacccctgcctccttcctccttgtgggtatcccaggactgcaatcttcacatctt tggctggctatctcactgagtgccatgtacatcatagccctgttaggaaacaccatcatc gtgactgcaatctggatggattccactcggcatgagcccatgtattgctttctgtgtgtt ctggctgctgtggacattgttatggcctcctcggtggtacccaagatggtgagcatcttc tgctcaggagacagctcaatcagctttagtgcttgtttcactcagatgttttttgtccac ttagccacagctgtggagacggggctgctgctgaccatggcttttgaccgctatgtagcc atctgcaagcctctacactacaagagaattctcacgcctcaagtgatgctgggaatgagt atggccatcaccatcagagctatcatagccataactccactgagttggatggtgagtcat ctacctttctgtggctccaatgtggttgtccactcctactgtgagcacatagctttggcc aggttagcatgtgctgaccccgtgcccagcagtctctacagtctgattggttcctctctt atggtgggctctgatgtggccttcattgctgcctcctatatcttaattctcaaggcagta tttggtctctcctcaaagactgctcagttgaaagcattaagcacatgtggctcccatgtg ggggttatggctttgtactatctacctgggatggcatccatctatgcggcctggttgggg caggatgtagtgcccttgcacacccaagtcctgctagctgacctgtacgtgatcatccca gccaccttaaatcccatcatctatggcatgaggaccaaacaactgcgggagagaatatgg agttatctgatgcatgtcctctttgaccattccaacctgggttcatga