GENSCAN 1.0 Date run: 8-Nov-116 Time: 07:24:16 Sequence gi568815588r:30511958_30729705 : 217748 bp : 43.88% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3396 3546 151 2 1 76 77 68 0.706 4.71 1.02 Term + 17576 17766 191 0 2 71 47 156 0.127 7.41 1.03 PlyA + 21163 21168 6 1.05 2.03 PlyA - 23215 23210 6 1.05 2.02 Term - 41388 41170 219 0 0 115 55 107 0.948 7.14 2.01 Init - 42249 42163 87 0 0 58 46 72 0.455 0.64 2.00 Prom - 44824 44785 40 -2.36 3.00 Prom + 44897 44936 40 -7.26 3.01 Init + 47304 47371 68 2 2 70 87 25 0.831 1.15 3.02 Intr + 52232 52455 224 2 2 104 47 86 0.541 3.87 3.03 Intr + 60662 60781 120 0 0 44 87 153 0.789 11.27 3.04 Intr + 74564 74653 90 0 0 70 77 55 0.469 2.57 3.05 Intr + 77648 77796 149 0 2 103 -13 51 0.232 -3.55 3.06 Term + 77844 77996 153 2 0 78 43 105 0.329 2.92 3.07 PlyA + 79056 79061 6 1.05 4.06 PlyA - 79131 79126 6 1.05 4.05 Term - 81556 81466 91 0 1 73 48 122 0.491 3.89 4.04 Intr - 84189 84142 48 2 0 102 113 10 0.275 2.70 4.03 Intr - 100943 100865 79 0 1 104 91 75 0.901 8.01 4.02 Intr - 102288 102212 77 2 2 97 19 12 0.373 -5.64 4.01 Init - 102609 102503 107 0 2 89 61 93 0.714 5.25 4.00 Prom - 104014 103975 40 -6.06 5.00 Prom + 104605 104644 40 -6.36 5.01 Sngl + 105947 107230 1284 1 0 70 44 303 0.867 20.07 5.02 PlyA + 107871 107876 6 1.05 6.07 PlyA - 108208 108203 6 1.05 6.06 Term - 114112 114063 50 2 2 77 36 42 0.473 -4.63 6.05 Intr - 114306 114148 159 1 0 103 94 344 0.860 36.46 6.04 Intr - 114983 114820 164 1 2 102 37 135 0.803 9.42 6.03 Intr - 116758 116667 92 1 2 65 55 66 0.507 -0.21 6.02 Intr - 125571 125408 164 1 2 73 73 35 0.358 0.19 6.01 Init - 126203 126059 145 2 1 57 7 123 0.194 -0.59 6.00 Prom - 126726 126687 40 -0.36 7.00 Prom + 132110 132149 40 -3.96 7.01 Init + 135497 135595 99 1 0 84 82 34 0.294 2.66 7.02 Intr + 151599 151683 85 2 1 65 116 34 0.484 3.29 7.03 Intr + 153493 153527 35 2 2 82 91 -7 0.292 -3.06 7.04 Intr + 159609 159722 114 2 0 86 87 53 0.873 5.64 7.05 Intr + 163562 163645 84 1 0 74 94 34 0.440 2.62 7.06 Intr + 164171 164248 78 1 0 102 76 24 0.566 2.35 7.07 Intr + 173305 173452 148 0 1 45 86 157 0.648 11.01 7.08 Intr + 177282 177371 90 1 0 114 115 -20 0.852 3.27 7.09 Intr + 179117 179294 178 0 1 69 89 362 0.945 33.38 7.10 Intr + 181539 181832 294 0 0 87 22 214 0.908 10.82 7.11 Intr + 183981 184120 140 0 2 65 63 210 0.920 16.31 7.12 Intr + 185419 185559 141 2 0 51 10 114 0.458 0.22 7.13 Intr + 186413 186558 146 0 2 83 77 113 0.957 9.70 7.14 Intr + 188130 188305 176 2 2 86 79 163 0.999 13.94 7.15 Intr + 191565 191709 145 0 1 99 61 57 0.997 4.38 7.16 Intr + 192456 192732 277 2 1 97 92 445 0.987 42.89 7.17 Intr + 197341 197496 156 2 0 85 110 158 0.614 17.68 7.18 Intr + 199405 199602 198 2 0 96 40 114 0.383 6.72 7.19 Intr + 204573 204616 44 1 2 75 62 3 0.289 -5.64 7.20 Intr + 204970 205125 156 0 0 125 99 149 0.987 19.91 7.21 Intr + 211252 211379 128 0 2 74 68 61 0.021 2.18 7.22 Term + 214894 215083 190 1 1 59 45 97 0.054 -0.58 7.23 PlyA + 216649 216654 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:30511958_30729705|GENSCAN_predicted_peptide_1|113_aa MGLLQWTCYIRPGRPPADIGRPRRTPCLQGDKECGLGEHQYCWTLQARTVGNSGKARMLL WLCGQPMTRASSTFREICECCPSRATVAHHRHFLSSYPAPRLSEAASGWVRLA >gi568815588r:30511958_30729705|GENSCAN_predicted_CDS_1|342_bp atgggcctgctgcaatggacttgctatataagaccaggacgaccaccagccgacattggg aggccccggaggacaccctgcttacaaggggataaggaatgtgggttaggggagcaccag tactgctggactctgcaggccaggacggttggtaactcgggcaaagcacgcatgctgctg tggctctgtggtcagcccatgacgagagcctcgtccacgttccgtgaaatctgtgagtgc tgcccgagtcgagctaccgttgctcaccaccgccacttcctgtcttcctacccagcgccg cggctctcggaggctgccagtggctgggtgcgcctggcctga >gi568815588r:30511958_30729705|GENSCAN_predicted_peptide_2|101_aa MAKRRLDMEGKLQLDEFNQLSLHRMKPATDRGPQPPGHRPVLAYQELGHTAGGEQQQASK ALSELTAAPHHLHYCLSSTSCRISGSIRFSWECKTYCELNI >gi568815588r:30511958_30729705|GENSCAN_predicted_CDS_2|306_bp atggccaaaaggagactagacatggaaggcaaattgcagttagatgaatttaaccagctt tctctacaccgcatgaagccagcgactgacaggggtccccaacccccaggccatagacca gtactggcctatcaggaactgggccacacagcaggaggtgagcagcagcaagcaagcaaa gctttatctgaattaacagccgctccgcatcacttgcattactgcctgagctccacctcc tgtcggatcagcggcagcattagattctcatgggagtgcaaaacctattgtgaactgaac atctga >gi568815588r:30511958_30729705|GENSCAN_predicted_peptide_3|267_aa MSTNRQSGGVYLVYDLPLGTLNWQGAEFNKPVGEPSTCTHILNITGESNKGNCSALHIPR ADVWWYCGKRDLHDLLPSNWICTCALVQLAIPFILAFYKCILILILIIIIGTKTGIDWAL TTCREYAMSSAWSPDKIEIYLLMVLLAEKSKTEGSASGEGFLAVSAKGSGLGRPKRCVTG LRAGTWEEVLRSSASQDASSRFAASFEDSGHTGPSTAHQHPPEEPPRGENGTTNRFSYES VDPENIPELISNISSKGLELQGLEQSI >gi568815588r:30511958_30729705|GENSCAN_predicted_CDS_3|804_bp atgagcactaacaggcaaagtggtggcgtttacctggtttatgaccttcctctaggaaca ctcaattggcagggggcagagttcaataagcctgtgggagaaccctcgacttgtacccac atcctaaacattactggtgagtcaaacaaaggcaactgctcagctcttcatataccccgg gctgatgtctggtggtattgtggaaagagggacctccatgacctattaccatccaactgg atctgcacttgtgccttagttcaactggccattccattcatcctggcattctacaaatgc atcctcatcctcatcctcatcattatcatcggcaccaaaactggtattgactgggcactc actacatgcagagagtatgccatgtcctcagcgtggagtcctgataagatagaaatttat ttgctcatggttctgttggctgagaagtccaagactgagggatcagcatctggtgagggc tttcttgctgtgtcagccaaaggaagtggccttgggcggcccaagcgctgtgttactggg ctgcgggccggtacttgggaagaagtcctccgcagttcagcatcccaggatgcatctagc agatttgctgcctcatttgaggactcaggacacacaggtccctccactgctcaccaacac cccccagaggagcctcccaggggagaaaatggaaccaccaacaggttctcctatgagagc gtggatccagagaatatccctgagcttatttcaaatattagttccaaagggctagaactg cagggcttagaacaatcaatataa >gi568815588r:30511958_30729705|GENSCAN_predicted_peptide_4|133_aa MEMGLPDPLLREDMLPPLQLCDEQTVPATCFRIGFRCGLLKISGISNPSQHQLLEDSTCS RTLVTDDLTDAIICAKKIVKETQGMNYCSLERPHGVETGFLSPRGHNFHMWVPYTHHQGH RLQTTLIPDDRNF >gi568815588r:30511958_30729705|GENSCAN_predicted_CDS_4|402_bp atggagatgggcctcccagatcccctgctaagggaagacatgctgcccccgctgcagttg tgtgatgagcagacagtacctgccacctgcttcaggattggcttcaggtgtggattgctg aaaatatctggcatttcaaacccatctcagcatcagctcctggaggactcaacttgcagt agaaccttggtcactgatgacctcacagatgcgattatctgtgccaagaaaattgttaaa gagacacaaggaatgaactattgctcacttgaacgtcctcatggtgtggaaactggcttc ctttccccacgaggccacaacttccacatgtgggtcccctacactcatcaccagggacac aggctccagaccaccctgatccctgatgaccgcaacttctga >gi568815588r:30511958_30729705|GENSCAN_predicted_peptide_5|427_aa MTVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS KRIKYLGIQLTRDVKDLFKENYKPLLKEVKEDTNKWKNIPCSWVGRINIVKMAILPKVIY RFNAIPIKLSMTFFTELEKTTLKFIWNQKRARIVKSILSQKNKAGGITLPDFQLYYKATV TKTAWYWYQNRDIDQWNRTEPSEIILHIYNYVIFDKPDKSKKWGKNSLFNKWCWENWQAI CRKLKLDPFLTPYTKINSRWTKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKTQKAMA TKAKIDKWDLIKLKSFCTTKETTIRVNRQPTEWEKIFAIYSSDKRLISRIYKELKQIYKK KTNNPTNKWAKYMNRHFSKEDIYAANRHMKKCSSSLAIREMQIQTTMRYHLTSVRMAIIK KSGNNRC >gi568815588r:30511958_30729705|GENSCAN_predicted_CDS_5|1284_bp atgactgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatac accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggag aactacaaaccactgctcaaggaagtaaaagaggatacaaacaaatggaagaacattcca tgttcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttat agattcaatgccatccccatcaagctatcaatgactttcttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcccgcattgtcaagtcaatcctaagccaa aagaacaaagctggaggcatcacgctacctgacttccaactatactacaaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagag ccctcagaaataatactgcatatctacaactatgtgatctttgacaaacctgacaaaagc aagaaatggggaaagaattctctatttaataaatggtgctgggaaaattggcaggccata tgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaattcaagatgg actaaagacttaaatgttagacctaaaaccataaaaaccctagaagaaaacctaggcaat accattcaggacataggcatgggcaaggacttcatgtctaaaacacaaaaagcaatggca acaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacaacaaaa gaaactaccatcagagtgaacaggcaacctacagaatgggagaaaatttttgcaatctac tcatctgacaaaaggctaatatccagaatctacaaagaactcaaacaaatttacaagaaa aaaacaaacaaccccaccaacaagtgggcaaagtatatgaacagacacttctcaaaagaa gatatttatgcagccaacagacacatgaaaaaatgctcatcatcactggccatcagagaa atgcaaatccaaaccacaatgagataccatctcacatcagttagaatggccattattaaa aagtcaggaaacaacaggtgctag >gi568815588r:30511958_30729705|GENSCAN_predicted_peptide_6|257_aa MVFALTCCLSLELPAVGMLLTATCDGFSIVPPRGFFTSISSYILGLLIAWDNYSLCLLHT SVYCQSPCGDVLAVVYVGLNLRREISARDKFGDEEAKKITRDRPFNAGIQPQGEKLMTRE VSRQVQKMTRDEHMFCLRQALRMKAAGILTLIGCLVTGAESKIYTRCKLAKIFSRAGLDN YWGFSLGNWICMAYYESGYNTTAQTVLDDGSIDYGIFQINSFAWCRRGKLKENNHCHVAC SGPGEKADNGITNHVLL >gi568815588r:30511958_30729705|GENSCAN_predicted_CDS_6|774_bp atggtgtttgctctcacctgttgtctgtccctggagctgcctgctgtaggcatgttactg acagcaacatgtgacggcttctccatcgtgcctccaaggggtttcttcacttcaatttct tcctacatactgggcttgctcatagcatgggacaattattccttgtgtcttttacataca tctgtttattgccagtccccatgtggggatgtcttggcagttgtgtatgttggcctgaac ctcaggagagaaatctcagccagagataaatttggggatgaagaagccaagaagataacc agggacaggcctttcaatgcaggaatccagccccagggagagaaactaatgactagagaa gtttccaggcaagtgcagaaaatgacacgggatgaacatatgttctgtctccggcaggct ttgaggatgaaggctgcgggcattctgaccctcattggctgcctggtcacaggcgccgag tccaaaatctacactcgttgcaaactggcaaaaatattctcgagggctggcctggacaat tactggggcttcagccttggaaactggatctgcatggcgtattatgagagcggctacaac accacagcccagacggtcctggatgacggcagcatcgactacggcatcttccagatcaac agcttcgcgtggtgcagacgcggaaagctgaaggagaacaaccactgccacgtcgcctgc tcaggaccaggtgagaaagccgacaatgggatcaccaaccatgtcttgctttaa >gi568815588r:30511958_30729705|GENSCAN_predicted_peptide_7|1033_aa MKKVNHERVKYHEHALPYPASISKGLGTCSESLKEFSLEDKEQLANHERGIDAQLLVALP KEFSKASQWPRVQIGIVEEKRGGRLEKQTQQEAGGQQGTSQQPAGQSLVRKYSAMFPVCF NLQEEGGVADDSAISNLLWEPVYASTYYRAIPAAHKYLSFVSTNRRVTESRESQMMIEER KQLITVREEAWKTRGRGAANDSTQFTVAGRMVKKGLASPTAITPVASAICGKTRGTTPVS KPLEVGGMHETVLTVTGKSVKELMKPDDDETFAQFYRNVDCNMLRSPVELDEDFDVIFDP YAPKLTSSVAEHKWAVRPKRQVQASKNPLKMLAAREDLLQEYTEQRLNVAFMESKRIKVE KREYLNPHPASVFGNGFFRCFTHCLVFCLVKANNGAQKALMNFSEVTLAGLASKENFSNV SLRSVNLTEQNSNNSAVPFKRLMLLQIKGRRHVQTRLVEPRASALNSGDCFLLLSPHCCF LWVGEFANAIEKAKVAAGDPKEDELYEAAIIETNCIYCLMDDKLVPDDDYWGKIPKCSLL QPKEVLVFDFGSEVYVWHGKEVTLAQRKIEFQLAKHLWNGTFDYENCDINPLDPGECNPL IRRKGQGRPDWAIFGRLTEHNETILFKEKFLDWTELKRPNEKNPGELAQHKEDSRADVKR NNVTRMVSVPQTTAGAILDRVNIGCGCGLVEGHHRRQFEITSVSVDVWYILEFDYSRLPK QSIGQFHEGDAYVVKWKFMVSTADPGSFNFAPRLFVLSSSSGDFAATEFVYPARAPSVVS SMPFLQEDLYSVPQPVPFLDNHHEVCLWQGWWPIENKITGSARICRASDQKSTMETMLQY CKDEASPAAPRSPRPFFPFCMVTWSYVSSGCVQMAQDMEVSNQITLVEDVLAKLCKTIYL LANLLARPLPEGVDPLKLEIYLTDEDFERKDVGLLQTSDNGCTPRIRSERWTCQGKASEA SVPTISTMDALWRSQERKCRDWPGKEIRKRQRRYLCIWRGAAAQTEGCGQQVALLLNAEA KAPTPAPPPASQS >gi568815588r:30511958_30729705|GENSCAN_predicted_CDS_7|3102_bp atgaagaaagtcaatcatgaacgtgtgaaatatcacgaacacgccctcccctaccctgca tcaatcagcaaggggctggggacctgttctgagagcctgaaggagttctcactcgaagat aaagaacagctcgctaaccacgaaagaggaatcgatgctcagcttttagttgcacttcct aaagagttttcaaaggcaagtcagtggcctcgtgtccagattggcattgttgaagaaaag cggggaggaagattggagaaacagactcagcaggaggcaggagggcagcaaggcaccagc cagcagcctgcagggcagtccctcgtcaggaagtactctgccatgttccctgtttgtttt aacctgcaggaagaaggaggagttgcggatgatagtgccatttctaatctgctttgggaa cctgtatatgcttctacttactatcgtgctattcctgctgcccataaatacctgtctttt gtgtcgactaatcggcgggtcacagaaagtcgagagagccaaatgatgattgaggagagg aagcagctcatcactgtgagagaggaggcctggaagacgagaggcagaggagcggccaac gactcgacccagttcactgtggctggcaggatggtgaagaaaggtttggcgtcacctact gccataaccccagtagcctcagccatttgcggtaaaacaagaggcaccacacccgtttcc aaacccctggaagttggcgggatgcacgaaacggtgctcactgtcaccggcaaatctgtg aaggagctgatgaagccggatgatgacgaaacctttgcccaattttaccgcaacgtggat tgtaatatgctgagaagtcctgtggagctggacgaggacttcgatgtcattttcgatcct tatgcacccaaattgacgtcttccgtggccgagcacaagtgggcagttaggcccaagcgc caggttcaggcctccaaaaaccccctgaaaatgctggcggcaagagaagatctccttcag gaatacactgagcagagattaaacgttgccttcatggagtcaaagcggataaaagtagaa aagagagagtatttgaacccccaccctgcttctgtgtttgggaatggcttcttccgatgc tttactcattgcttggttttctgccttgtgaaggccaataatggtgctcagaaggccctc atgaacttctcagaagtcaccctggcgggtttagccagtaaagaaaacttcagcaacgtc agcctgcggagcgtcaacctgacggaacagaactctaacaacagcgccgtgcccttcaag aggctgatgctgttgcagattaaaggaagaagacacgtgcagaccaggctggtggaacct cgagcttcggcgctcaatagtggggactgcttcctcctgctctctccccactgctgcttc ctgtgggtaggagagttcgcaaacgccatagaaaaggcgaaggttgctgctggagaccca aaagaagatgaactctatgaagcagccataatagaaactaactgcatttactgtctcatg gatgacaaacttgttcctgatgacgactactgggggaaaattccaaagtgctctcttctg caacccaaagaggtactggtgtttgattttggtagtgaagtttacgtatggcatgggaaa gaagtcacgttagcacaacgaaaaatagaatttcagctggcaaagcacttatggaatgga acctttgactatgagaactgtgacattaatcccctggatcctggagaatgcaatccgctt atccgcagaaaaggacaggggcggcccgactgggcgatatttgggagacttactgaacac aatgagacgattttgttcaaagaaaagtttctggattggacagaactgaaaagaccaaat gagaagaaccccggggaacttgcccagcacaaggaagactccagggccgatgtcaagcgg aacaacgtgacacggatggtgtccgtgccccagacgacagcaggtgccatcctggacagg gtgaacattggctgtggctgtggcctggtggaaggacaccacaggaggcagtttgagatc accagtgtctccgtggatgtctggtacatcctggaatttgactatagcaggctccccaaa caaagcatcgggcagttccacgagggggatgcctatgtggtcaagtggaagttcatggtg agcacggcagatcccggaagttttaacttcgcgccccgcctgttcgtcctcagcagctcc tctggggatttcgcagccacggagttcgtgtaccctgcccgagccccctctgtggtcagt tccatgcccttcctgcaggaagatctatacagcgtgccccagccagtgcctttccttgac aatcaccacgaggtgtgcctctggcaaggctggtggcccatcgagaacaagatcactggt tctgcccgcatctgccgtgcctctgaccaaaagagcacgatggagaccatgctccagtat tgcaaagatgaggcctcgccagcagctccccggagccctagaccctttttccctttctgc atggtgacatggagctatgtgtcctcaggctgtgtgcagatggcacaggacatggaagtt tccaatcagatcaccctcgtggaagacgtcttagccaagctctgtaaaaccatttacctg ctggccaacctcctggccaggccactcccggagggggtcgatcctctgaagcttgagatc tatctcaccgacgaagacttcgagagaaaggatgttgggctgctccagacctctgacaat ggatgcacccccaggattagatcagaaagatggacctgccaggggaaagcgtcagaggct tctgttcctaccatcagtaccatggatgcgctctggagaagccaggaaaggaaatgcagg gactggccagggaaggaaatacggaagcgacaaaggagatacctttgcatctggcgaggg gcagccgcgcagacggaaggctgtggccagcaggtggcgctgttgctcaacgcagaagcc aaggctccaactccagcgccgcctccggcttcacagtcctag