GENSCAN 1.0    Date run: 4-Nov-116    Time: 19:36:59

Sequence gi568815584r:93083190_93284242 : 201053 bp : 46.36% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 Intr -   3331   3169  163  1  1   72  110    58 0.664   5.95
 1.07 Intr -   4782   4642  141  0  0   68   53    75 0.586   2.55
 1.06 Intr -   9506   9433   74  0  2   60   80   122 0.350   7.73
 1.05 Intr -  19283  19191   93  0  0   48   93    69 0.003   3.34
 1.04 Intr -  26707  26612   96  2  0   71   26   109 0.013   2.98
 1.03 Intr -  29816  29739   78  0  0   70   58    72 0.060   1.92
 1.02 Intr -  31330  31245   86  0  2   38   34    97 0.361  -1.24
 1.01 Init -  31974  31880   95  0  2   73  110   167 0.300  17.25
 1.00 Prom -  43822  43783   40                              -4.86

 2.00 Prom +  47046  47085   40                              -3.96
 2.01 Init +  54041  54043    3  1  0   98   52     0 0.452  -2.60
 2.02 Term +  54897  54986   90  2  0   46   42   238 0.907  12.72
 2.03 PlyA +  55445  55450    6                               1.05

 3.00 Prom +  55947  55986   40                              -1.86
 3.01 Init +  57860  57898   39  1  0   76   79    25 0.115   0.59
 3.02 Intr +  64536  64675  140  2  2   76  119    50 0.647   6.16
 3.03 Intr +  86292  86415  124  0  1   36  101   103 0.617   6.99
 3.04 Intr +  90970  91020   51  0  0   92   92    22 0.167   2.10
 3.05 Term +  97184  97243   60  1  0   91   39    96 0.236   2.80
 3.06 PlyA +  98378  98383    6                              -1.75

 4.05 PlyA -  98959  98954    6                               1.05
 4.04 Term - 101079  99998 1082  1  2   27   47   458 0.431  27.88
 4.03 Intr - 101580 101370  211  0  1   32  113   136 0.462   8.99
 4.02 Intr - 101977 101621  357  1  0   74   94   108 0.354   5.45
 4.01 Init - 116207 116169   39  2  0   58  116    13 0.116   1.29
 4.00 Prom - 120260 120221   40                              -2.86

 5.03 PlyA - 120301 120296    6                               1.05
 5.02 Term - 120635 120499  137  0  2   47   42   145 0.774   3.98
 5.01 Init - 123848 123641  208  2  1  113  101   246 0.997  27.38
 5.00 Prom - 123994 123955   40                             -16.65

 6.00 Prom + 124044 124083   40                             -16.89
 6.01 Init + 124103 124252  150  1  0   66   80   281 0.999  23.14
 6.02 Intr + 128843 128966  124  1  1   53   49    98 0.654   2.56
 6.03 Intr + 135338 135546  209  0  2   49   95   198 0.805  15.40
 6.04 Intr + 136023 136172  150  2  0   42   58   113 0.788   4.06
 6.05 Intr + 137060 137222  163  1  1   91   44    86 0.608   4.05
 6.06 Term + 139124 139194   71  0  2   53   44    57 0.547  -4.10
 6.07 PlyA + 139554 139559    6                              -0.45

 7.02 PlyA - 140335 140330    6                              -0.45
 7.01 Sngl - 140863 140375  489  1  0   32   37   498 0.782  35.16
 7.00 Prom - 145889 145850   40                              -5.56

 8.00 Prom + 146919 146958   40                              -2.36
 8.01 Init + 147378 147386    9  2  0   77  110     3 0.587   1.84
 8.02 Intr + 149180 149416  237  1  0  102   93    86 0.726   8.31
 8.03 Term + 151518 151640  123  2  0   86   48    36 0.239  -2.22
 8.04 PlyA + 153791 153796    6                               1.05

 9.09 PlyA - 154380 154375    6                               1.05
 9.08 Term - 159899 159084  816  2  0  120   54   185 0.879  11.44
 9.07 Intr - 163097 162636  462  2  0  101  103    89 0.772   5.05
 9.06 Intr - 165465 165287  179  1  2   91   91   228 0.950  23.04
 9.05 Intr - 168463 168274  190  1  1   18  110   204 0.824  14.76
 9.04 Intr - 170601 170458  144  0  0   26   90   120 0.660   6.48
 9.03 Intr - 174166 174006  161  2  2   52   92    53 0.586   1.81
 9.02 Intr - 178488 178413   76  0  1   69   99    20 0.601   0.29
 9.01 Intr - 180804 180596  209  1  2   72   89   149 0.931  12.20

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  24574  24733  160  0  1   73   80   114 0.803   9.12
S.002 Term +  26575  26711  137  2  2   38   42   125 0.826   1.08

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584r:93083190_93284242|GENSCAN_predicted_peptide_1|276_aa
MQTFLKGKRVGYWLSEKKIKKLNFQAFAELCRASQQPCKLARVVILGHLSENGDRSAEER
GQCPLVATFLYWLQNAATVEPVTREPRTFGLCRFSVLGFGSLQVTLKATMAELIILQGYF
KVLTFVMGSLKRKGIMALPLEAFAASLQEAVPNGIPGLHLLDASTTLVFQECLQPRLPGL
EPLVSSTQFPHIPSLGFFPAIPIRDPCGHAGLYPVPLGNLRASPVRDREAQQMFDQNQTR
KLSLPVVGGAAAGQRQLNFKGLGSRDQELPPALRPX

>gi568815584r:93083190_93284242|GENSCAN_predicted_CDS_1|828_bp
atgcagacctttctgaaagggaagagagttggctactggctgagcgagaagaaaatcaag
aagctgaatttccaggccttcgccgagctgtgcagagcctcgcagcagccctgcaaattg
gccagggtggtgattctgggccacctgtctgagaacggggaccgatcggcagaggagcgg
gggcagtgccccctggtggccacatttctatactggctgcaaaatgctgcaactgttgag
ccagtgaccagggagccccggacctttggcctttgtagatttagtgttcttggttttggc
agtttgcaagtcaccctgaaggccacgatggctgaactgatcatcttgcagggatatttt
aaagtcctaacattcgtcatgggatccttgaaaagaaaaggaattatggccttgccacta
gaggcctttgccgcctccctgcaggaagctgtgcccaacggcatccctggcctccaccta
ctggatgccagtactactctcgtcttccaagaatgtctgcagccaaggcttccaggcctg
gagcctttggtctccagcacccagttcccccacatcccttccctgggcttctttcccgcc
attcccatcagggacccctgcggccatgctgggctgtacccagtgccccttgggaatctg
cgggccagcccagtgcgtgacagagaagcccagcaaatgtttgaccaaaatcagaccagg
aaactctcgctgcctgtggttggaggtgcagcagcagggcaaagacaactaaactttaag
ggcttgggatccagggaccaggagctgcccccagccctcaggccagnn

>gi568815584r:93083190_93284242|GENSCAN_predicted_peptide_2|30_aa
MPTETAEQLDEAVKYYTLEEIQKHNDSKST

>gi568815584r:93083190_93284242|GENSCAN_predicted_CDS_2|93_bp
atgccaaccgagacggccgagcagctggacgaggccgtgaagtactacaccctagaggag
attcagaagcacaacgacagcaagagcacctga

>gi568815584r:93083190_93284242|GENSCAN_predicted_peptide_3|137_aa
MSMSGKGFWKKGTSTEQRIAIANTIQVFMSLLRKKKPDQSLASFSPNFKLPKKEPIGASW
SEAADLSVSVTALKAARLELLIPPGGLVGSLASGVKLRTFAGKSALCHEAAPVDRPTWPA
QCEDDNQDEGLYDDPLL
>gi568815584r:93083190_93284242|GENSCAN_predicted_CDS_3|414_bp
atgagcatgtctgggaaaggcttctggaaaaagggaacatccacagaacaaagaattgct
attgctaacactatccaggttttcatgtccctcctgagaaaaaagaagccagaccaaagc
ttggcctcttttagtcccaacttcaaactccctaagaaggagcccattggtgcctcctgg
agtgaagctgcagacctttcggtgagtgtgacagctcttaaggcagcacgtctagagtta
ctcattcctcctggtgggctcgtgggctcgctggcttcaggagtgaagctgcggactttc
gcggggaaatcagccctgtgtcatgaggcagccccggtggacaggcccacgtggcctgct
caatgtgaagatgacaaccaggatgaaggcctttatgatgatccacttctttaa

>gi568815584r:93083190_93284242|GENSCAN_predicted_peptide_4|562_aa
MNIARCNFKLIWKRRHAATKAAQAAPGPGLAALQRREPSARQRSPAEPACKAEVPDLTFL
SRGGQPARRPGCRRSRSFRLGPEARAPRPRGPPLMTSDRPRPPRSFPERDQNRWNPGWSR
SSGGAGGGTSLQPCPLVPARDCHRPITSRCRQYHRHLEYRHRSLWPPSVRQSLGEPALPR
LVGARPGTLAAGARTRARALDHELKSIISGTMTLRLLEDWCRGMDMNPRKALLIAGISQS
CSVAEIEEALQAGLAPLGEYRLLGRMFRRDENRKVALVGLTAETSHALVPKEIPGKGGIW
RVIFKPPDPDNTFLSRLNEFLAGEGMTVGELSRALGHENGSLDPEQGMIPEMWAPMLAQA
LEALQPALQCLKYKKLRVFSGRESPEPGEEEFGRWMFHTTQMIKAWQVPDVEKRRRLLES
LRGPALDVIRVLKINNPLITVDECLQALEEVFGVTDNPRELQVKYLTTYQKDEEKLSAYV
LRLEPLLQKLVQRGAIERDAVNQARLDQVIAGAVHKTIRRELNLPEDGPAPGFLQLLVLI
KDYEAAEEEEALLQAILEGNFT

>gi568815584r:93083190_93284242|GENSCAN_predicted_CDS_4|1689_bp
atgaacatagctcgctgcaacttcaaactcatttggaagcgccggcacgccgcgaccaag
gctgcgcaggcggcgccaggcccgggcctcgccgccttgcagcgccgcgagcccagcgcc
cgtcagcggtcgccggcggagcctgcttgcaaagctgaggtcccggatctcaccttcctg
tcccgtggtggccagcccgcccgccgtcccggatgccgccgcagtcggagcttccggctc
gggccggaagcgcgcgcgccccgcccccgaggcccgcccctcatgacgtcagatcgcccc
cgcccgccgcgcagcttccccgagcgagaccaaaacaggtggaatccgggctggagccgg
agctccggcggcgcgggtggcggcacgtccctccagccttgccccctggtgcccgctcgc
gactgtcatcgccccatcacttctcgttgcagacagtaccacaggcacctggagtaccgg
catcggtcgctgtggcccccgagtgtccgtcagagcctaggggagcctgccctcccgcgc
ctcgtcggggcccggccaggcaccttggccgccggcgcacggacgcgggcacgagcacta
gatcacgaacttaagtctattatttcgggcaccatgactttgaggcttttagaagactgg
tgcagggggatggacatgaaccctcggaaagcgctattgattgccggcatctcccagagc
tgcagtgtggcagaaatcgaggaggctctgcaggctggtttagctcccttgggggagtac
agactgcttggaaggatgttcaggagggatgagaacaggaaagtagccttagtagggctt
actgcggagactagtcacgccctggtccctaaggagataccgggaaaagggggtatctgg
agagtgatctttaagccccctgacccagataatacatttttaagcagattaaatgaattt
ttagcgggagagggcatgacagtgggtgagttgagcagagctcttggacatgaaaatggc
tccttagacccagagcagggcatgatcccggaaatgtgggcccctatgttggcacaggca
ttagaggctcttcagcctgccctgcaatgcttgaagtataaaaagctgagagtgttctcg
ggcagggagtctccagaaccaggagaagaagaatttggacgctggatgtttcatactact
cagatgataaaggcgtggcaggtgccagatgtagagaagagaaggcgattgctagagagc
cttcgaggcccagcacttgatgttattcgtgtcctcaagataaacaatcctttaattact
gtcgatgaatgtctgcaggctcttgaggaggtatttggggttacagataatcctagggag
ttgcaggtcaaatatctaaccacttaccagaaggatgaggaaaagttgtcggcttatgta
ctaaggctggagcctttgttacagaagctggtacagagaggagcaattgagagagatgct
gtgaatcaggcccgcctagaccaagtcattgctggggcagtccacaaaacaattcgcaga
gagcttaatctgccagaggatggcccagcccctggtttcttgcagttattggtactaata
aaggattatgaggcagctgaggaggaggaggcccttctccaggcaatattggaaggtaat
ttcacctga

>gi568815584r:93083190_93284242|GENSCAN_predicted_peptide_5|114_aa
MELLGEYVGQEGKPQKLRVSCEAPGDGDPFQGLLSGVAQMKDMVTELFDPLVQGEVQHRV
AAAPDEDLDAIERIVIQQIFILKGDDEDDAEDENNIDNRTNFDGPSAKRPKTPS

>gi568815584r:93083190_93284242|GENSCAN_predicted_CDS_5|345_bp
atggagctgctgggagagtacgtcgggcaggaagggaagccgcagaagctgcgggtgtcc
tgtgaggcgccgggtgacggcgaccctttccagggcctgttgtctggcgtggcccagatg
aaggacatggtaacggaattattcgaccctctggtacagggggaagtgcagcaccgggtg
gcggcggctccagacgaggacttggacgccatagaacgtattgtcatacaacaaatattt
attttaaaaggtgatgatgaagatgatgcagaagatgaaaataacattgataacagaact
aacttcgatggaccatctgcaaaacggccaaaaacaccgtcttaa

>gi568815584r:93083190_93284242|GENSCAN_predicted_peptide_6|288_aa
MAGAEGAAGRQSELEPVVSLVDVLEEDEELENEACAVLGGSDSEKCSYSQDKAKVNSGNK
YNDNFFGLYCICKRPYPDPEDEVRELEVKPGVTKISTEDDGLVRNIDGIGDQEVIKPENG
EHQDSTLKEDVPEQGKDDVREVKVEQNSEPCAGSSSESDLQTVFKNESLNAESKSGCKLQ
ELKAKQLIKKDTATYWPLNWRSKLCTCQDCMKMYGDLDVLFLTDEYDTVLAYENKGKIAQ
ATDRSDPLMDTLSSMNRVQQVELICEYNDLKTELKDYLKRFADEGTVC

>gi568815584r:93083190_93284242|GENSCAN_predicted_CDS_6|867_bp
atggccggagccgagggcgccgctgggcggcagtcggagctggagcccgtggtatcgttg
gtcgacgtccttgaggaggacgaggagctggagaatgaggcgtgcgctgtcctgggcggc
agcgactccgagaagtgctcctactctcaggacaaagcaaaggtaaattctggcaataag
tacaatgacaacttttttggattgtactgcatttgcaagagaccttatcctgatcctgaa
gacgaggtaagagaattggaagttaaacctggggtaaccaaaatatccactgaggatgat
ggattggtgcggaacattgatggaataggtgatcaggaagttatcaaacctgaaaatgga
gagcatcaagatagtaccctcaaagaggatgttccagaacagggaaaggatgatgtccgg
gaggttaaagtagagcagaacagtgaaccatgtgccggctctagttctgaatctgatctc
cagacagtgtttaagaatgaaagcctcaacgcagaatcaaaatctggctgcaaacttcag
gagcttaaagctaagcagcttataaagaaagacactgccacctattggcccctgaactgg
cgtagcaagttgtgtacctgccaagactgtatgaaaatgtatggagatctagatgtctta
ttcctgacagatgaatacgacacagttctggcttatgaaaacaaagggaagattgcccag
gccactgacaggagcgatcccctaatggatacccttagcagcatgaatagagtccagcaa
gtggaactcatttgtgaatacaatgatttgaagactgaacttaaagactatctcaagaga
tttgctgatgaaggcacggtatgttga

>gi568815584r:93083190_93284242|GENSCAN_predicted_peptide_7|162_aa
MRIFAPNHVVAKSRDFWYFVSQLKKMKKSSGEIVYCGQVFEKSPPRVKNFGIWLSYDSRS
GAPNMYREYRDLTTAGAVTQCYREVYGRLAPHLGPFHRDPEGEGDRSQQVPPAGRPSSSS
TTPRSSFPLPHRVLPVSPSHASPPRGPTRSSRGRALARVCPK

>gi568815584r:93083190_93284242|GENSCAN_predicted_CDS_7|489_bp
atgcgaatctttgcgcctaatcacgttgtcgccaagtcccgggacttctggtacttcgta
tctcagttaaaaaagatgaagaagtcttcaggggagattgtctactgtgggcaggtgttt
gagaagtccccaccgcgggtcaagaacttcggcatctggctgagctatgactcccggagc
ggcgcccccaacatgtaccgggaataccgggacctgaccaccgcgggcgctgttacccag
tgctaccgagaggtctatgggcgcctggcaccgcacctgggcccattccatcgagatcct
gaaggtgaaggagatcgcagccagcaagtgccgccagccggccggccatcaagcagttcc
acgactccaagatcaagtttcccgctgccccaccgggtcctgcccgtcagcccaagccac
gcttcaccaccaagaggcccaacacgttcttctaggggcagggccctcgcccgggtgtgc
cccaaataa

>gi568815584r:93083190_93284242|GENSCAN_predicted_peptide_8|122_aa
MAQAGFPGSASSPSLKTPRQASSGHRNTPQSLETRHDSPESKALLDEAQSSGELSEGHSL
VPGRGKGTGLALGGNGRGCTELAALGQQLPPAGLERLESPKFIIQALFHLPLNLSPNHTI
YS

>gi568815584r:93083190_93284242|GENSCAN_predicted_CDS_8|369_bp
atggctcaggcaggctttcctggctccgcctcctcaccctctctgaagaccccccgacag
gccagcagcgggcacaggaacactccccaaagcttggagacaaggcatgacagcccagaa
tccaaggctctgctagatgaggctcaaagcagcggcgagctctccgagggacacagctta
gtcccaggccgggggaaaggcaccgggctggcactgggaggaaatggacgtggttgtacg
gaactggctgcccttggccagcaactccctccagctggcttggaaaggttggagtccccc
aagttcatcatccaggccctctttcatcttcctttaaacctttcccctaatcacaccatc
tactcctaa

>gi568815584r:93083190_93284242|GENSCAN_predicted_peptide_9|745_aa
XCEDIIAESISLDTLIAILKWSSHPYGSKWVHRQALHFLCEEFSQVMTSDVFYELSKDHL
LTAIQSDYLQASEQDILKYLIKWGEHQLMKRIADREPNLLSGTAHSVNKRGVKRRDLDME
ELREILSSLLPFVRIEHILPINSEVLSDAMKRGLISTPPSDMLPTTEGGKSNAWLRQKNA
GIYVRPRLFSPYVEEAKSVLDEMMVEQTDLVRLRMVRMSNVPDTLYMVNNAVPQCCHMIS
HQQISSNQSSPPSVVANEIPVPRLLIMKDMVRRLQELRHTEQVQRAYALNCGEGATVSYE
IQIRVLREFGLADAAAELLQNPHKFFPDERFGDESPLLTMRQPGRCRVNSTPPAETMFTD
LDSFVAFHPPLPPPPPPYHPPATPIHNQLKAGWKQRPPSQHPSRSFSYPCNHSLFHSRTA
PKAGPPPVYLPSVKAAPPDCTSTAGLGRQTVAAAAATTTSTATAAAAAASEKQVRTQPVL
NDLMPDIAVGVSTLSLKDRRLPELAVDTELSQSVSEAGPGPPQHLSCIPQRHTHTSRKKH
TLEQKTDTRENPQEYPDFYDFSNAACRPSTPALSRRTPSPSQGGYFGPDLYSHNKASPSG
LKSAYLPGQTSPKKQEEARREYPLSPDGHLHRQKNEPIHLDVVEQPPQRSDFPLAAPENA
STGPAHVRGRTAVETDLTFGLTPNRPSLSACSSEAPEERSGRRLADSESLGHGAQRNTDL
EREDSISRGRRSPSKPDFLYKKSAL

>gi568815584r:93083190_93284242|GENSCAN_predicted_CDS_9|2238_bp
ngctgtgaggatatcattgctgagagcatctcattagataccttaattgccatcctcaag
tggagttctcatccatatggctctaaatgggtgcaccgacaagctttacatttcctctgt
gaggaattttcccaggtcatgacttcggatgttttttatgaactcagcaaagaccatctg
cttactgctatccagtctgactacctacaggcaagtgaacaagatatccttaaatatctg
attaaatggggagagcatcagttgatgaaaagaatagcagatagagagccaaacttactg
agtggcactgcccatagtgtgaacaaaagaggtgtaaaaagacgggacctggacatggaa
gagctcagagagatcctttcttctctcttaccttttgtgcgaattgaacacatcttacct
ataaacagtgaagtcttaagtgatgcaatgaaaagaggcttgattagtactcctccatca
gatatgcttcctacaacagaaggtgggaagtcaaatgcctggttacggcaaaaaaatgct
ggcatctatgttcgtcctcgactcttctctccctatgtggaagaagcaaagtcagtgcta
gatgagatgatggtggaacaaacggatcttgtgcgcttgcgaatggttagaatgtccaat
gtgccagacacgctctacatggtcaataatgccgtgccacagtgttgtcacatgatcagc
caccagcagatcagcagcaaccagtcaagccctccttcagttgtagccaacgaaattcca
gttcctcgtctcctcattatgaaagacatggtcagacgactgcaggaactgcggcacacg
gagcaggtgcagagggcctatgccctgaactgcggggaaggcgccactgtcagctatgaa
attcagattcgagtgctaagagagtttggtcttgcagatgctgctgcagagctgttgcag
aatcctcacaaattctttcctgatgaacgttttggggatgaaagtccactcttgacaatg
agacagcctgggagatgtcgcgtaaacagtacacctcctgcagaaaccatgtttacagat
ctggactcttttgtggccttccatccacccttgccccctccaccacctccctaccacccc
ccagctaccccaatccataaccaactcaaagcaggctggaagcaaagacctcccagtcag
cacccttcacgttcattttcttatccctgtaatcattcgctgttccactccagaacagct
cctaaagctggccctcccccagtctacttgccgagtgtgaaagctgcaccgcctgattgt
accagcactgcaggactgggcagacagacggtggctgctgctgccgccaccaccacctca
acagcaacagcagcagcagcagcagcatctgagaagcaagtgcgaacacaacctgtgctg
aatgatctgatgccagacatcgcggtgggtgtgtccacactgtcactcaaggacaggagg
cttccagagcttgctgtagacacagaattaagccagtcagtttctgaagcaggaccaggg
cctccccagcatctgtcgtgtattccacagagacatacacacacttctcggaaaaaacac
acactagagcaaaaaacagacaccagagaaaatccacaggaatatccggatttctatgac
ttctcaaatgctgcttgcagaccttctactcctgctctcagcagacgcaccccttcccct
tcgcaaggtggatattttggtcccgatctgtacagccacaataaggcatcaccaagtggc
ttaaagtcagcctacctacctggtcagacgtctcctaaaaaacaggaagaagctaggaga
gaatatccactttcccctgacgggcatctacacagacaaaagaatgagccgatacacctg
gatgtcgttgagcaacctccccagcggtcagactttcctttggcagccccagaaaatgct
agtaccggtccagcccatgtcaggggacgaactgcagtagaaactgacttgacttttggg
ctgactcctaacagaccttcactttctgcatgtagctctgaagctcccgaagagagatcc
ggtagaagactggcagacagtgagtccctgggccatggagctcagagaaatacagatttg
gaaagggaagattcaataagcagaggaaggaggtcaccaagcaagccggacttcctctac
aaaaagtctgccctctga